Learn how Meta, Alibaba, ASOS, and Kuaishou Technology are delivering recommendations at scale (at GTC Spring 2023)

Published in

NVIDIA Merlin

4 min readMar 8, 2023

Over the last couple of months, the Merlin team has been all about aligning the features of our software with customer needs and developing world-class support for session-based recommendations.

Additionally, we have worked hard to bring you updated tutorials to make jumping into using our framework even easier!

In the upcoming GTC (join us online for free, March 20–23, 2023), several companies using NVIDIA GPUs and NVIDIA software will share with you how they are meeting business needs by accelerating and expanding their recommender system workflows!

GTC Personalization with NVIDIA Merlin

Before we get started — exciting news from the Merlin Team!

The GTC web portal is now using the Merlin Framework to recommend talks! This is a session-based solution that leverages Large Language Models for title and abstract preprocessing.

Please see the results in the screenshot below. If you would like to learn more about how we are doing recommendations ourselves, please tune in to a talk on this subject by our colleagues.

How are companies using Merlin in their recommender system pipelines?

Fast and Scalable Training of Deep Learning Recommendation Models- Sarunya Pumma is a software engineer in the AI system Co-Design at Meta. In this talk, she outlines the importance of GPU in powering Meta’s key applications, the unique computational challenges associated with these personalization and recommendation tasks, and how the kernel library, FBGEMM-GPU, solves them with various GPU optimization techniques.

Serving Large Recommender Models with 10x Performance Gain — Xiao Liang, a Software Architect at Kuaishou Technology shares how using various techniques (including Tensor Core and GPU-based caching) Kuaishou Technology got an average 10x performance gain in their mainstream models.

Implementing Model Serving at Scale — a team of machine learning engineers from ASOS (Rick Bruins and Neha Patel) will walk us through how to serve multiple models at scale with Triton, A/B test models, serve ensemble models, and monitor progress in production. They will also discuss the MLOps processes and the importance of a cross-functional approach from concept to deployment, and share performance results to illustrate the impact of this approach.

DeepRec: Toward High-Performance Recommendation Deep Learning Framework with GPU Acceleration — Tongxuan Liu, Staff Engineer at Alibaba and Shijie Liu from NVIDIA will introduce DeepRec to address the challenges of effectively and efficiently training Deep Learning models. The solutions that will be covered include a graph-based GPU memory allocator, a hybrid distributed training framework, and a multi-stream/CUDA Graph-based GPU runtime.

Summary

The NVIDIA GTC Spring 2023 is right around the corner! You can register online for free to reserve your spot here.

The conference will feature a lineup of great speakers including Demis Hassabis (DeepMind), Ilya Sutskever (OpenAI), Anima Anandkumar (NVIDIA), and of course Jensen Huang, the CEO of NVIDIA, who will deliver the keynote on the broader state of GPU acceleration and the technical breakthroughs happening now across multiple industries.

See you at the GTC!

And for further details on how we personalize GTC, please check out our blog post about the email use-case!

Learn how Meta, Alibaba, ASOS, and Kuaishou Technology are delivering recommendations at scale (at GTC Spring 2023)

GTC Personalization with NVIDIA Merlin

How are companies using Merlin in their recommender system pipelines?

Other talks and tutorials from the Merlin team

Summary

Written by Radek Osmulski