Universal User Modeling (UUM): A Foundation Model for User Understanding at Snapchat
Read Full ArticleSummary
The article discusses Universal User Modeling (UUM) at Snapchat, a foundational model designed to enhance user understanding across various product surfaces. UUM captures user behaviors over time by generating long-term embeddings that integrate cross-domain signals, thereby improving personalization in recommendations. The architecture utilizes sequence encoders, specifically transformers, to model user interactions, while a robust data pipeline leverages Spark and Iceberg for efficient data handling. The model is trained with a multi-task objective to predict user events, ensuring a comprehensive representation of user preferences and behaviors.
Key Learnings
- 1UUM enhances user modeling by integrating cross-surface signals, allowing for a more holistic understanding of user behavior.
- 2The architecture employs sequence encoders, particularly transformers, to effectively model user interaction sequences.
- 3A flexible data pipeline using Spark and Iceberg facilitates the aggregation and processing of user engagement data across multiple domains.
- 4The model's multi-task training approach reduces noise and improves the robustness of user preference representations.
- 5UUM's embeddings are designed to be reusable across different product surfaces, promoting efficiency in user personalization.
Who Should Read This
Senior Machine Learning Engineers designing cross-domain user modeling systems
Test Your Knowledge
What are the trade-offs of using dedicated sequence encoders for each domain versus a single cross-domain encoder?
How does the UUM model ensure compliance with privacy policies while processing user data?
What challenges arise in the data pipeline when aggregating user engagement events from multiple domains?
Why is it important to prioritize high-intent events in the user action sequence modeling?
How does the multi-task objective in training the UUM model contribute to better user preference representation?
Topics
More articles about Machine Learning
Explore Machine Learning engineering →Decoupled by Design: Billion-Scale Vector Search
The article discusses the challenges and solutions in building a billion-scale vector search system at Databricks. It highlights the limitations of traditional vector databases that couple storage...
Introducing Kasal
Kasal is a low-code platform developed by Databricks Labs for designing, deploying, and orchestrating agentic AI systems. It provides a visual interface that allows users, regardless of their...
Business Intelligence Analytics: A Complete Guide for the AI Era
The article discusses the evolution of business intelligence (BI) analytics, emphasizing the need for organizations to bridge the gap between data collection and actionable insights. It outlines the...
Engineering Platform Trust: Cutting Customer Case Volume 20x with Petabyte-Scale Health Signals
The article details the development of a Technical Health Score system at Salesforce, aimed at quantifying platform trust through analytics pipelines that handle petabytes of telemetry data. By...
Building What’s Next. Together. Introducing the Brickbuilder Partner Network for the Agentic AI Era
The Brickbuilder Partner Network is a newly established global partner program aimed at fostering growth and innovation among consulting firms, independent software vendors (ISVs), and data providers...
More from Snap (Snapchat) Engineering
View Snap (Snapchat) engineering blogs →Spectacles - EyeConnect
The article discusses EyeConnect, a feature designed to facilitate shared augmented reality experiences by allowing users to connect their Spectacles through a novel motion tracking algorithm. Unlike...
From Monolith to Multicloud Micro-Services: Inside Snap’s Service Mesh - Snap Engineering
The article outlines Snap Engineering's transition from a monolithic application architecture to a microservices architecture deployed across multiple cloud providers, specifically AWS and Google...
Don't Rewrite Your App, Unless You Have To - Snap Engineering
The article discusses the Snapchat Engineering team's experience in rewriting their Android app to enhance performance and reduce bugs. It outlines the challenges faced due to the app's complexity...
Making The Most of a Rewrite - Snap Engineering
The article outlines the process and considerations involved in rewriting the Snapchat application, focusing on architectural improvements to enhance performance and maintainability. It emphasizes...
Device-Distributed Machine Learning - Snap Engineering
The article presents Device-Distributed Machine Learning (DDML), a framework developed by Snapchat that enables training machine learning models directly on client devices while preserving user...