Unifying Ads Engagement Modeling Across Pinterest Surfaces
Read Full ArticleSummary
The article presents a comprehensive approach to unify ads engagement modeling across different surfaces at Pinterest, addressing the challenges posed by previously independent models. It outlines the fragmentation in user sequence modeling, feature representation, and training configurations that led to inefficiencies. The authors describe their strategy for creating a unified engagement framework, emphasizing the importance of maintaining operational safety, iterative development, and surface-specific calibration. Key architectural changes include the integration of multi-task learning and surface-specific exports, which enhance flexibility and performance while reducing operational costs. The article concludes with a discussion on future milestones aimed at further unifying the engagement model.
Key Learnings
- 1Unifying models can significantly reduce operational inefficiencies and improve iteration velocity by consolidating similar functionalities.
- 2Surface-specific calibration is crucial for accurate predictions, as different surfaces may have distinct traffic distributions.
- 3Implementing a multi-task learning design allows for flexibility in adapting surface-specific features while benefiting from shared learning.
- 4Efficiency improvements, such as request-level broadcasting and simplified compute paths, are essential to manage infrastructure costs effectively.
- 5Strategic architectural changes can lead to better performance metrics without compromising on cost-effectiveness.
Who Should Read This
Senior Machine Learning Engineers focusing on model optimization and scalability in ad tech environments.
Test Your Knowledge
What are the key architectural principles that guided the unification of the engagement models?
How does surface-specific calibration improve click-through rate (CTR) predictions compared to a global calibration approach?
What trade-offs were considered when merging features and modules from the separate models?
In what ways did the unified model enhance representation learning across different surfaces?
What specific efficiency optimizations were implemented to reduce infrastructure costs during model serving?
Topics
More articles about Machine Learning
Explore Machine Learning engineering →Decoupled by Design: Billion-Scale Vector Search
The article discusses the challenges and solutions in building a billion-scale vector search system at Databricks. It highlights the limitations of traditional vector databases that couple storage...
Introducing Kasal
Kasal is a low-code platform developed by Databricks Labs for designing, deploying, and orchestrating agentic AI systems. It provides a visual interface that allows users, regardless of their...
Business Intelligence Analytics: A Complete Guide for the AI Era
The article discusses the evolution of business intelligence (BI) analytics, emphasizing the need for organizations to bridge the gap between data collection and actionable insights. It outlines the...
Engineering Platform Trust: Cutting Customer Case Volume 20x with Petabyte-Scale Health Signals
The article details the development of a Technical Health Score system at Salesforce, aimed at quantifying platform trust through analytics pipelines that handle petabytes of telemetry data. By...
Building What’s Next. Together. Introducing the Brickbuilder Partner Network for the Agentic AI Era
The Brickbuilder Partner Network is a newly established global partner program aimed at fostering growth and innovation among consulting firms, independent software vendors (ISVs), and data providers...
More from Pinterest Engineering
View Pinterest engineering blogs →Unified Context-Intent Embeddings for Scalable Text-to-SQL
The article outlines Pinterest's evolution from basic Text-to-SQL systems to a sophisticated Analytics Agent that leverages unified context-intent embeddings for enhanced query understanding and SQL...
Bridging the Gap: Diagnosing Online–Offline Discrepancy in Pinterest’s L1 Conversion Models
The article discusses the challenges faced by Pinterest in reconciling offline and online performance metrics of their L1 conversion models. It highlights the discrepancies observed between strong...
Piqama: Pinterest Quota Management Ecosystem
The article introduces Piqama, Pinterest's comprehensive quota management ecosystem designed to oversee resource quotas across various systems. It outlines the architecture of Piqama, emphasizing its...
Drastically Reducing Out-of-Memory Errors in Apache Spark at Pinterest
This article details Pinterest's approach to significantly reduce out-of-memory (OOM) errors in their Apache Spark applications through a feature called Auto Memory Retries. By automatically...
GPU-Serving Two-Tower Models for Lightweight Ads Engagement Prediction
The article presents a significant advancement in Pinterest's ads recommendation system through the introduction of a GPU-serving two-tower model for lightweight ranking. This model architecture...