Building Dash: How RAG and AI agents help us meet the needs of businesses

Summary

The article outlines the challenges faced in building Dropbox Dash, a universal search and knowledge management tool designed for knowledge workers. It highlights issues such as data diversity, fragmentation, and modalities that complicate AI processing in business environments. The solution involves leveraging retrieval-augmented generation (RAG) to enhance the accuracy and relevance of responses to user queries, while also employing AI agents for multi-step orchestration of complex tasks. The article details the design decisions made to optimize the retrieval system and the evaluation of various large language models (LLMs) to ensure effective performance.

Key Learnings

1Understanding data diversity, fragmentation, and modalities is crucial for developing effective AI solutions in business contexts.
2Retrieval-augmented generation (RAG) combines external information retrieval with generative models to provide contextually relevant answers.
3The choice of retrieval system significantly impacts the user experience, balancing latency, quality, and data freshness.
4AI agents can autonomously break down complex queries into manageable steps, enhancing the capability of AI systems in business applications.
5Being model agnostic allows flexibility in adapting to advancements in large language models, which is essential for maintaining competitive AI solutions.

Who Should Read This

Senior AI Engineers developing enterprise-level AI solutions that require advanced data retrieval and processing capabilities.

Test Your Knowledge

What are the trade-offs between using embedding-based semantic searches versus traditional lexical-based systems in retrieval systems?

How does data fragmentation impact the efficiency of information retrieval in AI applications?

What criteria were used to evaluate the performance of different large language models for the Dash product?

Why is it important for AI agents to have a structured planning and execution process when handling complex user queries?

What challenges arise when integrating multiple data modalities into a single AI solution, and how can they be addressed?

Topics

Retrieval Augmented Generation Generative AI Machine Learning Large Language Models

Read Full Article at Dropbox

Building Dash: How RAG and AI agents help us meet the needs of businesses

Summary

Key Learnings

Who Should Read This

Test Your Knowledge

Topics

More articles about Retrieval Augmented Generation

Unified Context-Intent Embeddings for Scalable Text-to-SQL

Scaling Search Relevance: Augmenting App Store Ranking with LLM-Generated Judgments

Using LLMs to amplify human labeling and improve Dash search relevance

Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

Engineering VP Josh Clemm on how we use knowledge graphs, MCP, and DSPy in Dash

More from Dropbox Engineering

Using LLMs to amplify human labeling and improve Dash search relevance

How low-bit inference enables efficient AI

Insights from our executive roundtable on AI and engineering productivity

Engineering VP Josh Clemm on how we use knowledge graphs, MCP, and DSPy in Dash

Inside the feature store powering real-time AI in Dropbox Dash

Related topics