Databricks

•

8 min read

•March 2, 2026

Real-Time Mode: Ultra-low latency streaming on Spark APIs without a second engine

Summary

The article introduces Real-Time Mode (RTM) in Apache Spark, which unifies offline training and ultra-low-latency online feature engineering into a single engine, eliminating the need for separate systems like Apache Flink. It highlights the architectural changes that enable sub-second latencies, such as continuous data flow, pipeline scheduling, and streaming shuffle. The performance analysis demonstrates that Spark RTM can process events significantly faster than Flink, making it suitable for applications like fraud detection and real-time analytics. The article emphasizes the operational simplicity and reduced complexity in managing real-time applications, allowing teams to focus on business use cases rather than infrastructure management.

Key Learnings

1Real-Time Mode in Apache Spark allows for ultra-low latency processing without the need for additional systems, simplifying architecture.
2Key innovations in RTM include continuous data flow, pipeline scheduling, and streaming shuffle, which enhance performance.
3The unified API in Spark RTM minimizes logic drift between training and inference, ensuring consistency in machine learning applications.
4Real-time applications can be developed and scaled more efficiently within a single environment, reducing operational complexity.
5Early adopters of RTM have successfully implemented it for various low-latency applications, demonstrating its practical benefits.

Who Should Read This

Senior Data Engineers implementing real-time data processing solutions using Apache Spark

Test Your Knowledge

What are the architectural changes introduced in Spark RTM that contribute to its low-latency performance?

How does RTM minimize logic drift between model training and inference in real-time machine learning applications?

What trade-offs exist when transitioning from traditional Spark processing to Real-Time Mode?

In what scenarios might a team still consider using a specialized system like Flink despite the capabilities of Spark RTM?

How does the continuous data flow mechanism in RTM differ from traditional batch processing methods?

Topics

Apache Spark Streaming Etl Pipelines Data Quality Real-time Processing

Read Full Article at Databricks

More from Databricks Engineering

View Databricks engineering blogs →

Databricks

Transforming Healthcare Referrals with Fivetran, Agentic AI, and Databricks Genie

The article outlines how healthcare organizations can address fragmented data challenges by leveraging Fivetran for seamless data extraction and Databricks for data unification and AI deployment. It...

Databricks

17m

Decoupled by Design: Billion-Scale Vector Search

The article discusses the challenges and solutions in building a billion-scale vector search system at Databricks. It highlights the limitations of traditional vector databases that couple storage...

Databricks

The Professional Impact of Becoming Databricks Certified

The article highlights the significance of Databricks certifications in enhancing professional credibility and career opportunities for data and AI practitioners. It emphasizes that these...

Databricks

Introducing Kasal

Kasal is a low-code platform developed by Databricks Labs for designing, deploying, and orchestrating agentic AI systems. It provides a visual interface that allows users, regardless of their...

Databricks

13m

Business Intelligence Analytics: A Complete Guide for the AI Era

The article discusses the evolution of business intelligence (BI) analytics, emphasizing the need for organizations to bridge the gap between data collection and actionable insights. It outlines the...

Real-Time Mode: Ultra-low latency streaming on Spark APIs without a second engine

Summary

Key Learnings

Who Should Read This

Test Your Knowledge

Topics

More articles about Apache Spark

Activate first-party data with Meta Conversions API on Databricks

Spark Declarative Pipelines: Why Data Engineering Needs to Become End-to-End Declarative

Drastically Reducing Out-of-Memory Errors in Apache Spark at Pinterest

Why Apache Spark Real-Time Mode Is A Game Changer for Ad Attribution

Next Generation DB Ingestion at Pinterest

More from Databricks Engineering

Transforming Healthcare Referrals with Fivetran, Agentic AI, and Databricks Genie

Decoupled by Design: Billion-Scale Vector Search

The Professional Impact of Becoming Databricks Certified

Introducing Kasal

Business Intelligence Analytics: A Complete Guide for the AI Era

Related topics