Atlassian

•

13 min read

•March 6, 2026

Scaling Jira cloud Migrations, One Bottleneck at a Time

Summary

The article chronicles the Jira Migrations team's journey in scaling their migration platform from handling 20,000 to 50,000 Monthly Paid Enabled Users (PEUs). It discusses the transition from an API-driven architecture to a Kafka-based ETL model, highlighting the challenges faced, such as API timeouts and database lock contentions. The team implemented a 'pull-based' model to enhance throughput and avoid overloading the target system. They also optimized various aspects of the migration process, including worker node configurations, polling timeouts, and entity processing strategies, ultimately achieving a significant increase in migration throughput and reliability for large-scale customers.

Key Learnings

1Transitioning from a push-based to a pull-based architecture can significantly improve system throughput and reduce bottlenecks.
2Optimizing worker node configurations and autoscaling rules is critical for maintaining high throughput during migrations.
3Addressing misconfigurations in timeout settings can lead to immediate performance improvements in data processing.
4Implementing micro-batching and per-entity parallel processing can enhance efficiency and reduce network overhead.
5Understanding the distribution of project sizes is essential for optimizing concurrency and resource allocation during migrations.

Who Should Read This

Senior Software Engineers specializing in distributed systems and data migration strategies, particularly those involved in scaling cloud-based applications.

Test Your Knowledge

What are the trade-offs between a push-based and a pull-based migration architecture in terms of throughput and system load?

How did the team identify and resolve the issue of database lock contention during the migration process?

What specific metrics were used to benchmark the performance of the new migration architecture compared to the old one?

In what ways did the team ensure that the migration system could handle the increased concurrency required for 50K-scale migrations?

What lessons were learned from the initial performance benchmarks that informed subsequent architectural decisions?

Topics

Backpressure High Availability Load Shedding Replication Service Discovery

Read Full Article at Atlassian

More from Atlassian Engineering

View Atlassian engineering blogs →

Atlassian

14m

How we catch and mitigate performance regressions at scale in Jira Cloud

The article discusses the complexities of detecting and mitigating performance regressions in Jira Cloud, a multi-tenant product. It highlights the challenges posed by diverse tenant configurations...

Atlassian

Get started on your work 30% faster with Rovo in Jira

The article discusses the implementation and analysis of Rovo, an AI tool integrated within Jira, aimed at enhancing user productivity. It presents a quasi-experimental study comparing two cohorts of...

Atlassian

How Rovo solves search challenges through entity linking

The article discusses how Atlassian addresses search challenges through advanced entity linking, transforming unstructured text into actionable knowledge. It highlights the importance of accurately...

Atlassian

23m

How We Unlocked Performance at Scale with Jira Platform

The article discusses the significant rearchitecture of the Jira Cloud platform, transitioning from a single-tenant database to a cloud-native, multi-tenant architecture designed for scalability,...

Atlassian

11m

Mobbing with AI

The article explores the integration of AI tools into mob programming to enhance software development efficiency without sacrificing code quality. It details a collaborative process where teams...

Scaling Jira cloud Migrations, One Bottleneck at a Time

Summary

Key Learnings

Who Should Read This

Test Your Knowledge

Topics

More articles about Backpressure

From Static Rate Limiting to Adaptive Traffic Management in Airbnb’s Key-Value Store

Behind the Streams: Real-Time Recommendations for Live Events Part 3

More from Atlassian Engineering

How we catch and mitigate performance regressions at scale in Jira Cloud

Get started on your work 30% faster with Rovo in Jira

How Rovo solves search challenges through entity linking

How We Unlocked Performance at Scale with Jira Platform

Mobbing with AI

Related topics