Accelerate large-scale AI applications with the new Amazon EC2 P6-B300 instances

Summary

The article introduces the Amazon EC2 P6-B300 instances, powered by NVIDIA Blackwell Ultra GPUs, designed for high-performance AI applications. These instances provide significant enhancements in networking bandwidth and GPU memory, making them suitable for training and serving large-scale AI models. With features like 6.4Tbps Elastic Fabric Adapter networking and 2.1TB of GPU memory, the P6-B300 instances facilitate efficient model training and reduce communication overhead, particularly for complex models such as Mixture of Experts. The instances are now available in the US West (Oregon) AWS Region, with flexible pricing options.

Key Learnings

1The P6-B300 instances offer 2 times more networking bandwidth and 1.5 times more GPU memory compared to previous generations, enhancing performance for large-scale AI workloads.
2Utilizing the Elastic Fabric Adapter (EFA) allows for efficient communication across large GPU clusters, critical for distributed training of AI models.
3The integration of NVIDIA GPUDirect Storage with EFA can achieve up to 1.2Tbps throughput, optimizing data loading for AI applications.
4The instances support a variety of high-performance storage options, including Amazon FSx for Lustre and Amazon S3, tailored for different price-performance needs.
5The specifications of the P6-B300 instances make them ideal for organizations working with trillion-parameter models requiring extensive compute and memory resources.

Who Should Read This

Senior Cloud Engineers implementing large-scale AI solutions on AWS infrastructure

Test Your Knowledge

What are the specific advantages of using Elastic Fabric Adapter (EFA) in the context of distributed AI training?

How does the increase in GPU memory impact the performance of large-scale AI models, particularly in terms of model sharding?

What considerations should organizations take into account when choosing between Amazon FSx for Lustre and Amazon S3 for their AI workloads?

In what scenarios might the P6-B300 instances outperform previous generations in terms of cost-effectiveness for AI applications?

How do the architectural features of the AWS Nitro System contribute to the security and performance of the P6-B300 instances?

Topics

AWS AWS EC2 AWS Lambda Amazon S3 Elastic Fabric Adapter

Read Full Article at AWS

More from AWS Engineering

View AWS engineering blogs →

AWS

Accelerate large-scale AI applications with the new Amazon EC2 P6-B300 instances

Summary

Key Learnings

Who Should Read This

Test Your Knowledge

Topics

More articles about AWS

Complexity is a choice. SASE migrations shouldn’t take years.

AWS Weekly Roundup: Amazon Connect Health, Bedrock AgentCore Policy, GameDay Europe, and more (March 9, 2026)

Native .NET Buildpack Support is Now Available on App Platform

Introducing OpenClaw on Amazon Lightsail to run your autonomous private AI agents

See risk, fix risk: introducing Remediation in Cloudflare CASB

More from AWS Engineering

AWS Weekly Roundup: Amazon Connect Health, Bedrock AgentCore Policy, GameDay Europe, and more (March 9, 2026)

Introducing OpenClaw on Amazon Lightsail to run your autonomous private AI agents

AWS Weekly Roundup: OpenAI partnership, AWS Elemental Inference, Strands Labs, and more (March 2, 2026)

AWS Security Hub Extended offers full-stack enterprise security with curated partner solutions

Transform live video for mobile audiences with AWS Elemental Inference

Related topics