Announcing Amazon EC2 G7e instances accelerated by NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs

Summary

The article announces the availability of Amazon EC2 G7e instances, which are powered by NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs. The instances are designed for generative AI inference and deliver up to 2.3 times the inference performance of the previous-generation G6e instances, with double the GPU memory and higher memory bandwidth. They support multi-GPU configurations and higher-bandwidth networking, which also makes them suitable for demanding spatial and scientific computing applications. The article also outlines the specifications and capabilities of each instance size.

Key Learnings

1. G7e instances deliver up to 2.3 times the inference performance compared to G6e instances, making them ideal for generative AI workloads.
2. The NVIDIA RTX PRO 6000 GPUs in G7e instances offer double the GPU memory and significantly increased memory bandwidth, allowing for larger models to be run efficiently.
3. Support for NVIDIA GPUDirect P2P enables reduced latency in multi-GPU workloads, enhancing performance for complex computations.
4. G7e instances provide four times the networking bandwidth compared to G6e, facilitating better performance for multi-node workloads.
5. The instances can be utilized with AWS Deep Learning AMIs and are compatible with various AWS services for a streamlined machine learning workflow.
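
To make the memory point concrete, here is a rough sizing sketch of how doubling per-GPU memory changes the number of GPUs needed just to hold a model's weights. Everything in it is an assumption for illustration, not a figure from the article: the helper names `weights_gib` and `min_gpus` are hypothetical, the 20% overhead factor is a placeholder for KV cache and activations, and the 48 and 96 GiB capacities are stand-in values chosen only because one is double the other.

```python
import math


def weights_gib(params_billion: float, bytes_per_param: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    return params_billion * 1e9 * bytes_per_param / 2**30


def min_gpus(params_billion: float, bytes_per_param: float,
             gpu_mem_gib: float, overhead: float = 1.2) -> int:
    """Smallest GPU count whose combined memory holds weights * overhead.

    `overhead` is a hypothetical allowance for KV cache, activations and
    runtime buffers; real requirements depend on the serving stack.
    """
    needed = weights_gib(params_billion, bytes_per_param) * overhead
    return math.ceil(needed / gpu_mem_gib)


if __name__ == "__main__":
    # Illustrative only: a 70B-parameter model at 2 bytes per parameter
    # (FP16/BF16) on placeholder per-GPU capacities of 48 and 96 GiB.
    for mem in (48, 96):
        print(f"{mem} GiB per GPU -> at least {min_gpus(70, 2, mem)} GPU(s)")
```

Under these assumptions the same model needs half as many GPUs when per-GPU memory doubles. Fewer GPUs per model also means less cross-GPU traffic, which is where the GPUDirect P2P support mentioned above becomes relevant for models that still have to be sharded.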

Who Should Read This

Senior Cloud Engineers implementing high-performance generative AI workloads on AWS infrastructure

Test Your Knowledge

What are the key performance improvements of the G7e instances compared to the G6e instances?

How does the support for NVIDIA GPUDirect P2P impact the performance of multi-GPU workloads?

What considerations should be made when selecting instance sizes for specific generative AI tasks?

In what scenarios would the enhanced networking capabilities of G7e instances be critical for performance?

How can the increased GPU memory of G7e instances enable the use of larger models in machine learning applications?

Topics

AWS, AWS EC2, Nvidia, Elastic Fabric Adapter, Generative AI
