AWS 计划在未来 12 个月内部署超过 100 万颗 Nvidia GPU

AWS to Deploy Over One Million Nvidia GPUs in Major AI Infrastructure Push

March 16, 2026

In a decisive move to solidify its position in the competitive artificial intelligence cloud market, Amazon Web Services (AWS) has announced plans to deploy more than one million Nvidia GPUs over the coming year. This massive infrastructure expansion underscores the intense demand for high-performance computing resources needed to train and run advanced AI models, a sector where cloud providers are racing to capture market share.

The deployment will incorporate Nvidia's latest and upcoming GPU architectures, including the currently available Blackwell Ultra chips and the future Rubin platform set to launch later this year. AWS stated it already offers the broadest collection of Nvidia GPU-based instances among cloud providers and emphasized its commitment to providing cutting-edge hardware. The scale of this planned rollout, exceeding one million units, represents one of the largest single commitments to GPU capacity by a hyperscaler to date.

This announcement follows recent comments by AWS CEO Matt Garman, who in February highlighted the sustained demand for AI compute, noting that the company was still operating six-year-old Nvidia A100 servers as there remained "so much more demand than supply." The swift integration of Blackwell GPUs since their general availability in December and the planned adoption of Rubin illustrate AWS's strategy to rapidly refresh its infrastructure with the most powerful silicon available.

Concurrently, AWS reaffirmed its investment in developing its proprietary AI accelerators, named Trainium, signaling a dual-path strategy of leveraging leading third-party chips while building in-house alternatives. This approach was bolstered by a significant external commitment in February, when OpenAI announced it would spend $2 billion on Trainium compute and GPUs on AWS, following a separate $50 billion investment from Amazon.

The implications of this GPU deployment are significant for the broader AI industry. By dramatically scaling its available AI-optimized compute, AWS is positioning itself to attract and retain large enterprise customers and AI startups requiring vast, reliable resources. This move intensifies the competitive dynamics with other cloud giants, potentially accelerating the pace of AI innovation by making powerful training infrastructure more accessible, albeit at an unprecedented scale and cost.

Source: datacenterdynamics

Read Also
AWS 计划在未来 12 个月内部署超过 100 万颗 Nvidia GPU
Bridge Data Centres Announces Multi-Billion Dollar Investment in Singapore's Digital Infrastructure
Data center pitched in Northumberland, UK
**Nscale in Advanced Talks to Acquire Massive 8GW AI Data Center Campus in West Virginia**
Underwater data center powered by tidal energy proposed off the coast of Maine
Microsoft Breaks Ground on Major AI and Cloud Data Center in Germany's Bergheim
QTS targets two data center campuses in Georgia
Russian Hosting Provider RUVDS Deploys Experimental Data Center in Antarctica
Korea Zinc in talks with US tech firms to extract rare earth minerals from old data center kit - report
DCX Expands Liquid Cooling Portfolio with New CDUs Ranging from 600kW to 2.6MW

Research