Latest

xAI’s Colossus 2: The First Gigawatt-Scale AI D...

September 18, 2025

Much has been written about xAI’s Colossus 1. The Memphis build belongs in the history books: the largest AI-training cluster, assembled from scratch in 122 days. With roughly 200,000 H100/H200s...

xAI’s Colossus 2: The First Gigawatt-Scale AI D...

September 18, 2025

Much has been written about xAI’s Colossus 1. The Memphis build belongs in the history books: the largest AI-training cluster, assembled from scratch in 122 days. With roughly 200,000 H100/H200s...

Another Giant Leap: Rubin CPX—Specialized Accel...

September 12, 2025

NVIDIA introduced Rubin CPX, a single-die accelerator aimed squarely at the prefill phase of inference. It prioritizes compute FLOPs over memory bandwidth—complementing decode-oriented parts—and unlocks the full promise of disaggregated serving...

Another Giant Leap: Rubin CPX—Specialized Accel...

September 12, 2025

NVIDIA introduced Rubin CPX, a single-die accelerator aimed squarely at the prefill phase of inference. It prioritizes compute FLOPs over memory bandwidth—complementing decode-oriented parts—and unlocks the full promise of disaggregated serving...

TSMC Continued Production and Supply Implicatio...

September 10, 2025

All in with Huawei's Chips Compute is the lifeblood of AI, and national strategies increasingly revolve around securing supply. The United States currently leads with well over 70% of deployed...

TSMC Continued Production and Supply Implicatio...

September 10, 2025

All in with Huawei's Chips Compute is the lifeblood of AI, and national strategies increasingly revolve around securing supply. The United States currently leads with well over 70% of deployed...

AWS and Anthropic Plan Multi-Gigawatt Trainium ...

August 22, 2025

AWS’s AI Resurgence: Anthropic, Trainium, and Multi-Gigawatt Buildout Overview Two-and-a-half years after warnings about a potential “cloud crisis” at Amazon Web Services (AWS), evidence now points to a different trajectory....

AWS and Anthropic Plan Multi-Gigawatt Trainium ...

August 22, 2025

AWS’s AI Resurgence: Anthropic, Trainium, and Multi-Gigawatt Buildout Overview Two-and-a-half years after warnings about a potential “cloud crisis” at Amazon Web Services (AWS), evidence now points to a different trajectory....

H100 vs GB200 NVL72: Training Efficiency, TCO, ...

August 22, 2025

H100 vs. GB200 NVL72: Power, TCO, Reliability, and Software Maturity Frontier-scale training now pushes accelerators and racks to physical, thermal, and operational limits. Cost, efficiency, perf-per-TCO, and reliability are the...

H100 vs GB200 NVL72: Training Efficiency, TCO, ...

August 22, 2025

H100 vs. GB200 NVL72: Power, TCO, Reliability, and Software Maturity Frontier-scale training now pushes accelerators and racks to physical, thermal, and operational limits. Cost, efficiency, perf-per-TCO, and reliability are the...

GPT-5 And The Path To Ad Monetization

August 15, 2025

The Router is the Release GPT-5 introduced a unified stack: a default efficient model, a deeper reasoning model (“GPT-5 Thinking”) for harder problems, and a real-time router that selects among...

GPT-5 And The Path To Ad Monetization

August 15, 2025

The Router is the Release GPT-5 introduced a unified stack: a default efficient model, a deeper reasoning model (“GPT-5 Thinking”) for harder problems, and a real-time router that selects among...