Wafer Blog Posts

Balaji Varadarajan and Wafer Team

The Inference Alpha: Maximizing Frontier Models on AMD

How DigitalOcean and Wafer unlock order-of-magnitude inference speedups on AMD GPUs for Kimi 2.5, DeepSeek V3.2, and GLM-5 through deep kernel and systems engineering.

Read all 14 min