Javlon Baxtiyorov
← Writing

The Chip Wars: Hyperscalers Surround NVIDIA With Custom Silicon

Amazon, Google, Microsoft, and Meta are building custom accelerators to cut their dependence on NVIDIA, signaling a slow unbundling of the AI hardware monopoly.

The Chip Wars: Hyperscalers Surround NVIDIA With Custom Silicon
Photo by CardMapr.nl on Unsplash

The big cloud providers have stopped waiting for NVIDIA to be the only answer. Amazon has Trainium, Google has the Ironwood TPU, Microsoft has Maia 200, and Meta has its MTIA accelerators. The framing in the trade press is that the hyperscalers are surrounding NVIDIA with custom application-specific chips to reduce their dependence on its hardware.

My first reaction isn't excitement about the chips. It's recognition of the pattern. This is what every company does when one supplier owns too much of its cost structure: it builds the second source. AWS did it with Graviton on the CPU side, and the lesson stuck — control the silicon, control the margin.

This is a dependency-management story

Strip away the hardware glamour and this is the same problem I deal with in software every week: a single dependency that everything routes through, that sets its own price, that I can't easily swap out. The hyperscalers have that dependency, it's spelled NVIDIA, and they are doing exactly what I'd tell a junior engineer to do about a load-bearing third-party package — reduce the blast radius.

What that looks like from where I sit:

  • Each custom chip is a hedge, not a replacement. NVIDIA still sets the pace; these are pressure valves on the bill.
  • The competition is good for me as a buyer, eventually, the way any second supplier eventually drags prices down.
  • But every custom accelerator is also a new lock-in surface. Trainium is an Amazon story. Maia is a Microsoft story. They don't interchange.

That last point is the one I keep coming back to. "Less dependent on NVIDIA" and "more dependent on my specific cloud" can be the same sentence. If I rewrite my inference path to lean on one provider's bespoke silicon to save money, I've traded a hardware lock-in for a cloud lock-in. Sometimes that's a fine trade. It is never a free one.

What I'd actually do about it

I don't get to pick the winner of a chip war, and I shouldn't try. What I can do is keep my own systems indifferent to who wins:

  • Stay one abstraction layer above the metal. If my code talks to a standard runtime instead of a vendor's exact accelerator API, I can follow the cheapest silicon around.
  • Treat "which chip" as an operational decision, not an architectural one — something I change with config, not a rewrite.
  • Keep a portable fallback that runs on commodity hardware, even if it's slower. Slower and available beats fast and stranded.

The quiet good news in all this is that monopolies under pressure tend to behave better. Four credible in-house alternatives is the most leverage NVIDIA's customers have had in years, and leverage upstream usually trickles into prices downstream. I'm not betting my architecture on any one of these chips. I'm just glad they exist, because a market with four serious challengers is a market that argues about price — and I am always happy to be in the room when suppliers start arguing about price.


Sources: AI Chip Wars 2026, AI Chip Wars.


← All writing Get in touch →