
Combined 2026 capex from Amazon, Microsoft, Alphabet, and Meta is tracking between $650 billion and $700 billion , with some Wall Street projections exceeding $1 trillion for 2027, and every hyperscaler has told investors that inference capacity is being absorbed as fast as it can be deployed. Internal developer consumption is obviously part of that absorption, and it sits alongside paying external customers in the usage data that informs the likes of capacity planning, GPU orders, HBM procurement, and power infrastructure.
Tokenmaxxing doesn’t mean the demand is fabricated — enterprise AI adoption is broadening, and inference workloads are scaling into production — but there’s a distinction between adoption and consumption intensity. The former is a durable driver of demand, whereas the latter is gameable, and it’s currently being amplified by the incentive structures that these companies built. The water is further muddied by reports that AI is more expensive than actual workers .
Meta's internal leaderboard lasted days after public exposure, and Amazon recently restricted visibility of team-wide usage statistics. And when measurement shifts, the consumption intensity they incentivized will shift with them.
Nvidia CEO Jensen Huang has highlighted per-engineer token consumption as a key metric, stating he’d be "deeply alarmed" if a $500,000-a-year engineer was not consuming at least $250,000 in tokens. Nvidia's inference growth obviously depends on that consumption being a productive workload that persists and compounds because every inflated token is real GPU time.
Get Tom's Hardware's best news and in-depth reviews, straight to your inbox.
Key considerations
- Investor positioning can change fast
- Volatility remains possible near catalysts
- Macro rates and liquidity can dominate flows
Reference reading
- https://www.tomshardware.com/tech-industry/big-tech/SPONSORED_LINK_URL
- https://www.tomshardware.com/tech-industry/big-tech/big-tech-has-a-tokenmaxxing-habit#main
- https://www.tomshardware.com
- Portable 40mm launcher kills drones by firing 6.5-feet-long steel chains at 80 m/s — German researchers' low-tech mechanical 'bola' outshines textile, drops qua
- Microsoft's massive Kenya AI data center would require switching off 'half the country' to meet power requirements, government says — $1 billion project stalls
- Nemotron Labs: What OpenClaw Agents Mean for Every Organization
- Into the Omniverse: Manufacturing’s Simulation-First Era Has Arrived
- AMD Ryzen 9 9950X3D2 vs Ryzen 9 9950X3D faceoff — How far does dual cache take you?
Informational only. No financial advice. Do your own research.