
When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works .
Nvidia's Rubin Ultra GPU with four compute chiplets was arguably one of Nvidia's most ambitious projects in recent years, as it not only doubled performance compared to the original Rubin (which uses two compute chiplets), but also increased the complexity of Nvidia's data center GPUs to levels never seen before. However, connecting four near reticle-sized dies using existing advanced packaging technologies is a tremendous engineering challenge, and cooling four complex dies and 16 HBM4E modules is hard and costly. As a result, due to 'manufacturing execution concerns,' Nvidia reportedly canceled Rubin Ultra in its four compute dies form in favor of a design with two compute chiplets. Note that the information is unofficial, so take it with a grain of salt. We've reached out to Nvidia for comment.
As a consequence, Nvidia's 'new' Rubin Ultra would be around half as powerful as the original one, which would certainly make it less competitive against contending offerings, namely AMD's Instinct MI500-series. Of course, Nvidia will still likely optimize its Rubin Ultra design to squeeze some additional performance out of the AI accelerator to justify the upgrade. Also, keep in mind that Nvidia's Rubin Ultra uses HBM4E memory instead of HBM4 used by the original Rubin. Furthermore, starting with Rubin GPUs, Nvidia plans to offer liquid-cooled Kyber rack-scale systems that increase GPU count per scale-up domain to at least 144 packages, which will increase compute performance that Nvidia will sell to its customers.
SemiAnalysis notes that the impact of the cancellation of an AI accelerator with 16 HBM4E packages could have an impact on the HBM market in general, as the 'new' Rubin Ultra will only use eight HBM4E modules.
The purported cancellation of Rubin Ultra with four compute chiplets would also mean that one Rubin Ultra GPU with two compute chiplets will cost less than the original one. Meanwhile, since Nvidia is mostly focused on selling rack-scale solutions rather than on individual GPUs, it remains to be seen how this impacts the actual spending of Nvidia's partners, since if they have to buy more systems to get more GPUs, they will likely spend more than they would if they had to buy fewer systems with the same number of compute chiplets.
Intel's long-lost data center prototype 'Arctic Sound' Xe-HP multi-tile GPU surfaces in new engineering sample
Nvidia says AI cuts 10-month, eight-engineer GPU design task to overnight job
Key considerations
- Investor positioning can change fast
- Volatility remains possible near catalysts
- Macro rates and liquidity can dominate flows
Reference reading
- https://www.tomshardware.com/tech-industry/artificial-intelligence/SPONSORED_LINK_URL
- https://www.tomshardware.com/tech-industry/artificial-intelligence/nvidia-reportedly-cancels-quad-die-rubin-ultra-gpu-in-favor-of-dual-gpu-design-report-claims-complex-design-purportedly-scrapped-over-manufacturing-execution-concerns#main
- https://www.tomshardware.com/membership
- Into the Omniverse: Three Workflows for Improving Vision AI Agent Accuracy With Synthetic Data and Fine-Tuning
- The Ryzen 7 5800X3D is tough to find, but these Prime Day CPUs with DDR5 are cheaper — offset the cost of a DDR5 upgrade with a CPU discount
- Eco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins
- Sony WH-1000XM5 active noise-canceling headphones for an all-time low $198 at Amazon — audiophile-grade audio and ANC for an affordable price
- AMD engineer 3D-prints Steam Machine-a-like with diagonal mobo mounting — parts include a Mini ITX motherboard, RTX 5060, and a flex ATX PSU
Informational only. No financial advice. Do your own research.