
The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructur e. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling agentic AI applications.
NVIDIA AI Clouds are a growing ecosystem of purpose-built clouds serving the exploding token demand behind today’s most popular AI applications. These AI clouds have been co-designed with NVIDIA’s full-stack AI infrastructure to meet surging demand for AI from enterprises, startups and nations looking for new vendors and regional capacity.
They combine NVIDIA accelerated computing, networking and AI software to help partners support training, fine-tuning, inference, agentic AI, physical AI and sovereign AI deployments. Specific configurations vary by partner and workload.
AI cloud partners choose NVIDIA for the best economics — lowest token cost, best throughput per watt — to run frontier and open source AI. Built with NVIDIA accelerated computing, networking and AI software, these clouds bring AI factories closer to where data, developers, users and industries are, helping customers train, tune and run agentic AI applications at scale. The ecosystem spans nearly every geography, supporting regional and sovereign AI capacity for frontier model builders, enterprises, startups, software providers and national AI programs.
“Every company and every country needs AI factory infrastructure to turn data into intelligence,” said Jensen Huang, founder and CEO of NVIDIA . “NVIDIA AI Clouds bring full-stack AI factories closer to the regions, industries and developers building the next generation of AI, from model training to real-time inference and AI agents that will transform how people and organizations work.”
AI cloud providers, telcos, sovereign AI builders and vertically integrated infrastructure providers are building AI factories with NVIDIA to serve customers across frontier AI, enterprise AI, telecommunications, developer clouds and national AI programs.
Regional growth is accelerating across Southeast Asia, Australia and the Americas, with NVIDIA AI Clouds now reaching six continents following the addition of Cassava in Africa and Claro in South America.
NVIDIA AI Clouds are pairing large-scale AI factory buildouts with demand from leading AI labs, enterprises, governments and digital service providers. Partners including CoreWeave , Firmus , IREN and Nscale are expanding AI infrastructure to support frontier model development, enterprise AI, agentic applications and high-volume inference.
Across regions, NVIDIA AI Clouds are bringing AI factories closer to local industries and sovereign AI ecosystems. Partners including Firebird , GMI Cloud , I ndosat Ooredoo Hutchison , Lambda , Naver Cloud , Sharon AI , Yotta and YTL are supporting emerging AI companies, national AI initiatives, financial services, telecommunications, manufacturing, education, healthcare and developer ecosystems.
For governments and regulated industries, regional AI clouds can support sovereign controls and local compliance requirements. For developers and enterprises, they can reduce friction in accessing accelerated infrastructure for AI agents, enterprise copilots, digital workers and other AI services that must run close to users and data.
Firmus Technologies is expanding its AI factory footprint across South Australia and Southeast Asia, building energy-efficient infrastructure to support growing demand for large-scale training, inference and agentic AI workloads.
Through Project Southgate, Firmus is developing AI factories in Tasmania, Melbourne, South Australia and New South Wales, with an emphasis on renewable power, advanced cooling and modular infrastructure that can bring capacity online faster. The company has also deployed AI infrastructure in Singapore through a partnership with ST Telemedia Global Data Centres.
Firmus is using NVIDIA’s accelerated computing and reference architecture as part of its buildout, with NVIDIA DSX helping streamline AI factory design, deployment and operations.
Engineered in alignment with the NVIDIA DSX platform, the liquid-cooled Firmus HyperCube is designed to fast-track modular AI Factory builds and optimize for low cost per token. Firmus is innovating across the AI factory supply chain, including cooling and energy.
“AI agents are creating a new class of industrial-scale demand for tokens, and Asia-Pacific needs AI factories that can be built faster, liquid-cooled more efficiently and operated at gigawatt scale,” said Tim Rosenfield, co-CEO of Firmus . “Together with NVIDIA, Firmus is building liquid-cooled, AI infrastructure designed to deliver AI tokens as efficiently and rapidly as possible for the region’s most important customers.”
CoreWeave is expanding its NVIDIA AI Cloud platform to support the next wave of agentic AI, physical AI and frontier model workloads.
An early adopter of NVIDIA Vera Rubin and the NVIDIA Vera CPU, CoreWeave is also among the first to adopt NVIDIA Spectrum-X Ethernet Photonics, helping provide the networking foundation for million-GPU AI factories. CoreWeave is extending its platform for robotics and physical AI workflows, including using NVIDIA Cosmos 3 , the latest frontier world foundation model , to help teams generate synthetic data, fine-tune models and accelerate robotics data flywheels. Leading AI labs, including Anthropic , build on CoreWeave’s infrastructure to support frontier models at scale.
“AI factories are becoming the foundation for the agentic era,” said Michael Intrator, cofounder, chairman and CEO of CoreWeave . “Together with NVIDIA, CoreWeave is building the full-stack cloud infrastructure that gives AI labs, enterprises and developers the performance, scale and reliability they need to turn frontier models, AI agents and physical AI systems into production applications.”
Nebius is expanding its NVIDIA AI Cloud with a full-stack platform for training, inference and physical AI development.
An early adopter of NVIDIA Vera Rubin, Nebius is building integrated AI infrastructure from silicon to software, including its Nebius AI Cloud, Token Factory inference layer and new Physical AI Workbench. The workbench brings technologies including NVIDIA Cosmos 3, NVIDIA Isaac Sim and Isaac GR00T into composable workflows that can be assembled by AI agents, helping robotics and autonomous systems teams move faster from simulation and synthetic data to training and evaluation.
“Developers should be able to build AI systems without spending weeks wiring together infrastructure,” said Arkady Volozh, founder and CEO of Nebius . “With NVIDIA, Nebius is creating an AI cloud where AI agents can compose the tools, data and compute needed to accelerate AI workloads — from robotics and life sciences to the enterprise — from experimentation to production.”
Since NVIDIA introduced Exemplar Cloud last year, six NVIDIA Cloud Partners have achieved Exemplar Cloud status: CoreWeave , Crusoe , Lambda , Nebius , Vultr and YTL . The growing roster reflects increasing demand for AI cloud infrastructure that can deliver consistent performance, reliability and efficiency for production AI workloads.
These providers are helping raise the performance bar across the AI cloud ecosystem, giving enterprises, developers and AI labs more validated options for scaling training, inference and agentic AI services.
As AI shifts from model development to reasoning and high-volume inference, the measure of infrastructure is no longer just capacity announced but also the economics of token output driven by platform utilization, uptime, long asset life and the breadth and depth of useful AI agents people can put to work.
Built on NVIDIA full-stack AI factory platforms, AI Clouds help partners optimize infrastructure for these measures.
Cost per token is the total cost of ownership metric that directly accounts for hardware performance, software optimization, ecosystem support and real-world utilization. NVIDIA delivers the lowest cost per token in the industry, driven by delivered token throughput, software optimization and full-stack codesign across compute, networking, memory and storage.
NVIDIA AI Clouds are adopting the NVIDIA DSX platform to design, build and operate AI factories.
DSX brings together validated reference designs, simulation, software and ecosystem technologies to help cloud providers bring capacity online faster, operate more efficiently and maximize revenue.
DSX Sim helps teams model and validate AI factories before deployment. DSX Flex helps AI factories dynamically adapt workloads to grid conditions. DSX MaxLPS helps power-constrained AI factories maximize compute within a fixed power budget, enabling up to 40% more GPUs. DSX OS helps automate lifecycle management and operations at scale.
DSX helps AI Clouds reduce deployment risk, improve resiliency, deliver more tokens per watt and achieve the lowest cost token.
Key considerations
- Investor positioning can change fast
- Volatility remains possible near catalysts
- Macro rates and liquidity can dominate flows
Reference reading
- https://blogs.nvidia.com/blog/ai-cloud-ecosystem/#primary
- https://blogs.nvidia.com/blog/author/dionharris/
- https://blogs.nvidia.com/blog/ai-cloud-ecosystem/#disqus_thread
- Anbernic's retro gaming handhelds are quietly losing RAM capacity and being downgraded to older LPDDR3 memory — company says 1GB capacity is still the standard,
- Acer unveils its first Ryzen 9 9955X3D gaming laptop — refreshed Nitro 16 joins new Predator Helios 18 AI and streaming-only Nitro Blaze Link handheld
- Acer cranks its latest Predator gaming monitor up to 1,000 Hz — Model joins lineup with QD-OLED panels and an immersive 3D display
- NVIDIA Research Advances Robotics From Simulation to the Real World
- The 'ultimate mosquito killer' uses lasers and AI — custom model trained to detect and lock lasers on these pests
Informational only. No financial advice. Do your own research.