
Amazon launches Trainium3 AI accelerator, competing directly against Blackwell Ultra in FP8 performance
This design lets packets arrive out of order without confusing the software above it. Furthermore, because the change happens below the application level, existing apps, middleware, and Ethernet/IP networks can keep working as they are, which greatly simplifies the deployment of Ultra Ethernet in existing data centers.
Like a napoleon pastry, Ultra Ethernet technology has many layers that combine to form something greater than the sum of their parts. Their tailored evolutionary steps have created a major leap forward for networking technology aimed at hyperscale AI and HPC deployments.
The Physical Layer of Ultra Ethernet remains based on standard IEEE 802.3 Ethernet signaling and optics, meaning it uses the same cables, transceivers, and switch ports already deployed in today's data centers. But UE's physical layer introduces enhanced forward error correction (FEC), lower-latency link training, more precise timing and jitter specifications, better telemetry (to give operators more visibility into signal integrity and link health) and improved synchronization for large-scale AI and HPC deployments. Arguably the key improvements of Ultra Ethernet are in the link and transport layers.
The Link Layer manages direct connections between switches and network adapters and ensures signal integrity, low latency, and reliable performance across a data-center expanses. Ultra Ethernet's link layer introduces three optional upgrades: Credit-Based Flow Control (CBFC) keeps data flow steady between devices by avoiding buffer overloads; Link Layer Retry (LLR) quickly fixes transmission errors; and Packet Trimming (PT) prevents packet loss. Together, they help prevent congestion, reduce data loss, and keep network performance reliable and predictable.
The Transport Layer is the core of Ultra Ethernet as it enables quick, reliable, and secure data transfers in an out-of-order manner between systems. The layer uses the Ultra Ethernet Transport (UET) protocol, which is divided into four parts — Semantic, Packet Delivery, Congestion Management, and Transport Security — that manage how messages are sent, keep packets in order, control network traffic, and protect data through encryption. Even when packets take different routes and arrive out of order, the system reassembles them instantly, thus keeping latency low, throughput high, and performance steady across massive AI and HPC clusters.
The Storage Layer of Ultra Ethernet is more of an integration and optimization effort than a full redesign. It builds on existing Ethernet storage protocols like NVMe-over-Fabrics, RDMA, and RoCE rather than replacing them. Nonetheless, it optimizes the protocols to better suit large AI clusters by integrating them tightly with the UET to reduce I/O delays and improve consistency across thousands of nodes to connect high-speed networking with storage infrastructure more efficiently.
As for the Management Layer , Ultra Ethernet transforms management from basic device configuration into fabric-wide, automated network orchestration for large AI and HPC systems. To do so, it includes automated tools for device discovery, network topology mapping, and performance monitoring, which enables administrators to quickly diagnose and fix problems across thousands of interconnected nodes.
The Software Layer (or rather Software Stack ) in Ultra Ethernet — which interfaces applications and network — is not radically different from that in traditional Ethernet, but it is significantly expanded to make Ethernet more programmable and usable for AI and HPC deployments. The key improvement is that SL now integrates Libfabric , an open-source network API with unordered operations that allows programs to send data directly from memory to the network interface, bypassing the CPU and improving performance.
In addition, the Software Layer supports OpenConfig to standardize network management and the YANG data modeling language, which defines how configuration data is structured. In general, the software layer keeps Ethernet's core compatibility intact but adds programmability, cuts latency, and improves overall efficiency.
As with traditional Ethernet, the evolution of Ultra Ethernet is managed by a well-established ecosystem of standards organizations that form the Ultra Ethernet Consortium. Among these organizations, a Compliance Working Group sets the official standards and testing procedures to ensure that all Ultra Ethernet devices and software work together correctly and meet the required performance levels. There is also the Management Working Group, which creates tools and models for configuring, monitoring, and controlling UE networks so operators can manage large fabrics efficiently. Finally, there is a Performance and Debug Working Group to define benchmarks and diagnostic tools to test, measure, and troubleshoot UE systems to ensure high reliability and consistent performance.
While the UEC 1.0.1 specification defines the core architecture, transport, and link models, the technology is still in in its infancy in terms of its validation, adoption, and development. Most of its innovations — like the Ultra Ethernet Transport protocol, congestion management, packet trimming, and many others — are being tested, refined, and integrated into prototype hardware and software.
When it comes to hardware, there are already network cards said to be UEC-ready (rather than UEC 1.0-compliant), such as AMD's Pensando Pollara 400 AI NIC, which implies that for now it does not support all the features of the specification. While the card supports features like Intelligent Packet Spray, Out-of-order Packet Handling, Selective Retransmission, and Path-Aware Congestion Control, its developer does not mention link-level features like Packet Trimming, link-level Credit-Based Flow Control (CBFC), or advanced Link-Layer Retry (LLR) support. Then again, these are optional capabilities for now.
The consortium continues to develop future versions and extensions to the Ultra Ethernet specification, though only time will tell when these will be formally introduced.
Follow Tom's Hardware on Google News , or add us as a preferred source , to get our latest news, analysis, & reviews in your feeds.
Anton Shilov is a contributing writer at Tom\u2019s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends. ","collapsible":{"enabled":true,"maxHeight":250,"readMoreText":"Read more","readLessText":"Read less"}}), "https://slice.vanilla.futurecdn.net/13-4-13/js/authorBio.js"); } else { console.error('%c FTE ','background: #9306F9; color: #ffffff','no lazy slice hydration function available'); } Anton Shilov Social Links Navigation Contributing Writer Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.
Key considerations
- Investor positioning can change fast
- Volatility remains possible near catalysts
- Macro rates and liquidity can dominate flows
Reference reading
- https://www.tomshardware.com/networking/SPONSORED_LINK_URL
- https://www.tomshardware.com/networking/ultra-ethernet-the-data-center-interconnection-of-tomorrow-detailed#main
- https://www.tomshardware.com
- Intel XeSS 3 MFG mod triples Arc A380 triples performance in Cyberpunk 2077 — supercharged 6GB GPU pumps out 140 FPS at 1080p on low preset
- GeForce NOW Brings GeForce RTX Gaming to Linux PCs
- Semiconductor industry on track to hit $1 trillion in sales in 2026, SIA predicts — bumper forecast follows $791.7 billion haul for 2025
- AI Copilot Keeps Berkeley’s X-Ray Particle Accelerator on Track
- Developer creates real-time 3D shader demo for the Game Boy Color — you can download the ROM or interact with the 3D teapot online
Informational only. No financial advice. Do your own research.