China’s LineShine supercomputer dethrones US’ El Capitan, secures first place in Top 500 list — first machine in the rankings to sustain more than 2 ExaFLOPS of

China's LineShine supercomputer dethrones US' El Capitan, secures first place in Top 500 list — first machine in the rankings to sustain more than 2 ExaFLOPS of

Follow Tom's Hardware on Google News , or add us as a preferred source , to get our latest news, analysis, & reviews in your feeds.

Anton Shilov is a contributing writer at Tom\u2019s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends. ","collapsible":{"enabled":true,"maxHeight":250,"readMoreText":"Read more","readLessText":"Read less"}}), "https://slice.vanilla.futurecdn.net/13-4-24/js/authorBio.js"); } else { console.error('%c FTE ','background: #9306F9; color: #ffffff','no lazy slice hydration function available'); } Anton Shilov Social Links Navigation Contributing Writer Anton Shilov is a contributing writer at Tom’s Hardware. Over the past couple of decades, he has covered everything from CPUs and GPUs to supercomputers and from modern process technologies and latest fab tools to high-tech industry trends.

usertests Is this using TSMC nodes? The GFLOPS/W is comparable to others with the Green500 #1 being 41% more efficient, while El Capitan is only at 17%. Top500 website is broken as usual with nonfunctional links to Green500 lists. Reply

TechieTwo …and I have some ocean front property in Arkansas that you'all be interested in buying. 😉 Reply

bit_user The article said: Each LX2 CPU relies on two compute chiplets and has a total of 304 CPU cores organized into eight CPU clusters containing 38 cores each. … The chip features a rather unusual memory architecture that pairs 32 GB of on-package HBM, offering up to 4 TB/s of bandwidth with as much as 256 GB of external DDR5 So, the whitepaper they previously published goes into quite some detail about its memory subsystem, which is highly NUMA. In fact, to such an extent that they optimized their code on it by running separate MPI nodes on each of those 38-core clusters, since each cluster has a local HBM stack and relatively poor inter-cluster and inter-die bandwidth. Also, the dependence on that relatively small amount of HBM to scale performance makes it fairly unsuitable for general-purpose workloads, even though it can theoretically handle them. The article said: In any case, the very fact that a Chinese supercomputer has achieved extraordinary FP64 performance is remarkable. Not their first. See: https://en.wikipedia.org/wiki/Sunway_TaihuLight IMO, the remarkable thing is its efficiency (if accurate). I think scaling up ExaFLOPS is not that hard, if your pockets are deep enough, but efficiency would be a better signifier of sophistication and advancement. However, even that can be swung by $$$, since you can easily boost perf/W by simply running more nodes at lower clock speeds. Due to the relative cost of energy vs. silicon, most HPC operators prefer to run their chips towards the upper end of the power envelope, which hurts them on efficiency metrics. Reply

Key considerations

  • Investor positioning can change fast
  • Volatility remains possible near catalysts
  • Macro rates and liquidity can dominate flows

Reference reading

More on this site

Informational only. No financial advice. Do your own research.

Leave a Comment