The Next Platform

Fault Tolerance

HPC

Who Shoulders the Supercomputing Resiliency Burden?

January 11, 2021 Nicole Hemsoth Prickett

While the related topics of fault tolerance and resiliency do not garner the same attention as performance and efficiency, being able to recover from and work around failures is more important than ever, especially as applications take over ever-larger and increasingly heterogeneous machines. …

All Content Copyright The Next Platform