The Next Platform
  • Home
  • Compute
  • Store
  • Connect
  • Control
  • Code
  • AI
  • HPC
  • Enterprise
  • Hyperscale
  • Cloud
  • Edge
Latest
  • [ March 31, 2023 ] A Peek Into The Future Of AI Inference At Nvidia AI
  • [ March 31, 2023 ] The Age Of Acceleration Engines AI
  • [ March 31, 2023 ] Finally: Some Good News For The Intel Xeon CPU Roadmap Compute
  • [ March 29, 2023 ] Cerebras Smashes AI Wide Open, Countering Hypocrites AI
  • [ March 29, 2023 ] DDoS DNS Attacks Are Old-School, Unsophisticated . . . And They’re Back Security
  • [ March 28, 2023 ] Enfabrica Converges Extended Memory And I/O Down To One Chip Connect
  • [ March 28, 2023 ] Pushing The Boundaries Of Webscale Optical DCI Performance Cloud
  • [ March 27, 2023 ] The Dream Of Placing Blocks On Chip Designs With AI AI
HomeDMTCP

DMTCP

Store

Memory Snapshots Bring Checkpointing Into The 21st Century

December 9, 2021 Timothy Prickett Morgan 0

When you have a massively distributed computing job that can take months to run across thousands to hundreds of thousands of compute elements, one software hardware or software crash can mean losing an enormous amount of work. …

About

The Next Platform is published by Stackhouse Publishing Inc in partnership with the UK’s top technology publication, The Register.

It offers in-depth coverage of high-end computing at large enterprises, supercomputing centers, hyperscale data centers, and public clouds. Read more…

Newsletter

Featuring highlights, analysis, and stories from the week directly from us to your inbox with nothing in between.
Subscribe now

  • RSS
  • Twitter
  • Facebook
  • LinkedIn
  • Email the editor
  • About
  • Contributors
  • Contact
  • Sales
  • Newsletter
  • Books
  • Events
  • Privacy
  • Ts&Cs
  • Cookies
  • Do not sell my personal information

All Content Copyright The Next Platform