NERSC is hosting the First International Symposium on Checkpointing for Supercomputing (SuperCheck21) on February 4-5, 2021. This free online event will feature the latest work in checkpoint/restart research, tools development, and production use.
Important Dates
About the Workshop
Checkpoint/Restart (C/R) is critical for fault-tolerant computing in high-performance computing (HPC). While there has been much research and development on C/R and C/R tools, few HPC end users are able to use these tools in production workloads. Although research codes often demonstrate promising C/R capabilities, there are no feasible C/R options for diverse production workloads, especially on cutting-edge HPC systems. In this workshop, we will bring together C/R researchers, practitioners, application developers, and end users to share both the latest research results and experiences on adopting C/R tools in production. The goal of this workshop is to showcase the latest research on C/R, motivate the development of usable C/R tools, and boost the adoption of C/R tools in HPC production workloads. Paper submissions will be peer-reviewed, and a venue for accepted papers will be identified. We encourage PhD students and HPC end users to participate.
Workshop Scope
The workshop covers all aspects of checkpointing for science and engineering in the HPC context, including the latest research results as well as development, deployment, and application experiences. Topics include, but are not limited to:
C/R research and tools development:
C/R use in production (including all levels of checkpointing: application, job, and system levels):
Participation
We encourage participation from researchers and end users, professionals and students alike.