Tutorial Details: Checkpointing HPC Applications
|Date:||Wednesday, April 25, 2012, 01:00 pm - 04:00 pm|
|Instructor(s):||Jeffrey McDonald, MSI, Shuxia Zhang, MSI|
Checkpointing HPC applications has been a challenging, but highly desired functionality for saving the state of long-running applications. This functionality hedges against failure modes from unexpected events that can cause premature failure of an application.
This workshop will teach MSI users the available checkpointing tools that MSI supports and how to use them for checkpointing your application without modifying the code. A hands-on session of 1.5 hours will be provided, allowing attendees to implement the procedures for checkpointing your jobs.
|Prerequisites:||Some knowledge of Unix and parallel computing|