Best Practices for Queueing and Running Jobs on Theta

Help Desk

Slides
Best Practices for Queueing and Running Jobs on Theta

This session focuses on effective scheduling and configuration of jobs on the ALCF's Theta supercomputer to improve user experience. After a brief overview of the Cobalt scheduler, we describe best practices for writing batch scripts, working interactively, such as with Jupyter notebooks, and considerations that affect queue turnaround time. We present example scripts for common simulation, data, and learning workloads, such as utilizing SSDs for local storage, using Singularity to run containerized jobs, and launching ensemble runs. We also show how high-throughput workloads can leverage Balsam to launch many applications per Cobalt job.

Presenter
Chris Knight, Argonne National Laboratory
Adrian Pope, Argonne National Laboratory
Misha Salim, Argonne National Laboratory
Applications
Systems