The ALCF's annual Simulation, Data, and Learning Workshop will be held virtually December 8-10, 2020. The interactive workshop is aimed at researchers with near-term goals of applying for a major allocation award.
The ALCF's Simulation, Data, and Learning Workshop is designed to help researchers improve the performance and productivity of simulation, data science, and machine learning applications on ALCF systems. Workshop participants will have the opportunity to:
REGISTRATION DEADLINES
Note: Registrants will be reviewed for experience level and will be asked to provide goals for attending.
12/8 Day One (10AM-3PM Central Time) will be a hands-on tutorial for introducing distributed data parallel training on ALCF systems. There will be experts on hand as you run through examples from our Git repo. These examples will teach you how to run deep learning training on multi-GPU nodes and on multiple nodes of ThetaGPU, or a multi-CPU system like Theta. There will also be discussion of how to build proper data pipelines to keep your workflows humming.
12/9 Day Two (10AM-3PM Central Time) will focus on DeepHyper, a tool for distributed hyperparameter optimization. Again, this will be done via tutorial using examples from our Git repos, with experts walking you through the steps. We will also cover how to identify performance issues using common profilers such as VTune, TAU, and the built-in Tensorflow Profiler.
12/10 Day Three (10AM-3PM Central Time) Now that you have a performant, trained, deep network, Day Three will cover the important topic of how to deploy it at scale in a simulation. Integrating model inference into distributed simulations will be covered using tutorials from our Git repo.
Visit our GitHub repo for workshop materials.
Day 1: Tuesday,
|
Topic |
Speaker(s) |
9:30 - 10:00 am (CT) | Attendee check-in | |
10:00 - 10:10 am (CT) | Welcome | Michael Papka (ALCF Director) |
10:10 - 11:10 am (CT) |
Intro to SDL Workshop
|
Taylor Childers (Argonne) |
11:10 am - 1:00 pm (CT) | Distributed Deep Learning [Video 1, Video 2] |
Huihuo Zheng, Corey Adams (Argonne) |
1:00 - 2:00 pm (CT) | Lunch Break | |
2:00 - 4:00 pm (CT) |
Building Data Pipelines |
Taylor Childers (Argonne) |
Day 2: Wednesday,
|
||
10:30 - 11:00 am (CT) | Attendee check-in | |
11:00 am - 1:00 pm (CT) | Distributed Hyper Optimization [Video 1, Video 2, Video 3] |
Kyle Felker, Misha Salim, Romit Maulik, Sam Foreman (Argonne) |
1:00 - 2:00 pm (CT) | Lunch Break | |
2:00 - 4:00 pm (CT) | Huihuo Zheng, Murali Emani, Taylor Childers (Argonne) | |
Day 3: Thursday,
|
||
10:30 - 11:00 am (CT) | Attendee check-in | |
11:00 am - 1:00 pm (CT) |
Integrating Inference into Simulation |
Bethany Lusch, Romit Maulik (Argonne) |
1:00 - 2:00 pm (CT) | Lunch Break | |
2:00 - 4:00 pm (CT) |
Hands-on Session |
|
4:00 - 4:30 pm (CT) | Applying for ALCF Allocation Programs [Video] |
Katherine Riley (Argonne) |
4:30 - 5:00 pm (CT) | Closing Remarks and Wrap-Up/Next Steps | Ray Loy (Argonne) |