This is an interactive videoconference on using HPCToolkit to analyze parallel program performance. HPCToolkit uses low-overhead asynchronous sampling to obtain deep insight into performance bottlenecks and identify opportunities for tuning both within and across the nodes of a parallel system.