NVIDIA Developer Tools are available for detailed performance analysis of HPC applications running on NVIDIA A100 systems, such as ALCF's ThetaGPU and NERSC's Perlmutter. Nsight Systems provides developers a system-wide visualization of an application's performance. Developers can optimize bottlenecks to scale efficiently across any number or size of CPUs and GPUs on ThetaGPU. Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging via a user interface and command line tool. In this session, several use-cases of Nsight Systems and Nsight Compute will be presented via a demo with simple HPC benchmarks on ThetaGPU.