Large Language Models: Embeddings and Tokenization

Help Desk

Email: support@alcf.anl.gov

 

MyALCF

Portal: my.alcf.anl.gov

Video

The video covers essential concepts of sequential data modeling and modeling approaches such as transformers. 

 

Archit Vasan is a postdoctoral appointee in the Argonne Leadership Computing Facility with a background in computational biophysics. His research interests at ALCF involve the discovery of cancer drugs using machine Learning coupled to exascale computing. Archit received a BA in Physics and Mathematics from Austin College in 2016. He then received his PhD in Biophysics from the University of Illinois at Urbana-Champaign in 2023 under the guidance of Dr. Emad Tajkhorshid.

 

As a lead of the ALCF Catalyst team, Chris Knight works closely with researchers to help them accomplish their scientific goals using leadership computational resources. To address the unique challenges of efficiently using leadership-scale resources, Chris assists researchers with profiling and debugging their codes, discusses strategies and provides general guidance on code parallelization, I/O, load-balancing, workflow design, and data management. Important components of this work are training users on key high-performance computing topics and collaborating with researchers to advance their scientific mission.

Systems