Argonne Leadership Computing Facility

ALCF Resources
- Computing Resources
  
  Aurora
  
  Polaris
  
  Sophia
  
  Crux
  
  ALCF AI Testbed
  
  Evaluation Testbeds
  
  Storage and Networking
- Facility Expertise
  
  Facility Expertise
Leadership Computing Resources

The ALCF provides users with access to supercomputing resources that are significantly more powerful than systems typically used for open scientific research.

Featured: Aurora
Science and Engineering
- Output
  
  Projects
  
  Publications
  
  Case Studies
- Allocation Programs
  
  INCITE Program
  
  ALCC Program
  
  Director’s Discretionary
  
  Early Science Program
  
  NAIRR Program
Computational Science

The ALCF is accelerating scientific discoveries in many disciplines, ranging from chemistry and engineering to physics and materials science.

Featured: Engineering
Community and Outreach
- Partnerships
  
  Industry
  
  Collaborations
- Educational Outreach
  
  Women in STEM
  
  Student Programs
- Community
  
  NAIRR Pilot
  
  ALCF Lighthouse Initiative
  
  Exascale Computing Roundtable
Growing the HPC Community

The ALCF is committed to providing training and outreach opportunities that prepare researchers to efficiently use its leadership computing systems, while also cultivating a diverse and skilled HPC workforce for the future.
About
- Get to Know More
  
  Leadership
  
  People
  
  Organizational Chart
  
  Code of Conduct
  
  User Advisory Council
  
  History
- Visit
  
  Visiting ALCF
  
  Tours
- Latest
  
  News
  
  Careers
- Press Kits
  
  ALCF Media Kit
  
  Aurora Media Kit
  
  Reports Archive
Accelerating Science

The Argonne Leadership Computing Facility enables breakthroughs in science and engineering by providing supercomputing resources and expertise to the research community.
Support
- Current
  
  Machine Status
  
  Facility Updates
  
  MyALCF
- Training
  
  Training Videos & Slides
  
  Training Overview
  
  Training and Events
Support Center

The ALCF Support Center assists users with support requests related to their ALCF projects.

Help Desk
Hours: 9:00am-5:00pm CT M-F
Email: support@alcf.anl.gov
Guides
Featured: Get Started

Featured: MyALCF

Foundation Models for Predictive Molecular Epidemiology

PI Arvind Ramanathan, Argonne National Laboratory

Co-PI Thomas Brettin, Argonne National Laboratory
Anima Anandkumar, California Institute of Technology and NVIDIA Inc.
Nicholas Chia, Mayo Clinic
Christian Dallago, NVIDIA Inc.
Thomas Gibbs, NVIDIA Inc.
Ian Foster, Argonne National Laboratory
Logan Ward, Argonne National Laboratory
Christopher Henry, Argonne National Laboratory
James Davis, Argonne National Laboratory
Maulik Shukla, Argonne National Laboratory
Azton Wells, Argonne National Laboratory
Carla Mann, Argonne National Laboratory
Venkatram Vishwananth, Argonne National Laboratory

Award INCITE

Hours Polaris: 150,000 node-hours; Aurora: 1,600,000 node-hours

Total Hours 1,750,000 node-hours

Year 2025

Domain Biological Sciences

Ramanathan INCITE 2024

Genome-scale language models (GenSLMs) for predictive molecular epidemiology.

Project Description

The potential for extant and emerging pathogens to become global health crises necessitates the development of novel methods for proactively engaging these threats before they become pandemic. Recent advances in machine learning and artificial intelligence—specifically, large language models (LLMs)—provide powerful tools for predictive modeling and monitoring of pathogens of concern. The team’s prior work developing genome-scale Language Models (GenSLMs demonstrated the potential for LLMs to predict future SARS-CoV-2 variants of concern prior to their emergence by modeling the evolutionary process. In this project the team builds on that work by scaling GenSLMs beyond the (relatively) simple SARS-CoV-2 to multi-segmented viruses and comparatively enormous bacterial genomes, and even further to more complex eukaryotic organisms including yeast and humans. This project will thus increase biopreparedness by providing a continuous watchlist of pandemic-potential variants across several different pathogens; and will additionally benefit the community by making GenSLM models, data, and code available to a broad user base, who can fine-tune our foundation models for their own downstream predictive tasks.

Domains

Biological Sciences

Allocations

INCITE