MProt-DPO: Breaking the ExaFLOPS Barrier for Multimodal Protein Design Workflows

Join us on January 29, 2025, for a webinar covering MProt-DPO, an AI-driven multimodal framework for protein design.

Argonne's Gautham Dharuman and Väinö Hatanpää will present a scalable, end-to-end workflow for protein design. This work was done on five major supercomputing resources, including Aurora, Argonne's exascale machine. By augmenting protein sequences with natural language descriptions of their biochemical properties, the team trained generative models that can be preferentially aligned with protein fitness landscapes.

The team integrated complex experimental and simulation-based observations as preference parameters for generating new protein variants and demonstrated the workflow on five diverse supercomputers. The team's implementation thus sets high-water marks for multimodal protein design workflows.

Gautham Dharuman is an Assistant Computational Scientist in the Data Science and Learning division at Argonne National Laboratory. He earned a dual Ph.D. in Computational Science and Engineering and Electrical Engineering from Michigan State University in 2018. His research focuses on developing and applying advanced AI models and methods at scale to tackle challenges in automated scientific discovery. This includes multimodal models for protein design workflows, preference optimization and reinforcement learning methods for incorporating experimental feedback from automated laboratories, neural operator surrogates for complex dynamical systems, agentic frameworks for scientific workflows, and scaling large language model training frameworks on emerging exascale systems.

Väinö Hatanpää is an Assistant Computer Scientist at the Argonne Leadership Computing Facility (ALCF) at Argonne National Laboratory. He received his Master of Science in Machine Learning from Aalto University, Finland. His research interests are large-scale deep learning applications on exascale systems, with a focus on the communication and computational patterns of large language models.

01/29/2025, 11am – 12pm CT