About

Recent advances in high-throughput experimental methods and machine learning approaches have fueled interest in ML-driven protein design. These advances may enable more rapid development of designed proteins with applications ranging from biopharmaceuticals, catalysis, material design and basic science research. However, this excitement has exposed important research questions across the foundation of this emerging engineering discipline. For example:

  • What experimental approaches can feed the data-driven design cycle?

  • Which machine learning models and parameterizations of proteins hold the right inductive biases?

  • What are the limits of the growing structural and evolutionary data in the PDB and UniProt?

  • How do we use our trained models to guide data collection?

We think these questions will be best addressed by a collaborative, interdisciplinary community. Thus, the ML4Protein Engineering community runs a bi-weekly seminar series to address these advances and other outstanding problems, such as high-throughput screening, model-based optimization, and representation learning.

To access announcements, please follow us on Twitter! You can also visit our YouTube to see recordings of past talks! Also, be sure to join our NEW Slack Community, where we discuss even more opportunities beyond the seminar series!

Check out our Slack Community!

Upcoming Seminars

Every other Tuesday 4-5pm EST unless otherwise noted

For a list of past seminars and recordings, check the full schedule page.

October-December 2024:

October 1st — Amy Lu, PhD student (UC Berkeley)

Tokenized and Continuous Embedding Compressions of Protein Sequence and Structure

October 15th — Kapil Devkota, PhD (Duke)

Template-based protein editing using Raygun

October 29th — Andre Cornman, PhD (Tatta Bio)

The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling

November 12th — Arda Goreci (Ligo Biosciences)

Lessons from implementing AlphaFold3 in the wild

November 19th — Will Hua (McGill University)

AI-Assisted De Novo Enzyme Design

November 26th — Jin Su (Westlake University)

ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning

December 12th — Baker Lab (University of Washington)

Multistate and functional protein design using RoseTTAFold sequence space diffusion

 

Organizers

Meg Taylor
Duke University Biomedical Engineering PhD Student

Divya Nori
MIT Computer Science BS & MEng Candidate

Jason Yang
CalTech Chemical Engineering PhD Candidate

Ria Vinod
Brown University Computational Biology PhD Candidate

Leo Zang
Duke University Biomedical Engineering Masters Candidate

Bo Qiang
University of Washington Computer Science PhD Student

Past Organizers

Kevin K. Yang
Senior Researcher, Microsoft Research

Brian L. Trippe
Postdoctoral Fellow, Columbia University

Ava P. Soleimany
Senior Researcher, Microsoft Research

Lucy Colwell
Research Scientist, Google Research

Jody Mou
MIT HST PhD Student

Amy Lu
UC Berkeley EECS PhD Student

Alex X. Lu
Senior Researcher, Microsoft Research

Marshall Case
Computational Biologist, Manifold Bio

Tianyu Lu
Stanford Bioengineering PhD Student

David Belanger
Research Scientist, Google Research

Andreea Gane
Research Scientist, Google Research