Workshop
Machine Learning in Structural Biology Workshop
Hannah Wayment-Steele · Roshan Rao · Ellen Zhong · Sergey Ovchinnikov · Gabriele Corso · Gina El Nesr
Room 208 - 210
Structural biology, the study of the 3D structure or shape of proteins and other biomolecules, has been transformed by breakthroughs from machine learning algorithms. While methods such as AlphaFold2 have made dramatic progress in certain areas, many active and open challenges for the field remain, including modeling protein dynamics, predicting the structure of other classes of biomolecules such as RNA, and ultimately relating the structure of isolated proteins to the in vivo and contextual nature of their underlying function. These challenges are diverse and require interdisciplinary collaboration between ML and structural biology researchers. The 4th edition of the Machine Learning in Structural Biology (MLSB) workshop focuses on these challenges and opportunities. In a unique show of support, the journal PRX Life has committed to waiving publication fees for interested authors whose accepted papers appear in a special collection. We anticipate this workshop will be of significant interest to both ML researchers and computational/experimental biologists, and will stimulate continued problem-solving and new directions in the field.
Schedule
Fri 6:30 a.m. - 6:35 a.m. | Opening Remarks (Remarks)
Fri 6:35 a.m. - 7:00 a.m. | Health system scale language models for clinical and operational decision making (Talk) | Kyunghyun Cho
Fri 7:00 a.m. - 7:15 a.m. | Validation of de novo designed water-soluble and membrane proteins by in silico folding and melting (Contributed)
Fri 7:15 a.m. - 7:40 a.m. | Accurate and tunable de novo protein shapes for new functions (Talk) | Tanja Kortemme
Fri 8:00 a.m. - 8:25 a.m. | A CryoET Data Portal to Foster a Collaboration between the Machine Learning and CryoET Communities (Talk) | Bridget Carragher
Fri 8:25 a.m. - 8:40 a.m. | AlphaFold Meets Flow Matching for Generating Protein Ensembles (Contributed)
Fri 8:40 a.m. - 8:55 a.m. | DSMBind: an unsupervised generative modeling framework for binding energy prediction (Contributed)
Fri 8:55 a.m. - 9:20 a.m. | Leveraging microfluidics for high-throughput and quantitative biochemistry and biophysics (Talk) | Polly Fordyce
Fri 9:20 a.m. - 10:40 a.m. | Poster Session 1 / Lunch (Poster session)
Fri 10:40 a.m. - 11:05 a.m. | Illuminating protein space with a programmable generative model (Talk) | Gevorg Grigoryan
Fri 11:05 a.m. - 11:20 a.m. | Protein generation with evolutionary diffusion: sequence is all you need (Contributed)
Fri 11:20 a.m. - 11:45 a.m. | De novo design of protein structure and function with RFdiffusion (Talk) | Jason Yim · Brian L Trippe
Fri 12:00 p.m. - 12:15 p.m. | DiffDock-Pocket: Diffusion for Pocket-Level Docking with Sidechain Flexibility (Contributed)
Fri 12:15 p.m. - 12:30 p.m. | PoseCheck: Generative Models for 3D Structure-based Drug Design Produce Unrealistic Poses (Contributed)
Fri 12:30 p.m. - 12:55 p.m. | World-wide competitions and the RNA folding problem (Talk) | Rhiju Das
Fri 1:00 p.m. - 2:00 p.m. | Panel Session (Session)
Fri 2:00 p.m. - 3:00 p.m. | Poster Session 2 / Happy Hour (Poster Session)
Fri 3:00 p.m. - 3:05 p.m. | Closing Remarks (Remarks)
ESMFold Hallucinates Native-Like Protein Sequences (Poster)
We describe protein sequence design by inverting the protein structure prediction algorithm ESMFold, which achieves high accuracy by relying on evolutionary patterns learned by a pretrained protein language model (PLM; ESM2). In principle, by inverting ESMFold, protein sequences can be designed to fulfill one or more design objectives, such as high prediction confidence, predicted protein binding, or other geometric constraints that can be expressed with loss functions. In practice, sequences designed using an inverted AlphaFold model, termed AFDesign, contained unnatural sequence profiles and were shown to express poorly, whereas an inverted RosettaFold network was shown to be sensitive to adversarial sequences. Here, we demonstrate that these limitations do not extend to neural networks that include PLMs, such as ESMFold. Our inverted model, termed ESM-Design, can generate sequences with profiles that are both more native-like and more likely to express than sequences generated using AFDesign. However, these sequences are less likely to express than sequences rescued by the structure-based design method ProteinMPNN. The safeguard offered by the PLM came with steep increases in memory consumption, preventing proteins greater than 150 residues from being modeled on a single GPU with 80 GB of VRAM. During this investigation, we also observed the role played by different sequence initialization schemes, with random sampling of discrete amino acids improving convergence and model quality over any continuous random initialization method. Finally, we showed how this approach can be used to introduce sequence and structure diversification in small proteins such as ubiquitin, while respecting the sequence conservation of active-site residues. Our results highlight the effects of architectural differences between structure prediction networks on zero-shot protein design.
Jeliazko Jeliazkov · Diego del Alamo · Joel Karpiak
Conditioned Protein Structure Prediction (Poster)
Deep learning based protein structure prediction has facilitated major breakthroughs in biological sciences. However, current methods struggle with alternative conformation prediction and offer limited integration of expert knowledge on protein dynamics. We introduce AFEXplorer, a generic approach that tailors AlphaFold predictions to user-defined constraints in coarse coordinate spaces by optimizing embedding features. Its effectiveness in generating functional protein conformations in accordance with predefined conditions was demonstrated through comprehensive examples. AFEXplorer serves as a versatile platform for conditioned protein structure prediction, bridging the gap between automated models and domain-specific insights.
Tengyu Xie · Zilin Song · Jing Huang
Stable Online and Offline Reinforcement Learning for Antibody CDRH3 Design (Poster)
The field of antibody-based therapeutics has grown significantly in recent years, with targeted antibodies emerging as a potentially effective approach to personalized therapies. Such therapies could be particularly beneficial for complex, highly individual diseases such as cancer. However, progress in this field is often constrained by the extensive search space of amino acid sequences that forms the foundation of antibody design. In this study, we introduce a novel reinforcement learning method specifically tailored to address the unique challenges of this domain. We demonstrate that our method can learn the design of high-affinity antibodies against multiple targets in silico, utilizing either online interaction or offline datasets. To the best of our knowledge, our approach is the first of its kind and outperforms existing methods on all tested antigens in the Absolut! database.
Yannick Vogt · Mehdi Naouar · Maria Kalweit · Christoph Cornelius Miething · Justus Duyster · Roland Mertelsmann · Gabriel Kalweit · Joschka Boedecker
Guiding diffusion models for antibody sequence and structure co-design with developability properties (Poster)
Recent advances in deep generative methods have allowed antibody sequence and structure co-design. This study addresses the challenge of tailoring the highly variable complementarity-determining regions (CDRs) in antibodies to fulfill developability requirements. We introduce a novel approach that integrates property guidance into the antibody design process using diffusion probabilistic models. This approach allows us to simultaneously design CDRs conditioned on antigen structures while considering critical properties like solubility and folding stability. Our property-conditioned diffusion model offers versatility by accommodating diverse property constraints, presenting a promising avenue for computational antibody design in therapeutic applications.
Amelia Villegas-Morcillo · Jana M. Weber · Marcel Reinders
AlphaFold Distillation for Protein Design (Poster)
Inverse protein folding, the process of designing sequences that fold into a specific 3D structure, is crucial in bio-engineering and drug discovery. Traditional methods rely on experimentally resolved structures, but these cover only a small fraction of protein sequences. Forward folding models like AlphaFold offer a potential solution by accurately predicting structures from sequences. However, these models are too slow for integration into the optimization loop of inverse folding models during training. To address this, we propose using knowledge distillation on folding model confidence metrics, such as pTM or pLDDT scores, to create a faster and end-to-end differentiable distilled model. This model can then be used as a structure consistency regularizer in training the inverse folding model. Our technique is versatile and can be applied to other design tasks, such as sequence-based protein infilling. Experimental results show that our method outperforms non-regularized baselines, yielding up to 3% improvement in sequence recovery and up to 45% improvement in protein diversity while maintaining structural consistency in generated sequences. Anonymized code for this work is available at https://anonymous.4open.science/r/AFDistill-28C3
Igor Melnyk · Aurelie Lozano · Payel Das · Vijil Chenthamarakshan
Binding Oracle: Fine-Tuning From Stability to Binding Free Energy (Poster)
The ability to predict changes in binding free energy (ddG binding) for mutations at protein-protein interfaces (PPIs) is critical for understanding genetic diseases and engineering novel protein-based therapeutics. Here, we present Binding Oracle: a structure-based graph transformer for predicting ddG binding at PPIs. Binding Oracle fine-tunes Stability Oracle with Selective LoRA, a technique that synergizes layer selection via gradient norms with LoRA. Selective LoRA enables the identification and fine-tuning of the layers most critical for the downstream task, thus regularizing against overfitting. Additionally, we present new training-test splits of mutational data from the SKEMPI2.0, Ab-Bind, and NABE databases that use a strict 30% sequence similarity threshold to avoid data leakage during model evaluation. Binding Oracle, when trained with the Thermodynamic Permutations data augmentation technique, achieves SOTA on S487 without using any evolutionary auxiliary features. Our results empirically demonstrate how sparse fine-tuning techniques, such as Selective LoRA, can enable rapid domain adaptation in protein machine learning frameworks.
Chengyue Gong · Adam Klivans · Jordan Wells · James Loy · Qiang Liu · Alex Dimakis · Daniel Diaz
Scalable Multimer Structure Prediction using Diffusion Models (Poster)
Accurate protein complex structure modeling is a necessary step in understanding the behavior of biological pathways and cellular systems. While some works have attempted to address this challenge, there is still a need for scaling existing methods to larger protein complexes. To address this need, we propose a novel diffusion generative model (DGM) that predicts large multimeric protein structures by learning to rigidly dock its chains together. Additionally, we construct a new dataset specifically for large protein complexes used to train and evaluate our DGM. We substantially improve prediction runtime and completion rates while maintaining competitive accuracy with current methods.
Peter Pao-Huang · Bowen Jing · Dr. Bonnie Berger
Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular Property Prediction (Poster)
Few-shot learning is a promising approach to molecular property prediction as supervised data is often very limited. However, many important molecular properties depend on complex molecular characteristics — such as the various 3D geometries a molecule may adopt or the types of chemical interactions it can form — that are not explicitly encoded in the feature space and must be approximated from limited data. Learning these characteristics can be difficult, especially for few-shot learning algorithms that are designed for fast adaptation to new tasks. In this work, we develop molecular embeddings that encode complex molecular characteristics to improve the performance of few-shot molecular property prediction. Our approach leverages large amounts of synthetic data, namely the results of molecular docking calculations, and a multi-task learning paradigm to structure the embedding space. The embeddings improve few-shot learning performance using Multi-Task, MAML, and Prototypical Networks on multiple molecular property prediction benchmarks.
Christopher Fifty · Joseph M Paggi · Ehsan Amid · Jure Leskovec · Ron Dror
Molecular Diffusion Models with Virtual Receptors (Poster)
Machine learning approaches to Structure-Based Drug Design (SBDD) have proven quite fertile over the last few years. In particular, diffusion-based approaches to SBDD have shown great promise. We present a technique which expands on this diffusion approach in two crucial ways. First, we address the size disparity between the drug molecule and the target/receptor, which makes learning more challenging and inference slower. We do so through the notion of a Virtual Receptor, which is a compressed version of the receptor; it is learned so as to preserve key aspects of the structural information of the original receptor, while respecting the relevant group equivariance. Second, we incorporate a protein language embedding used originally in the context of protein folding. We experimentally demonstrate the contributions of both the virtual receptors and the protein embeddings: in practice, they lead to both better performance, as well as significantly faster computations.
Matan Halfon · Eyal Rozenberg · Ehud Rivlin · Daniel Freedman
CESPED: a new benchmark for supervised particle pose estimation in Cryo-EM (Poster)
Cryo-EM is a powerful tool for understanding macromolecular structures, yet current methods for structure reconstruction are slow and computationally demanding. To accelerate research on pose estimation, we present CESPED, a new dataset specifically designed for Supervised Pose Estimation in Cryo-EM. Alongside CESPED, we provide a PyTorch package to simplify Cryo-EM data handling and model evaluation. We evaluate the performance of a baseline model, Image2Sphere, on CESPED, showing promising results but also highlighting the need for further advancements in this area.
Ruben Sanchez Garcia · Michael Saur · Javier Vargas · Carl Poelking · Charlotte Deane
Learning Scalar Fields for Molecular Docking with Fast Fourier Transforms (Poster)
Molecular docking is critical to structure-based virtual screening, yet the throughput of such workflows is limited by the expensive optimization of scoring functions involved in most docking algorithms. We explore how machine learning can accelerate this process by learning a scoring function with a functional form that allows for more rapid optimization. Specifically, we define the scoring function to be the cross-correlation of multi-channel ligand and protein scalar fields parameterized by equivariant graph neural networks, enabling rapid optimization over rigid-body degrees of freedom with fast Fourier transforms. Moreover, the runtime of our approach can be amortized at several levels of abstraction, and is particularly favorable for virtual screening settings with a common binding pocket. We benchmark our scoring functions on two simplified docking-related tasks: decoy pose scoring and rigid conformer docking. Our method attains similar but faster performance on crystal structures compared to the Vina and Gnina scoring functions, and is more robust on computationally predicted structures.
Bowen Jing · Tommi Jaakkola · Dr. Bonnie Berger
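The FFT trick at the heart of this abstract can be sketched in a few lines: the cross-correlation of two scalar fields over every rigid translation collapses to a single elementwise product in Fourier space. The sketch below uses single-channel toy fields (the paper's fields are multi-channel and produced by equivariant GNNs, and rotations are handled separately); all array names are illustrative.

```python
import numpy as np

def fft_translation_scores(protein_field, ligand_field):
    """Score every rigid translation of the ligand field against the
    protein field at once via the cross-correlation theorem:
    C[t] = sum_x ligand[x] * protein[x + t] = IFFT(conj(FFT(L)) * FFT(P))."""
    P = np.fft.fftn(protein_field)
    L = np.fft.fftn(ligand_field)
    return np.fft.ifftn(np.conj(L) * P).real

# Toy example: a 3D "protein" field with one hot spot and a ligand field
# carrying the matching feature at its origin; the best-scoring shift
# should be the translation that aligns the two.
protein = np.zeros((16, 16, 16))
protein[4, 5, 6] = 1.0
ligand = np.zeros((16, 16, 16))
ligand[0, 0, 0] = 1.0
scores = fft_translation_scores(protein, ligand)
best = np.unravel_index(np.argmax(scores), scores.shape)
```

One FFT per field replaces an explicit loop over all 16³ translations, which is exactly why this functional form is cheap to optimize over rigid-body degrees of freedom.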
VN-EGNN: Equivariant Graph Neural Networks with Virtual Nodes Enhance Protein Binding Site Identification (Poster)
Being able to identify regions within or around proteins, to which ligands can potentially bind, is an essential step to develop new drugs. Binding site identification methods can now profit from the availability of large amounts of 3D structures in protein structure databases or from AlphaFold predictions. Current binding site identification methods rely on geometric deep learning, which takes geometric invariances and equivariances into account. Such methods turned out to be very beneficial for physics-related tasks like binding energy or motion trajectory prediction. However, their performance at binding site identification is still limited, which might be due to limited expressivity or oversquashing effects of E(n)-Equivariant Graph Neural Networks (EGNNs). Here, we extend EGNNs by adding virtual nodes and applying an extended message passing scheme. The virtual nodes in these graphs both improve the predictive performance and can also learn to represent binding sites. In our experiments, we show that VN-EGNN sets a new state of the art at binding site identification on three common benchmarks, COACH420, HOLO4K, and PDBbind2020.
Florian Sestak · Lisa Schneckenreiter · Sepp Hochreiter · Andreas Mayr · Günter Klambauer
Enhancing Ligand Pose Sampling for Machine Learning–Based Docking (Poster)
Deep learning promises to dramatically improve scoring functions for molecular docking, leading to substantial advances in binding pose prediction and virtual screening. To train scoring functions—and to perform molecular docking—one must generate a set of candidate ligand binding poses. Unfortunately, the sampling protocols currently used to generate candidate poses frequently fail to produce any poses close to the correct, experimentally determined pose, unless information about the correct pose is provided. This limits the accuracy of learned scoring functions and molecular docking. Here, we describe several improved protocols for pose sampling: GLOW (auGmented sampLing with sOftened vdW potential) and a novel technique named IVES (IteratiVe Ensemble Sampling). Our benchmarking results demonstrate the effectiveness of our methods in improving the likelihood of sampling accurate poses, especially for binding pockets whose shape changes substantially when different ligands bind. This improvement is observed across both experimentally determined and AlphaFold-generated protein structures. Additionally, we present datasets of candidate ligand poses generated using our methods for each of around 5,000 protein-ligand cross-docking pairs, for training and testing scoring functions. To benefit the research community, we provide an open-source Python implementation of GLOW and IVES and the newly created cross-docking datasets at https://github.com/drorlab/GLOW-IVES.
Patricia Suriana · Ron Dror
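The abstract names GLOW's key ingredient, a softened van der Waals potential, without giving its functional form. The general idea can be illustrated with a soft-core Lennard-Jones variant, a common softening scheme in molecular simulation; the parameters and the specific form below are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def lj(r, epsilon=1.0, sigma=1.0):
    """Standard 12-6 Lennard-Jones potential; the repulsive wall
    diverges as r -> 0, so clashing poses score as impossible."""
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 ** 2 - sr6)

def soft_lj(r, epsilon=1.0, sigma=1.0, alpha=0.5):
    """Soft-core variant: padding r^6 with a constant keeps the
    repulsive wall finite, so sampling can pass through mild clashes
    instead of rejecting those poses outright."""
    sr6 = sigma ** 6 / (alpha * sigma ** 6 + r ** 6)
    return 4.0 * epsilon * (sr6 ** 2 - sr6)

r_clash = 0.3                 # an atom pair well inside clash distance
hard = lj(r_clash)            # astronomically large penalty
soft = soft_lj(r_clash)       # finite, modest penalty
```

At clash distances the softened potential stays finite while the standard one explodes; at normal contact distances the two agree closely, so softening mainly changes which near-clashing candidate poses survive sampling.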
Improved encoding of ensembles in PDBx/mmCIF (Poster)
In their folded state, biomolecules exchange between multiple conformational states, crucial for their function. However, most structural models derived from experiments and computational predictions only encode a single state. To represent biomolecules more accurately, we must move towards modeling and predicting structural ensembles. Information about structural ensembles exists within experimental data from X-ray crystallography and cryo-electron microscopy (cryoEM). While new tools are available to detect conformational and compositional heterogeneity that exist within these ensembles, the legacy PDB data structure does not robustly encapsulate this complexity. We propose modifications to the Macromolecular Crystallographic Information File (mmCIF) format to improve the representation and interrelation of conformational and compositional heterogeneity. These modifications will enable improved tools to capture macromolecular ensembles in a way that is human and machine interpretable, potentially catalyzing breakthroughs for ensemble-function predictions, analogous to AlphaFold's achievements with single structure prediction.
Stephanie Wankowicz · James Fraser
AlphaFold Meets Flow Matching for Generating Protein Ensembles (Poster)
The significant success of AlphaFold2 at protein structure prediction has pointed to structural ensembles as the next frontier towards a more complete computational understanding of protein structure. At the same time, iterative refinement-based techniques such as diffusion have driven significant breakthroughs in generative modeling. We explore the synergy of these developments by combining highly accurate protein structure prediction models with flow matching, a powerful modern generative modeling framework, in order to sample the conformational landscape of proteins. Preliminary results on membrane transporters, ligand-induced conformational change, and disordered ensembles show the potential of the approach. Importantly, and unlike MSA-based methods, our method also obtains similar distributions even when used with language model-based algorithms such as ESMFold, which are otherwise deterministic given an input sequence. These results open exciting avenues in the computational prediction of conformational flexibility.
Bowen Jing · Dr. Bonnie Berger · Tommi Jaakkola
AlphaFold Meets Flow Matching for Generating Protein Ensembles (Oral; abstract as in the poster entry above)
Bowen Jing · Dr. Bonnie Berger · Tommi Jaakkola
The Discovery of Binding Modes Requires Rethinking Docking Generalization (Poster)
Accurate blind docking has the potential to lead to new biological breakthroughs, but for this promise to be realized, it is critical that docking methods generalize well across the proteome. However, existing benchmarks fail to rigorously assess generalizability. Therefore, we develop DockGen, a new benchmark based on the ligand-binding domains of proteins, and we show that machine learning-based docking models have very weak generalization abilities even when combined with various data augmentation strategies. Instead, we propose Confidence Bootstrapping, a new training paradigm that solely relies on the interaction between a diffusion and a confidence model. Unlike previous self-training methods from other domains, we directly exploit the multi-resolution generation process of diffusion models using rollouts and confidence scores to reduce the generalization gap. We demonstrate that Confidence Bootstrapping significantly improves the ability of ML-based docking methods to dock to unseen protein classes.
Gabriele Corso · Arthur Deng · Nicholas Polizzi · Regina Barzilay · Tommi Jaakkola
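The Confidence Bootstrapping loop described in this abstract can be sketched as a short self-training skeleton. Everything below is a hypothetical stand-in: `diffusion_rollout` and `confidence` are random stubs for the trained diffusion and confidence models, and the threshold and round count are arbitrary; the real method fine-tunes the diffusion model between rounds.

```python
import random

random.seed(0)  # make the toy run reproducible

def diffusion_rollout(target):
    """Stub for the diffusion docking model: emit candidate poses for a
    target, each tagged with a number we will treat as its confidence."""
    return [("pose", target, random.random()) for _ in range(8)]

def confidence(pose):
    """Stub for the learned confidence model."""
    return pose[2]

def bootstrap(targets, threshold=0.8, rounds=3):
    """Roll out poses on unseen targets, keep only those the confidence
    model trusts, and accumulate them as a fine-tuning set. In the real
    method the diffusion model is retrained on this set each round,
    shrinking the generalization gap to unseen protein classes."""
    finetune_set = []
    for _ in range(rounds):
        for target in targets:
            finetune_set += [p for p in diffusion_rollout(target)
                             if confidence(p) >= threshold]
    return finetune_set

kept = bootstrap(["unseen_protein_class"])
```

The point of the sketch is the feedback structure: no ground-truth poses appear anywhere; the confidence model alone decides which rollouts become training signal.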
Conformational sampling and interpolation using language-based protein folding neural networks (Poster)
Protein language models (PLMs), such as ESM2, learn a rich semantic grammar of the protein sequence space. When coupled to protein folding neural networks (e.g., ESMFold), they can facilitate the prediction of tertiary and quaternary protein structures at high accuracy. However, they are limited to modeling protein structures in single states. This manuscript demonstrates that ESMFold can predict alternate conformations of some proteins, including de novo designed proteins. Randomly masking the sequence prior to PLM input returned alternate embeddings that ESMFold sometimes mapped to distinct physiologically relevant conformations. From there, inversion of the ESMFold trunk facilitated the generation of high-confidence interconversion paths between the two states. These paths provide a deeper glimpse of how language-based protein folding neural networks derive structural information from high-dimensional sequence representations, while exposing limitations in their general understanding of protein structure and folding.
Diego del Alamo · Jeliazko Jeliazkov · Daphne Truan · Joel Karpiak
FLIGHTED: Inferring Fitness Landscapes from Noisy High-Throughput Experimental Data (Poster)
Machine learning (ML) for protein design frequently requires large datasets of protein fitness generated by high-throughput experiments, and many ML models use these datasets for training, fine-tuning, and benchmarking. However, these approaches do not account for underlying experimental noise, potentially making their conclusions inaccurate. In this work, we present FLIGHTED (Fitness Landscape Inference Generated by High-Throughput Experimental Data), a Bayesian method for generating fitness landscapes with calibrated errors from noisy high-throughput experimental data. We apply FLIGHTED to datasets generated by single-step enrichment-based selection assays such as Fluorescence-Activated Cell Sorting (FACS) and phage display and to data from a novel high-throughput assay DHARMA (direct high-throughput activity recording and measurement assay) that ties fitness to base editing activity. Our results suggest that de-noising single-step selection data generates well-calibrated predictions that are sufficient to change which models perform best in benchmarking studies. Applying FLIGHTED to DHARMA provides more accurate fitness measurements with better calibrated errors; FLIGHTED-DHARMA can be used to generate large protein fitness datasets with up to 10^6 variants. FLIGHTED can be used on any high-throughput assay and makes it easy for ML scientists to account for experimental noise when modeling protein fitness.
Vikram Sundar · Boqiang Tu · Lindsey Guan · Kevin Esvelt
Contrasting Sequence with Structure: Pre-training Graph Representations with PLMs (Poster)
Understanding protein function is vital for drug discovery, disease diagnosis, and protein engineering. While Protein Language Models (PLMs) pre-trained on vast protein sequence datasets have achieved remarkable success, equivalent Protein Structure Models (PSMs) remain underrepresented. We attribute this to the relative lack of high-confidence structural data and suitable pre-training objectives. In this context, we introduce BioCLIP, a contrastive learning framework that pre-trains PSMs by leveraging PLMs, generating meaningful per-residue and per-chain structural representations. When evaluated on tasks such as protein-protein interaction, Gene Ontology annotation, and Enzyme Commission number prediction, BioCLIP-trained PSMs consistently outperform models trained from scratch and further enhance performance when merged with sequence embeddings. Notably, BioCLIP approaches, or exceeds, specialized methods across all benchmarks using its singular pre-trained design. Our work addresses the challenges of obtaining quality structural data and designing self-supervised objectives, setting the stage for more comprehensive models of protein function. Source code is publicly available.
Louis Robinson · Timothy Atkinson · Liviu Copoiu · Patrick Bordes · Thomas PIERROT · Thomas Barrett
Target-Aware Variational Auto-Encoders for Ligand Generation with Multi-Modal Protein Modeling (Poster)
Without knowledge of specific pockets, generating ligands based on the global structure of a protein target plays a crucial role in drug discovery as it helps reduce the search space for potential drug-like candidates in the pipeline. However, contemporary methods require optimizing tailored networks for each protein, which is arduous and costly. To address this issue, we introduce TargetVAE, a target-aware variational auto-encoder that generates ligands with high binding affinities to arbitrary protein targets, guided by a novel prior network that learns from entire protein structures. We showcase the superiority of our approach by conducting extensive experiments and evaluations, including the assessment of generative model quality, ligand generation for unseen targets, docking score computation, and binding affinity prediction. Empirical results demonstrate the promising performance of our proposed approach. Our source code in PyTorch is publicly available at https://github.com/HySonLab/Ligand_Generation
Khang Ngo · Truong Son Hy
DSMBind: an unsupervised generative modeling framework for binding energy prediction (Poster)
Predicting the binding between proteins and other molecules is a core question in biology. Geometric deep learning is a promising paradigm for protein-ligand or protein-protein binding energy prediction, but its accuracy is limited by the size of training data as high-throughput binding assays are expensive. Unsupervised learning, such as protein language models, is particularly useful in this setting because it does not need experimental binding energy data for training. In this work, we propose DSMBind, a new generative modeling framework for protein complex structures, and show that the likelihood of crystal structures is highly correlated with their binding energy. Specifically, DSMBind learns an energy-based model from a training set of unlabeled crystal structures via SE(3) denoising score matching (DSM), where we perturb a protein complex via random rotation of backbone and side-chains. We find the learned energy is highly correlated with experimental binding affinity across multiple benchmarks, including protein-ligand binding, antibody-antigen binding, and protein-protein binding mutation effect prediction. DSMBind not only outperforms unsupervised learning methods based on protein language models or inverse folding, but also matches the performance of state-of-the-art supervised models trained on experimental binding data.
Wengong Jin · Caroline Uhler · Nir HaCohen
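The corruption step of the SE(3) denoising scheme, perturbing a structure by a random rigid rotation about its center of mass, can be sketched with NumPy. The energy model and score-matching loss are omitted, and the rotation-sampling recipe below (QR decomposition of a Gaussian matrix) is one standard choice, not necessarily the authors' implementation.

```python
import numpy as np

def random_rotation(rng):
    """Draw a random 3D rotation matrix: QR-decompose a Gaussian matrix,
    fix the signs so diag(R) > 0, then flip a column if needed so that
    det(Q) = +1 (a proper rotation, no reflection)."""
    Q, R = np.linalg.qr(rng.normal(size=(3, 3)))
    Q *= np.sign(np.diag(R))
    if np.linalg.det(Q) < 0:
        Q[:, 0] *= -1
    return Q

def perturb_rigid(coords, rng):
    """DSM-style corruption: rotate a chain rigidly about its center of
    mass, leaving all internal geometry (bond lengths, angles) intact."""
    center = coords.mean(axis=0)
    return (coords - center) @ random_rotation(rng).T + center

rng = np.random.default_rng(0)
chain = rng.normal(size=(10, 3))   # toy backbone coordinates
noisy = perturb_rigid(chain, rng)  # same shape, rigidly rotated
```

Training then asks an energy model to point from `noisy` back towards `chain`; because the corruption is rigid, the model must learn inter-chain packing rather than internal geometry.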
DSMBind: an unsupervised generative modeling framework for binding energy prediction (Oral; abstract as in the poster entry above)
Wengong Jin · Caroline Uhler · Nir HaCohen
-
|
Fast non-autoregressive inverse folding with discrete diffusion
(
Poster
)
>
Generating protein sequences that fold into an intended 3D structure is a fundamental step in de novo protein design. De facto methods utilize autoregressive generation, but this eschews higher-order interactions that could be exploited to improve inference speed. We describe a non-autoregressive alternative that performs inference using a constant number of calls, resulting in a 23-fold speed-up without a loss in performance on the CATH benchmark. Conditioned on the 3D structure, we fine-tune ProteinMPNN to perform discrete diffusion with a purity prior over the index sampling order. Our approach offers the flexibility to trade off inference speed and accuracy by modulating the diffusion speed. |
John Yang · Jason Yim · Tommi Jaakkola · Regina Barzilay 🔗 |
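The purity-ordered, non-autoregressive sampling idea can be sketched as follows: all positions start masked, and at each round the positions whose predicted distributions are most confident ("purest") are committed in parallel. This toy uses random logits in place of a structure-conditioned network; `purity_decode` and `toy_logits` are hypothetical names, not the paper's code.

```python
import numpy as np

AA = list("ACDEFGHIKLMNPQRSTVWY")

def toy_logits(seq, rng):
    """Stand-in for a structure-conditioned network's per-position logits."""
    return rng.normal(size=(len(seq), len(AA)))

def purity_decode(length: int, steps: int, seed: int = 0) -> str:
    """Fill all positions in ~`steps` parallel rounds, committing the
    highest max-probability ("purest") positions first."""
    rng = np.random.default_rng(seed)
    seq = ["_"] * length
    per_round = int(np.ceil(length / steps))
    while "_" in seq:
        probs = np.exp(toy_logits(seq, rng))
        probs /= probs.sum(-1, keepdims=True)
        purity = probs.max(-1)
        # already-committed positions are never revisited
        purity[[i for i, a in enumerate(seq) if a != "_"]] = -np.inf
        for i in np.argsort(-purity)[:per_round]:
            if seq[i] == "_":
                seq[i] = AA[int(probs[i].argmax())]
    return "".join(seq)

designed = purity_decode(length=16, steps=4)
```

The number of network calls is fixed by `steps` rather than growing with sequence length, which is the source of the speed-up the abstract describes.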
-
|
TopoDiff: Improve Protein Backbone Generation with Topology-aware Latent Encoding
(
Poster
)
>
The de novo design of protein structures is an intriguing research topic in the field of protein engineering. Recent breakthroughs in diffusion-based generative models have demonstrated substantial promise in generating diverse and realistic protein structures. Nevertheless, while existing models focus either on unconditional generation or on fine-grained conditioning at the residue level, a holistic, top-down approach to controlling the overall topological arrangement is still lacking. In response, we introduce TopoDiff, a diffusion-based framework augmented by a topology encoding module, which learns, without supervision, a compact latent representation of natural protein topologies with interpretable characteristics, and simultaneously harnesses this learned information for controllable protein structure generation. We also propose a novel metric specifically designed to assess the coverage of sampled proteins with respect to the natural protein space. In comparative analyses with existing models, our generative model not only demonstrates comparable performance on established metrics but also exhibits better coverage across the recognized topology landscape. In summary, TopoDiff emerges as a novel solution for enhancing the controllability and comprehensiveness of de novo protein structure generation, presenting new possibilities for innovative applications in protein engineering and beyond. |
Yuyang Zhang · Zinnia Ma · Haipeng Gong 🔗 |
-
|
Harmonic Prior Self-conditioned Flow Matching for Multi-Ligand Docking and Binding Site Design
(
Poster
)
>
Many protein functions, including enzymatic catalysis, require binding small molecules. As such, designing binding pockets for small molecules has several impactful applications ranging from drug synthesis to energy storage. Towards this goal, we first develop HarmonicFlow, an improved generative process over 3D protein-ligand binding structures based on our self-conditioned flow matching objective. FlowSite extends this flow model to jointly generate a protein pocket's discrete residue types and the molecule's binding 3D structure. We show that HarmonicFlow improves upon state-of-the-art generative processes for docking in simplicity, generality, and performance. Enabled by this structure model, FlowSite designs binding sites substantially better than baseline approaches and provides the first general solution for binding site design. |
Hannes Stärk · Bowen Jing · Regina Barzilay · Tommi Jaakkola 🔗 |
-
|
CrysFormer: Protein Crystallography Prediction via 3d Patterson Maps and Partial Structure Attention
(
Poster
)
>
Determining the structure of a protein has been a decades-long open question. Computing a protein's three-dimensional structure with classical simulation algorithms often incurs nontrivial costs. Advances in the transformer neural network architecture have achieved significant improvements on this problem by learning from large datasets of sequence information and corresponding protein structures. Yet such methods often focus only on sequence information; other available prior knowledge, such as protein crystallography data and the partial structures of amino acids, could potentially be utilized. To the best of our knowledge, we propose the first transformer-based model that directly utilizes protein crystallography and partial structure information to predict the electron density maps of proteins. Via two new datasets of peptide fragments (2-residue and 15-residue), we demonstrate that our method, dubbed CrysFormer, achieves accurate predictions from a much smaller dataset and at reduced computational cost. |
Chen Dun · Tom Pan · Shikai Jin · Ria Stevens · Mitchell D. Miller · George Phillips · Anastasios Kyrillidis 🔗 |
-
|
PoseCheck: Generative Models for 3D Structure-based Drug Design Produce Unrealistic Poses
(
Poster
)
>
Deep generative models for structure-based drug design (SBDD), where molecule generation is conditioned on a 3D protein pocket, have received considerable interest in recent years. These methods offer the promise of higher-quality molecule generation by explicitly modelling the 3D interaction between a potential drug and a protein receptor. However, previous work has primarily focused on the quality of the generated molecules themselves, with limited evaluation of the 3D poses that these methods produce; most work simply discards the generated pose and reports only a "corrected" pose after redocking with traditional methods. Little is known about whether generated molecules satisfy known physical constraints for binding and the extent to which redocking alters the generated interactions. We introduce PoseCheck, an extensive analysis of multiple state-of-the-art methods, and find that generated molecules have significantly more physical violations and fewer key interactions compared to baselines, calling into question the implicit assumption that providing rich 3D structure information improves molecule complementarity. We make recommendations for future research tackling the identified failure modes and hope our benchmark will serve as a springboard for future SBDD generative modelling work to have a real-world impact. Our evaluation suite is easy to use in future 3D SBDD work and is available at https://anonymous.4open.science/r/posecheck-358E. |
Charles Harris · Kieran Didi · Arian Jamasb · Chaitanya Joshi · Simon Mathis · Pietro Lió · Tom Blundell 🔗 |
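One class of physical violations discussed above, steric clashes between a generated ligand pose and the protein, can be checked with a simple distance criterion. This is an illustrative sketch with an assumed clash definition (overlap of van der Waals radii beyond a tolerance), not PoseCheck's exact metric.

```python
import numpy as np

def count_clashes(lig_xyz, prot_xyz, lig_radii, prot_radii, tolerance=0.5):
    """Count ligand-protein atom pairs whose distance falls below the sum of
    their van der Waals radii minus a tolerance (all values in Angstroms)."""
    d = np.linalg.norm(lig_xyz[:, None, :] - prot_xyz[None, :, :], axis=-1)
    cutoff = lig_radii[:, None] + prot_radii[None, :] - tolerance
    return int((d < cutoff).sum())

# Toy example: a carbon 1.0 A from a protein carbon clearly clashes;
# 1.7 A is the approximate vdW radius of carbon.
lig = np.array([[0.0, 0.0, 0.0]])
prot = np.array([[1.0, 0.0, 0.0], [5.0, 0.0, 0.0]])
n_clashes = count_clashes(lig, prot, np.array([1.7]), np.array([1.7, 1.7]))
```

Running such a check on poses both before and after redocking is one way to quantify how much redocking alters the generated interactions.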
-
|
PoseCheck: Generative Models for 3D Structure-based Drug Design Produce Unrealistic Poses
(
Oral
)
>
Charles Harris · Kieran Didi · Arian Jamasb · Chaitanya Joshi · Simon Mathis · Pietro Lió · Tom Blundell 🔗 |
-
|
Sampling Protein Language Models for Functional Protein Design
(
Poster
)
>
Protein language models have emerged as powerful ways to learn complex representations of proteins, thereby improving their performance on several downstream tasks, from structure prediction to fitness prediction, property prediction, homology detection, and more. By learning a distribution over protein sequences, they are also very promising tools for designing novel and functional proteins, with broad applications in healthcare, new materials, and sustainability. Given the vastness of the corresponding sample space, efficient exploration methods are critical to the success of protein engineering efforts. However, the methodologies for adequately sampling these models to achieve core protein design objectives remain underexplored and have predominantly leaned on techniques developed for Natural Language Processing. In this work, we first develop a holistic in silico protein design evaluation framework to comprehensively compare different sampling methods. After performing a thorough review of sampling methods for language models, we introduce several sampling strategies tailored to protein design. Lastly, we compare the various strategies on our in silico benchmark, investigating the effects of key hyperparameters and providing practical guidance on the relative strengths of different methods. |
Jeremie Theddy Darmawan · Yarin Gal · Pascal Notin 🔗 |
-
|
A framework for conditional diffusion modelling with applications in protein design
(
Poster
)
>
Many protein design applications, such as binder or enzyme design, require scaffolding a structural motif with high precision. Generative modelling paradigms based on denoising diffusion processes have emerged as a leading candidate for this motif scaffolding problem and have shown early experimental success in some cases. In the diffusion paradigm, motif scaffolding is treated as a conditional generation task, and several conditional generation protocols have been proposed or imported from the Computer Vision literature. However, most of these protocols are motivated heuristically, e.g. via analogies to Langevin dynamics, and lack a unifying framework, obscuring connections between the different approaches. In this work, we unify conditional training and conditional sampling procedures under one common framework based on the mathematically well-understood Doob's h-transform. This new perspective allows us to draw connections between existing methods and to propose a new conditional training protocol. We illustrate the effectiveness of this new protocol in both image outpainting and motif scaffolding, and find that it outperforms standard methods. |
Kieran Didi · Francisco Vargas · Simon Mathis · Vincent Dutordoir · Emile Mathieu · Urszula Julia Komorowska · Pietro Lió 🔗 |
-
|
DiffRNAFold: Generating RNA Tertiary Structures with Latent Space Diffusion
(
Poster
)
>
RNA molecules provide an exciting frontier for novel therapeutics. Accurate determination of RNA structure could accelerate development of therapeutics through an improved understanding of function. However, the extremely large conformation space has kept the RNA 3D structure space largely unresolved. Using recent advances in generative modeling, we propose DiffRNAFold, a latent space diffusion model for RNA tertiary structure design. Our preliminary results suggest that DiffRNAFold generated molecules are similar in 3D space to true RNA molecules, providing an important first step towards accurate structure and function prediction in vivo. |
Mihir Bafna · Vikranth Keerthipati · Subhash Kanaparthi · Ruochi Zhang 🔗 |
-
|
Pair-EGRET: Enhancing the prediction of protein-protein interaction sites through graph attention networks and protein language models
(
Poster
)
>
Proteins are responsible for most biological functions, many of which require the interaction of more than one protein molecule. However, predicting protein-protein interaction (PPI) sites (the interfacial residues of a protein that interact with other protein molecules) remains a challenge. The growing demand and cost associated with the reliable identification of PPI sites using conventional experimental methods call for computational tools for automated prediction and understanding of PPIs. Here, we present Pair-EGRET, an edge-aggregated graph attention network that leverages the features extracted from pre-trained transformer-like models to accurately predict pairwise protein-protein interaction sites. Pair-EGRET works on a k-nearest neighbor graph, representing the three-dimensional structure of a protein, and utilizes the cross-attention mechanism on top of a siamese network to accurately identify interfacial residues of a pair of proteins. Through an extensive evaluation study using a diverse array of experimental data, evaluation metrics, and case studies on representative protein sequences, we find that our method outperforms other state-of-the-art methods for predicting PPI sites. Moreover, Pair-EGRET can provide interpretable insights from the learned cross-attention matrix. Pair-EGRET is freely available at https://github.com/1705004/Pair-EGRET. |
Ramisa Alam · Sazan Mahbub · Md. Shamsuzzoha Bayzid 🔗 |
-
|
FlexiDock: Compositional diffusion models for flexible molecular docking
(
Poster
)
>
Molecular docking is a critical process in structure-based drug discovery for predicting the binding conformations between a protein and a small-molecule ligand. Recently, deep learning-based methods have achieved promising performance over traditional physics-based search-and-score methods. Despite their success at accurately predicting the binding poses of small-molecule ligands, modeling of protein flexibility and dynamics remains largely unexplored for docking. We observe that models that do not account for protein flexibility suffer a large performance drop in cases where proteins undergo large conformational changes upon ligand binding. To address this gap, we developed FlexiDock, a compositional alternating neural diffusion process, which includes two diffusion models to explicitly model the conformational flexibility of proteins and ligands, respectively. The compositional diffusion process is inspired by the induced-fit model in flexible docking. We find that compositional diffusion improves the structural prediction of proteins upon ligand binding. Our method also offers promising insights into modeling proteins' conformational switches. |
Zichen Wang · Balasubramaniam Srinivasan · Zhengyuan Shen · George Karypis · Huzefa Rangwala 🔗 |
-
|
In vitro validated antibody design against multiple therapeutic antigens using generative inverse folding
(
Poster
)
>
Deep learning approaches have demonstrated the ability to design protein sequences given backbone structures. While these approaches have been applied in silico to designing antibody complementarity-determining regions (CDRs), they have yet to be validated in vitro for designing antibody binders, which is the true measure of success for antibody design. Here we describe IgDesign, a deep learning method for antibody CDR design, and demonstrate its robustness with successful binder design for 8 therapeutic antigens. The model is tasked with designing heavy chain CDR3 (HCDR3) or all three heavy chain CDRs (HCDR123) using native backbone structures of antibody-antigen complexes, along with the antigen and antibody framework (FWR) sequences as context. For each of the 8 antigens, we design 100 HCDR3s and 100 HCDR123s, scaffold them into the native antibody's variable region, and screen them for binding against the antigen using surface plasmon resonance (SPR). As a baseline, we screen 100 HCDR3s taken from the model's training set and paired with the native HCDR1 and HCDR2. We observe that both HCDR3 design and HCDR123 design outperform this HCDR3-only baseline. IgDesign is the first experimentally validated antibody inverse folding model. It can design antibody binders to multiple therapeutic antigens with high success rates and, in some cases, improved affinities over clinically validated reference antibodies. Antibody inverse folding has applications to both de novo antibody design and lead optimization, making IgDesign a valuable tool for accelerating drug development and enabling therapeutic design. |
Amir Shanehsazzadeh 🔗 |
-
|
Evaluating Zero-Shot Scoring for In Vitro Antibody Binding Prediction with Experimental Validation
(
Poster
)
>
The success of therapeutic antibodies relies on their ability to selectively bind antigens. AI-based antibody design protocols have shown promise in generating epitope-specific designs. Many of these protocols use an inverse folding step to generate diverse sequences given a backbone structure. Due to prohibitive screening costs, it is key to identify candidate sequences likely to bind in vitro. Here, we compare the efficacy of 8 common scoring paradigms based on open-source models to classify antibody designs as binders or non-binders. We evaluate these approaches on a novel surface plasmon resonance (SPR) dataset spanning 5 antigens. Our results show that existing methods struggle to detect binders, and performance is highly variable across antigens. We find that metrics computed on flexibly docked antibody-antigen complexes are more robust, and ensemble scores are more consistent than individual metrics. We provide experimental insight to analyze current scoring techniques, highlighting that the development of robust, zero-shot filters is an important research gap. |
Divya Nori · Simon Mathis · Amir Shanehsazzadeh 🔗 |
-
|
PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design
(
Poster
)
>
Structure-based protein design has attracted increasing interest, with numerous methods being introduced in recent years. However, a universally accepted method for evaluation has not been established, since wet-lab validation can be overly time-consuming for the development of new algorithms, and in silico validation with recovery and perplexity metrics is efficient but may not precisely reflect true foldability. To address this gap, we introduce two novel metrics: a refoldability-based metric, which leverages high-accuracy protein structure prediction models as a proxy for wet-lab experiments, and a stability-based metric, which assesses whether models can assign high likelihoods to experimentally stable proteins. We curate datasets from high-quality CATH protein data as well as high-throughput de novo protein design and mutagenesis experiments, and in doing so, present the PDB-Struct benchmark that evaluates both recent and previously uncompared protein design methods. Experimental results indicate that ByProt, ProteinMPNN, and ESM-IF perform exceptionally well on our benchmark, while ESM-Design and AF-Design fall short on the refoldability metric. We also show that while some methods exhibit high sequence recovery, they do not perform as well on our new benchmark. Our proposed benchmark paves the way for a fair and comprehensive evaluation of protein design methods in the future. The source code will be released upon acceptance. |
Chuanrui WANG · Bozitao Zhong · Zuobai Zhang · Narendra Chaudhary · Sanchit Misra · Jian Tang 🔗 |
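A refoldability-style check compares the designed backbone against the structure predicted for the designed sequence, which requires an optimal rigid superposition before measuring RMSD. The superposition step can be sketched with the Kabsch algorithm; the structure predictor itself (e.g. an ESMFold or AlphaFold call) is omitted, and the toy coordinates below merely stand in for real backbones.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """RMSD between two coordinate sets after optimal rigid superposition
    (Kabsch algorithm): center both, find the best proper rotation via SVD."""
    P = P - P.mean(axis=0)
    Q = Q - Q.mean(axis=0)
    U, _, Vt = np.linalg.svd(P.T @ Q)
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T   # enforce det(R) = +1 (no reflection)
    return float(np.sqrt(((P @ R.T - Q) ** 2).sum() / len(P)))

# A "refolded" structure that is a rotated, translated copy of the target
# should superpose perfectly (RMSD ~ 0).
rng = np.random.default_rng(0)
target = rng.normal(size=(20, 3))
theta = 0.7
rot = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                [np.sin(theta),  np.cos(theta), 0.0],
                [0.0, 0.0, 1.0]])
refolded = target @ rot.T + np.array([1.0, -2.0, 3.0])
rmsd = kabsch_rmsd(target, refolded)
```

A design would then count as "refoldable" if this RMSD (or a TM-score) against the predicted structure clears some threshold; the threshold choice is benchmark-specific.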
-
|
Optimizing protein language models with Sentence Transformers
(
Poster
)
>
Protein language models (pLMs) have appeared in a wide range of in silico protein engineering tasks and have shown impressive results. However, the ways they are applied remain mostly standardised. Here, we introduce a set of fine-tuning techniques based on Sentence Transformers (STs), integrated with a novel data augmentation procedure, and show how they can offer new state-of-the-art performance. Despite having initially been developed in the classic NLP space, STs hold a natural appeal for pLM-related applications, largely due to their use of sequence pairs and triplets. We demonstrate this conceptual approach in two different settings that frequently occur in this domain: a residue-level and a sequence-level prediction task. Apart from showing how these tools can extract more and higher-quality information from pLMs, we discuss the main differences between their applications in NLP and in the protein space. We conclude by discussing the related challenges and provide a comprehensive outlook on potential applications. |
Istvan Redl 🔗 |
-
|
DiffDock-Pocket: Diffusion for Pocket-Level Docking with Sidechain Flexibility
(
Poster
)
>
When a small molecule binds to a protein, the 3D structure of the protein and its function change. Understanding this process, called molecular docking, can be crucial in areas such as drug design. Recent learning-based attempts have shown promising results at this task, yet lack features that traditional approaches support. In this work, we close this gap by proposing DiffDock-Pocket, a diffusion-based docking algorithm that is conditioned on a binding target to predict ligand poses only in a specific binding pocket. On top of this, our model supports receptor flexibility and predicts the position of sidechains close to the binding site. Empirically, we improve the state of the art in site-specific docking on the PDBBind benchmark. Especially when using in silico-generated structures, we achieve more than twice the performance of current methods while being more than 20 times faster than other flexible approaches. Although the model was not trained for cross-docking to different structures, it yields competitive results in this task. |
Michael Plainer · Marcella Toth · Simon Dobers · Hannes Stärk · Gabriele Corso · Céline Marquet · Regina Barzilay 🔗 |
-
|
DiffDock-Pocket: Diffusion for Pocket-Level Docking with Sidechain Flexibility
(
Oral
)
>
Michael Plainer · Marcella Toth · Simon Dobers · Hannes Stärk · Gabriele Corso · Céline Marquet · Regina Barzilay 🔗 |
-
|
Transition Path Sampling with Boltzmann Generator-based MCMC Moves
(
Poster
)
>
Sampling all possible transition paths between two 3D states of a molecular system has various applications ranging from catalyst design to drug discovery. Current approaches to sample transition paths use Markov chain Monte Carlo and rely on time-intensive molecular dynamics simulations to find new paths. Our approach operates in the latent space of a normalizing flow that maps from the molecule's Boltzmann distribution to a Gaussian, where we propose new paths without requiring molecular simulations. Using alanine dipeptide, we explore Metropolis-Hastings acceptance criteria in the latent space for exact sampling and investigate different latent proposal mechanisms. |
Michael Plainer · Hannes Stärk · Charlotte Bunne · Stephan Günnemann 🔗 |
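The latent-space proposal scheme above can be sketched with a generic random-walk Metropolis-Hastings sampler. In the real method the target density is the molecule's Boltzmann distribution pulled back through the normalizing flow; here a standard normal stands in for that latent density, and the function names are illustrative.

```python
import numpy as np

def metropolis_hastings(log_prob, z0, n_steps, step, rng):
    """Random-walk Metropolis-Hastings; returns the chain and acceptance rate."""
    z, lp = z0, log_prob(z0)
    chain, accepted = [z0], 0
    for _ in range(n_steps):
        proposal = z + step * rng.normal(size=z.shape)
        lp_prop = log_prob(proposal)
        # symmetric proposal, so the acceptance ratio is just the density ratio
        if np.log(rng.uniform()) < lp_prop - lp:
            z, lp = proposal, lp_prop
            accepted += 1
        chain.append(z)
    return np.array(chain), accepted / n_steps

# Toy stand-in for the flow's latent density: a standard normal.
log_latent = lambda z: -0.5 * float(np.sum(z ** 2))
rng = np.random.default_rng(0)
chain, acc_rate = metropolis_hastings(log_latent, np.zeros(2), 4000, 0.8, rng)
```

Because proposals are made in the latent space, no molecular dynamics simulation is needed to generate a candidate path; only the accept/reject step consults the (pulled-back) target density.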
-
|
Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein Representation Learning
(
Poster
)
>
Proteins can be represented in various ways, including their sequences, 3D structures, and surfaces. While recent studies have successfully employed sequence- or structure-based representations to address multiple tasks in protein science, there has been significant oversight in incorporating protein surface information, a critical factor for protein function. In this paper, we present a pre-training strategy that incorporates information from protein sequences, 3D structures, and surfaces to improve protein representation learning. Specifically, we utilize Implicit Neural Representations (INRs) for learning surface characteristics, naming the resulting model ProteinINR. We confirm that ProteinINR successfully reconstructs protein surfaces, and we integrate this surface learning into the existing pre-training strategy for sequences and structures. Our results demonstrate that our approach enhances performance on various downstream tasks, underscoring the importance of including surface attributes in protein representation learning. |
Youhan Lee · Hasun Yu · Jaemyung Lee · Jaehoon Kim 🔗 |
-
|
Inpainting Protein Sequence and Structure with ProtFill
(
Poster
)
>
Designing new proteins with specific binding capabilities is a challenging task that has the potential to revolutionize many fields, including medicine and materials science. Here we introduce ProtFill, a unified method for simultaneous protein structure and sequence design. Distinct from most existing computational design frameworks, which focus on either structure or sequence design, our method embraces both representations concurrently. Employing an SE(3)-equivariant diffusion graph neural network, our method excels in both sequence prediction and structure recovery. We demonstrate the model's applicability to interface redesign for antibodies as well as other proteins, underscoring the efficacy of our approach and the potential of the diffusion framework in protein design. The code is available at https://anonymous.4open.science/r/ProtFill-1234/.
|
Elizaveta Kozlova · Daniel Nakhaee-Zadeh Gutierrez · Arthur Valentin 🔗 |
-
|
Investigating Protein-DNA Binding Energetic of Mismatched DNA
(
Poster
)
>
Transcription Factors (TFs) bind to regulatory DNA regions, modulating gene expression. Although various high-throughput techniques have been used to characterize protein binding preferences, this work is the first to extend these studies to non-canonical mismatched bases. The mutagenesis study presented here allows us to determine the binding profile across the double-stranded DNA sequence. Additionally, we leverage deep learning to complete the pairwise interaction map. In this context, we introduce ShapPWM, a motif strategy that marginalizes individual nucleotide contributions by computing Shapley values. Our model reveals that highly synergistic interactions appear between nucleotides in the flanking regions of the contacts. This information offers valuable insights into the binding mechanism and reaction energy without the necessity of solving intricate crystal structures. |
Ruben Solozabal · Tamir Avioz · Yunxiang LI · Le Song · Martin Takac · Ariel Afek 🔗 |
-
|
AntiFold: Improved antibody structure design using inverse folding
(
Poster
)
>
The design and optimization of antibodies, important therapeutic agents, requires an intricate balance across multiple properties. A primary challenge in optimization is ensuring that introduced sequence mutations do not disrupt the antibody structure or target binding mode. Protein inverse folding models, which predict diverse sequences that fold into the same structure, are promising for maintaining structural integrity during optimization. Here we present AntiFold, an inverse folding model developed for solved and predicted antibody structures, based on the ESM-IF1 model. AntiFold achieves large gains in performance versus existing inverse folding models on sequence recovery, across antibody complementarity determining regions and framework regions. AntiFold-generated sequences show high structural agreement between predicted and experimental structures. The tool efficiently samples hundreds of antibody structures per minute, providing a scalable solution for antibody design. AntiFold is freely available for academic use at: https://opig.stats.ox.ac.uk/data/downloads/AntiFold. |
Alissa M Hummer · Magnus H Høie · Tobias Olsen · Morten Nielsen · Charlotte Deane 🔗 |
-
|
Improved B-cell epitope prediction using AlphaFold2 modeling and inverse folding latent representations
(
Poster
)
>
Accurate computational identification of B-cell epitopes is crucial for the development of vaccines, therapies, and diagnostic tools. However, current structure-based prediction methods face limitations due to the dependency on experimentally solved structures. Here, we introduce a markedly improved B-cell epitope prediction tool that innovatively employs inverse folding structure representations and a positive-unlabelled learning strategy, and is explicitly adapted for both solved and predicted structures. Our tool demonstrates a considerable improvement in performance over existing methods, accurately predicting linear and conformational epitopes across multiple independent datasets. Most notably, it maintains high predictive performance across solved, relaxed and predicted structures, alleviating the need for experimental validation and extending the general applicability of accurate B-cell epitope prediction by more than 3 orders of magnitude. |
Paolo Marcatili 🔗 |
-
|
Combining Structure and Sequence for Superior Fitness Prediction
(
Poster
)
>
Deep generative models of protein sequence and inverse folding models have shown great promise as protein design methods. While sequence-based models have shown strong zero-shot mutation effect prediction performance, inverse folding models have not been extensively characterized in this way. As these models use information from protein structures, it is likely that inverse folding models possess inductive biases that make them better predictors of certain function types. Using the collection of model scores contained in the newly updated ProteinGym, we systematically explore the differential zero-shot predictive power of sequence and inverse folding models. We find that inverse folding models consistently outperform the best-in-class sequence models on assays of protein thermostability, but have lower performance on other properties. Motivated by these findings, we develop StructSeq, an ensemble model combining information from sequence, multiple sequence alignments (MSAs), and structure. StructSeq achieves state-of-the-art Spearman correlation on ProteinGym and is robust to different functional assay types. |
Steffanie Paul · Pascal Notin · Aaron Kollasch · Debora Marks 🔗 |
-
|
Epitope-specific antibody design using diffusion models on the latent space of ESM embeddings
(
Poster
)
>
There has been significant progress in protein design using deep learning approaches. The majority of methods predict sequences for a given structure. Recently, diffusion approaches were developed for generating protein backbones. However, de novo design of epitope-specific antibody binders remains an unsolved problem due to the challenge of simultaneously optimizing the antibody sequence, variable loop structures, and antigen binding. Here we present EAGLE (Epitope-specific Antibody Generation using Language model Embeddings), a diffusion-based model that does not require input backbone structures. The full antibody sequence (constant and variable regions) is designed in a continuous space using protein language model embeddings. Similarly to denoising diffusion probabilistic models for image generation that condition sampling on a text prompt, here we condition the sampling of antibody sequences on antigen structure and epitope amino acids. The model is trained on available antibody and antibody-antigen structures, as well as antibody sequences. Our top 100 designs include sequences with 55% identity to known binders for the most variable heavy chain loop. EAGLE's high performance is achieved by tailoring the method specifically to antibody design through the integration of continuous latent-space diffusion and sampling conditioned on antigen structure and epitope amino acids. Our model enables generating a wide range of diverse, unique, variable-loop-length antibody binders from straightforward epitope specifications. |
Tomer Cohen · Dina Schneidman 🔗 |
-
|
Protein language models learn evolutionary statistics of interacting sequence motifs
(
Poster
)
>
Protein language models (pLMs) have emerged as potent tools for predicting protein structures and designing proteins, yet it is unknown to what degree these models actually understand the inherent biophysics of protein structure. Motivated by the discovery that pLMs erroneously predict non-physical structure fragments for protein isoforms, we investigated the nature of the sequence context needed for contact predictions in ESM2 by developing a "categorical Jacobian" approach, which allows a completely unsupervised assessment of the coevolutionary signal stored in models, and by artificially modifying sequences. We found that pLMs make contact predictions conditioned on sequence motifs and the relative linear distance between segment pairs. Our investigation highlights the limitations of current pLMs and underscores the importance of understanding the underlying mechanisms of these models. |
Zhidian Zhang · Hannah Wayment-Steele · Garyk Brixi · Matteo Dal Peraro · Dorothee Kern · Sergey Ovchinnikov 🔗 |
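The "categorical Jacobian" idea above can be illustrated in a few lines: substitute every token at every position, record how the model's logits shift at all other positions, and reduce the resulting fourth-order tensor to a symmetric coupling map. The model interface and the Frobenius-norm/APC reduction below are illustrative assumptions, not the authors' exact implementation:

```python
import numpy as np

def categorical_jacobian(logits_fn, seq, n_tokens=20):
    """J[i, a, j, b] = logits(seq with i -> a)[j, b] - logits(seq)[j, b].

    logits_fn maps an integer sequence (L,) to per-position logits (L, n_tokens);
    a real use would wrap a pLM such as ESM2 here.
    """
    base = logits_fn(seq)                                # (L, n_tokens)
    J = np.zeros((len(seq), n_tokens) + base.shape)
    for i in range(len(seq)):
        for a in range(n_tokens):
            mutant = seq.copy()
            mutant[i] = a
            J[i, a] = logits_fn(mutant) - base
    return J

def contact_map(J):
    """Reduce to an L x L coupling matrix: Frobenius norm over both token
    axes, symmetrized, with average-product correction (APC)."""
    C = np.sqrt((J ** 2).sum(axis=(1, 3)))
    C = 0.5 * (C + C.T)
    apc = C.mean(0, keepdims=True) * C.mean(1, keepdims=True) / C.mean()
    return C - apc
```

With a toy `logits_fn` in which two positions are coupled, the largest off-diagonal entry of the map recovers that pair.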
-
|
Using artificial sequence coevolution to predict disulfide-rich peptide structures with experimental connectivity in AlphaFold
(
Poster
)
>
We present a novel approach for embedding contact information in AlphaFold to predict structures of disulfide-rich peptides (DRPs) with experimentally determined disulfide connectivity. While AlphaFold generates accurate DRP structure predictions in most cases, it sometimes fails to predict the specific connectivity pattern of the multiple disulfide bonds. Here, we take advantage of the principles of sequence coevolution to directly embed specific connectivity patterns within the MSA by mutating highly conserved cysteines in subsets of the MSA. This approach can be used to incorporate experimental disulfide connectivity patterns from mass spectrometry into DRP structure prediction. Lastly, after minimization of the predicted structures by molecular dynamics, we find that predicted DRP structures with native connectivity display more favorable peptide properties than those with non-native connectivities, suggesting our approach may be useful for determining the native connectivity of DRPs from sequence alone. |
Gabriella Gerlach · John Nicoludis 🔗 |
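One plausible reading of the MSA-editing step is sketched below: for each desired disulfide pair, co-mutate both cysteines in a random subset of alignment rows so the pair acquires an artificial covariation signal, while leaving the query sequence untouched. The function name, substitution residue, and subset fraction are illustrative assumptions, not the authors' protocol:

```python
import random

def embed_connectivity(msa, pairs, sub="A", frac=0.3, seed=0):
    """Inject artificial covariation into an MSA: for each cysteine pair
    (i, j) in the target connectivity, co-mutate both positions to `sub`
    in a random subset of non-query rows, so the pair carries a
    coevolution-like signal. Row 0 (the query) is never modified."""
    rng = random.Random(seed)
    rows = [list(s) for s in msa]
    for i, j in pairs:
        k = max(1, int(frac * (len(rows) - 1)))
        for r in rng.sample(range(1, len(rows)), k):
            rows[r][i] = sub
            rows[r][j] = sub
    return ["".join(r) for r in rows]
```

Because each pair is mutated jointly, the edited columns covary only with their intended partner, which is the signal coevolution-based predictors read.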
-
|
Preferential Bayesian Optimisation for Protein Design with Ranking-Based Fitness Predictors
(
Poster
)
>
Ranking-based loss functions have recently been shown to improve the quality of predictions of fitness landscapes for both standard supervised deep learning models and fine-tuned protein language models. We consider the implications of this finding for protein design with Bayesian optimisation. We investigate uncertainty quantification techniques applicable to protein language models fine-tuned with ranking losses, and show that they offer competitive calibration to CNN ensembles while demonstrating superior predictive performance. Finally, we demonstrate how uncertainty-aware ranking-based models can be exploited for protein design within the framework of preferential Bayesian optimisation. |
Alex Hawkins-Hooker · Paul Duckworth · Oliver Bent 🔗 |
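As background, a ranking-based loss of the kind referenced above can be written as a Bradley-Terry pairwise objective over measured fitness values; this is a generic sketch, not the authors' specific loss:

```python
import numpy as np

def pairwise_ranking_loss(scores, fitness):
    """Bradley-Terry pairwise objective: every pair (i, j) with
    fitness[i] > fitness[j] contributes -log sigmoid(scores[i] - scores[j]),
    so the model is trained only to order variants, not to regress values."""
    loss, n = 0.0, 0
    for i in range(len(scores)):
        for j in range(len(scores)):
            if fitness[i] > fitness[j]:
                loss += np.log1p(np.exp(-(scores[i] - scores[j])))
                n += 1
    return loss / max(n, 1)
```

Scores that order the variants correctly incur a per-pair loss below log 2; a reversed ordering incurs more.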
-
|
FAFormer: Frame Averaging Transformer for Predicting Nucleic Acid-Protein Interactions
(
Poster
)
>
Frame averaging (FA), a recent advance in geometric deep learning, is a general framework that endows a given architecture with the ability to transform data equivariantly. However, using FA as a model wrapper introduces additional computation that grows linearly with the group's cardinality and may hinder the exploitation of 3D structures, making it challenging to model macromolecules such as proteins and nucleic acids. In this paper, we present FAFormer, an equivariant Transformer model that incorporates FA as a basic component within each layer. This incorporation allows FAFormer to model coordinates in the latent space directly, without other elaborate geometric features. Building on this foundation, we introduce an equivariant cross-attention module to capture the interactions between node and coordinate representations, and an equivariant feed-forward network to enhance the communication between them. To evaluate FAFormer's performance, we establish two benchmark datasets for nucleic acid-protein contact prediction and compare FAFormer with 8 different baseline models. With these two innovations, FAFormer outperforms all the baselines and achieves state-of-the-art performance. |
Tinglin Huang · Zhenqiao Song · Rex Ying · Wengong Jin 🔗 |
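The frame-averaging construction that FAFormer builds on can be illustrated for point clouds: PCA of the centered coordinates defines a small set of frames, and averaging any function over the canonicalized inputs makes the result SE(3)-invariant. This is a minimal sketch of the generic FA recipe (Puny et al.), not of FAFormer itself:

```python
import itertools
import numpy as np

def frames(X):
    """PCA frames: the centroid plus the covariance eigenvectors, expanded
    over the sign flips that keep det(R) = +1 (the SO(3) case) -- 4 frames."""
    c = X.mean(axis=0)
    _, V = np.linalg.eigh((X - c).T @ (X - c))
    out = []
    for s in itertools.product([1.0, -1.0], repeat=3):
        R = V * np.array(s)            # flip eigenvector signs
        if np.linalg.det(R) > 0:
            out.append((R, c))
    return out

def frame_average(f, X):
    """Average an arbitrary point-cloud function f over the canonicalized
    inputs (X - c) @ R, yielding a rotation- and translation-invariant result.
    The cost grows with the number of frames -- the overhead the abstract
    refers to when FA is used as a wrapper."""
    return np.mean([f((X - c) @ R) for R, c in frames(X)], axis=0)
```

Because the frames transform along with the input, a rotated and translated copy of the point cloud canonicalizes to the same set of inputs, so the average is unchanged.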
-
|
LightMHC: A Light Model for pMHC Structure Prediction with Graph Neural Networks
(
Poster
)
>
The peptide-major histocompatibility complex (pMHC) is a crucial protein complex in cell-mediated immune recognition and response. Accurate structure prediction is potentially beneficial for protein interaction prediction and therefore aids immunotherapy design. However, predicting these structures is challenging due to their sequence and structural variability. In addition, existing pre-trained models such as AlphaFold 2 require expensive computation, inhibiting high-throughput in silico peptide screening. In this study, we propose LightMHC: a lightweight model (2.2M parameters) combining attention mechanisms, graph neural networks, and convolutional neural networks. LightMHC predicts full-atom pMHC structures from amino-acid sequences alone, without template structures. The model achieved comparable or superior performance to AlphaFold 2 and ESMFold (93M and 15B parameters respectively), with five-fold acceleration (6.65 seconds/sample for LightMHC versus 36.82 seconds/sample for AlphaFold 2), potentially offering a valuable tool for immune protein structure prediction and immunotherapy design. |
Antoine Delaunay · Yunguan Fu · Nikolai Gorbushin · Robert McHardy · Bachir Djermani · Liviu Copoiu · Michael Rooney · Maren Lang · Andrey Tovchigrechko · Ugur Sahin · Karim Beguir · Nicolas Lopez Carranza
|
-
|
FrameDiPT: SE(3) Diffusion Model for Protein Structure Inpainting
(
Poster
)
>
The protein structure prediction field has been revolutionised by deep learning, with protein folding models such as AlphaFold 2 and ESMFold. These models enable rapid in silico prediction and have been integrated into de novo protein design and protein-protein interaction (PPI) prediction. However, biologically relevant features that depend on conformational distributions cannot be estimated with these models. Diffusion models, a novel class of generative models, have been developed to learn conformational distributions and applied to de novo protein design. Limited work has been done on protein structure inpainting, in which a masked section is recovered by conditioning simultaneously on its sequence and the rest of the structure. In this work, we propose FrameDiff inPainTing (FrameDiPT), a generalised model for protein inpainting. This is particularly important for T-cell receptors given the hyper-variability of the complementarity-determining region (CDR) loops. We evaluated the model on CDR loop design for T-cell receptors and achieved prediction accuracy comparable to ProteinGenerator and RFdiffusion with limited training data and learnable parameters. Unlike deterministic structure prediction models, FrameDiPT captures the conformational distribution at different regions and binding states, highlighting a key advantage of generative models. |
Cheng ZHANG · Adam Leach · Thomas Makkink · Miguel Arbesú · Ibtissem Kadri · Daniel Luo · Liron Mizrahi · Sabrine Krichen · Maren Lang · Andrey Tovchigrechko · Nicolas Lopez Carranza · Ugur Sahin · Karim Beguir · Michael Rooney · Yunguan Fu
|
-
|
An Active Learning Framework for ML-Assisted Labeling of Cryo-EM Micrographs
(
Poster
)
>
Single-particle cryo-electron microscopy (cryo-EM) has grown significantly as a tool for discerning biological macromolecule structures. A fundamental step in this technique is the accurate identification of individual protein particles from micrographs laden with noise. Machine learning models, specifically convolutional neural networks like ResNet, have shown promise by reducing dependence on manual methods and adapting to the intricate features within the micrographs. However, challenges persist due to low signal-to-noise ratios, resulting in false positives or missed detections. Analogous challenges in computer vision have found respite in active learning, a method that combines automated systems with human intervention for refined outcomes. This paper presents a novel approach for cryo-EM particle picking based on active learning and logistic regression. Our method employs the pre-trained convolution-based model from the Topaz particle picking software for initial feature extraction and subsequently refines particle predictions through logistic regression with a human feedback loop. Complementing this, we introduce a Napari plugin, enhancing user interaction with the micrograph and facilitating intuitive model training. This approach allowed us to achieve ~10% average precision improvement over the pre-trained Topaz model with only 100 labeled particles.
|
Robert Kiewisz · Tristan Bepler 🔗 |
-
|
Validation of de novo designed water-soluble and membrane proteins by in silico folding and melting.
(
Poster
)
>
In silico validation of de novo designed proteins with deep learning (DL)-based structure prediction algorithms has become mainstream. However, formal evidence that high-confidence predictions lead to higher chances of experimental success is lacking. We used experimentally characterized de novo designs to show that AlphaFold2 and ESMFold excel at different tasks. ESMFold can identify designs generated from high-quality (designable) backbones. However, only AlphaFold2 can predict which sequences are more likely to fold among similar designs. We show that ESMFold can predict high-quality structures from just a few contacts and introduce a new approach based on incremental perturbation of the prediction ("in silico melting"), which can reveal differences in the presence of favorable contacts between designs. This study provides new insight into the explainability of DL-based structure prediction models and how they could be leveraged for the design of increasingly complex proteins, in particular membrane proteins, which still lack many basic in silico design and validation tools. |
Alvaro Martin · Carolin Berner · Sergey Ovchinnikov · Anastassia Vorobieva 🔗 |
-
|
Validation of de novo designed water-soluble and membrane proteins by in silico folding and melting.
(
Oral
)
>
In silico validation of de novo designed proteins with deep learning (DL)-based structure prediction algorithms has become mainstream. However, formal evidence that high-confidence predictions lead to higher chances of experimental success is lacking. We used experimentally characterized de novo designs to show that AlphaFold2 and ESMFold excel at different tasks. ESMFold can identify designs generated from high-quality (designable) backbones. However, only AlphaFold2 can predict which sequences are more likely to fold among similar designs. We show that ESMFold can predict high-quality structures from just a few contacts and introduce a new approach based on incremental perturbation of the prediction ("in silico melting"), which can reveal differences in the presence of favorable contacts between designs. This study provides new insight into the explainability of DL-based structure prediction models and how they could be leveraged for the design of increasingly complex proteins, in particular membrane proteins, which still lack many basic in silico design and validation tools. |
Alvaro Martin · Carolin Berner · Sergey Ovchinnikov · Anastassia Vorobieva 🔗 |
-
|
Structure, Surface and Interface Informed Protein Language Model
(
Poster
)
>
Language models applied to protein sequence data have gained considerable interest in recent years, mainly due to their ability to capture complex patterns at the protein sequence level. However, their understanding of why certain evolution-related conservation patterns appear is limited. This work explores the potential of protein language models to further incorporate intrinsic protein properties stemming from protein structures, surfaces, and interfaces. The results indicate that this multi-task pretraining allows the PLM to learn more meaningful representations by leveraging information obtained from different protein views. We evaluate and show improvements in performance on various downstream tasks, such as enzyme classification, remote homology detection, and protein engineering datasets. |
Ioan Ieremie 🔗 |
-
|
De Novo Short Linear Motif (SLiM) Discovery With AlphaFold-Multimer
(
Poster
)
>
Short Linear Motifs (SLiMs) are short, disordered peptide fragments, which mediate a large class of protein-protein interactions (PPIs). SLiM-mediated interactions are often dynamic, low-affinity interactions, which play a crucial role in cell regulation and signal transduction. Despite their importance to cell function, complete characterization of SLiMs, both in terms of binding partners and diversity, as well as consolidation into a unified dataset, is bottlenecked by experimental throughput as well as the difficulty of extracting and aggregating motif information across numerous papers and experiments. Currently, only a minuscule fraction of the estimated hundreds of thousands of SLiMs have been identified. Furthermore, the limited number of experimentally validated SLiM-protein interactions has made de novo SLiM discovery via computational methods challenging. Until now, de novo SLiM discovery has remained too difficult for computational methods, with most progress centered on the non-de novo setting, which leverages extant evolutionary data. However, recent progress in protein structure prediction has translated to significant progress across many applications, so we posit that protein structure prediction networks may make de novo SLiM discovery tractable. In this work, we curate a SLiM discovery benchmark dataset, devise an AlphaFold-Multimer-based SLiM discovery method, and demonstrate settings in which our method can accurately perform de novo SLiM discovery. |
Theo Sternlieb · Davian Ho · Jeffrey Chan 🔗 |
-
|
AF2BIND: Predicting ligand-binding sites using the pair representation of AlphaFold2
(
Poster
)
>
Predicting ligand-binding sites, particularly in the absence of previously resolved homologous structures, presents a significant challenge in structural biology. Here, we leverage the internal pairwise representation of AlphaFold2 (AF2) to train a model, AF2BIND, to accurately predict small-molecule-binding residues given only a target protein. AF2BIND uses 20 "bait" amino acids to optimally extract the binding signal in the absence of a small-molecule ligand. We find that the AF2 pair representation outperforms other neural-network representations for binding-site prediction. Moreover, use of the 20 bait amino acids allows for extraction of predicted chemical properties of the unknown ligand. |
Artem Gazizov · Anna Lian · Casper Goverde · Sergey Ovchinnikov · Nicholas Polizzi 🔗 |
-
|
Protein generation with evolutionary diffusion: sequence is all you need
(
Poster
)
>
Diffusion models have demonstrated the ability to generate biologically plausible proteins that are dissimilar to any proteins seen in nature, enabling unprecedented capability and control in de novo protein design. However, current state-of-the-art diffusion models generate protein structures, which limits the scope of their training data and restricts generations to a small and biased subset of protein space. We introduce a general-purpose diffusion framework, EvoDiff, that combines evolutionary-scale data with the distinct conditioning capabilities of diffusion models for controllable protein generation in sequence space. EvoDiff generates high-fidelity, diverse, and structurally-plausible proteins that cover natural sequence and functional space. Critically, EvoDiff can generate proteins inaccessible to structure-based models, such as those with disordered regions, and design scaffolds for functional structural motifs, demonstrating the universality of our sequence-based formulation. We envision that EvoDiff will expand capabilities in protein engineering beyond the structure-function paradigm toward programmable, sequence-first design. All code and models will be open-source. |
Sarah Alamdari · Nitya Thakkar · Rianne van den Berg · Alex X Lu · Nicolo Fusi · Ava Amini · Kevin Yang 🔗 |
-
|
Protein generation with evolutionary diffusion: sequence is all you need
(
Oral
)
>
Diffusion models have demonstrated the ability to generate biologically plausible proteins that are dissimilar to any proteins seen in nature, enabling unprecedented capability and control in de novo protein design. However, current state-of-the-art diffusion models generate protein structures, which limits the scope of their training data and restricts generations to a small and biased subset of protein space. We introduce a general-purpose diffusion framework, EvoDiff, that combines evolutionary-scale data with the distinct conditioning capabilities of diffusion models for controllable protein generation in sequence space. EvoDiff generates high-fidelity, diverse, and structurally-plausible proteins that cover natural sequence and functional space. Critically, EvoDiff can generate proteins inaccessible to structure-based models, such as those with disordered regions, and design scaffolds for functional structural motifs, demonstrating the universality of our sequence-based formulation. We envision that EvoDiff will expand capabilities in protein engineering beyond the structure-function paradigm toward programmable, sequence-first design. All code and models will be open-source. |
Sarah Alamdari · Nitya Thakkar · Rianne van den Berg · Alex X Lu · Nicolo Fusi · Ava Amini · Kevin Yang 🔗 |
-
|
Protein-Protein Docking with Latent Diffusion
(
Poster
)
>
Interactions between proteins form the basis for many biological processes, and understanding their relationships is an area of active research. Computational approaches offer a way to facilitate this understanding without the burden of expensive and time-consuming experiments. Here, we introduce LatentDock, a generative model for protein-protein docking. Our method leverages a diffusion model operating within a geometrically-structured latent space, derived from an encoder producing roto-translational invariant representations of protein complexes. Critically, it is able to perform flexible docking, capturing both backbone and side-chain conformational changes. Furthermore, our model can condition on binding sites, leading to significant performance gains. Empirical evaluations show the efficacy of our approach over relevant baselines, even outperforming models that do not account for flexibility. |
Matt McPartlon · Céline Marquet · Tomas Geffner · Daniel Kovtun · Alexander Goncearenco · Zachary Carpenter · Luca Naef · Michael Bronstein · Jinbo Xu 🔗 |
-
|
HiFi-NN annotates the microbial dark matter with Enzyme Commission numbers
(
Poster
)
>
The accurate computational annotation of protein sequences with enzymatic function, especially those that are part of the functional and taxonomic dark matter, remains a fundamental challenge in bioinformatics. Here, we present HiFi-NN (Hierarchically-Finetuned Nearest Neighbour search), which annotates protein sequences to the 4th level of EC (Enzyme Commission) number with greater precision and recall than all existing deep learning methods. HiFi-NN is a hierarchically-finetuned deep learning method based on a combination of semi-supervised representation learning and a nearest-neighbours classifier. Furthermore, we show that this method can correctly identify the EC number of sequences sharing less than 40% identity with known proteins, where the current state-of-the-art annotation tool, BLASTp, cannot. We improve the learned representations by increasing the diversity of the training set, not just in sequence space but also in terms of the environments the sequences were sampled from. Finally, we use HiFi-NN to annotate a portion of the microbial dark matter sequences in the MGnify database. |
Gavin Ayres 🔗 |
-
|
Towards Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion
(
Poster
)
>
Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a generative model that jointly designs the sequences and structures of nucleic acids and proteins, independently or in complex, using joint SE(3)-discrete diffusion noise. Such a model has important implications for emerging areas of macromolecular design, including structure-based transcription factor design and design of noncoding RNA sequences. We demonstrate the utility of MMDiff through a rigorous new design benchmark for macromolecular complex generation that we introduce in this work. Our results demonstrate that MMDiff is able to successfully generate micro-RNA and single-stranded DNA molecules while being modestly capable of jointly modeling DNA and RNA molecules in interaction with multi-chain protein complexes. |
Alex Morehead · Jeffrey Ruffolo · Aadyot Bhatnagar · Ali Madani 🔗 |
-
|
SO(3)-Equivariant Representation Learning in 2D Images
(
Poster
)
>
Imaging physical objects that are free to rotate and translate in 3D is challenging. While an object's pose and location do not change its nature, varying them presents problems for current vision models. Equivariant models account for these nuisance transformations, but current architectures only model either 2D transformations of 2D signals or 3D transformations of 3D signals. Here, we propose a novel convolutional layer consisting of 2D projections of 3D filters that models 3D equivariances of 2D signals—critical for capturing the full space of spatial transformations of objects in imaging domains such as cryo-EM. We additionally present methods for aggregating our rotation-specific outputs. We demonstrate significant improvement on several tasks, including particle picking and pose estimation. |
Darnell Granberry · Alireza Nasiri · Jiayi Shou · Alex J. Noble · Tristan Bepler 🔗 |
-
|
HelixDiff: Conditional Full-atom Design of Peptides With Diffusion Models
(
Poster
)
>
Peptide engineering has emerged as a critical discipline within biomedicine, finding applications in therapeutics, diagnostics, and synthetic biology. Despite their prevalence in biological processes, de novo therapeutic peptide design remains a formidable challenge. We focus here on generating helical peptides and present HelixDiff, a score-based diffusion model that learns and generates all-atom helical structures. We incorporate a hotspot-specific inpainting mechanism for the conditional design of α-helix structures that align with critical residues at protein-peptide interfaces. Our model produces helix structures with near-native geometries for a substantial portion of the test scenarios, with root mean square deviations (RMSDs) of less than 1 Å. HelixDiff achieves better sequence recovery and Rosetta scores for unconditional and conditional generation than HelixGAN, our previous GAN-based model. A case study involving glucagon-like peptide-1 (GLP-1) underscores HelixDiff's capacity to generate therapeutic D-peptides: the HelixDiff D-GLP-1 design is more stable than our earlier HelixGAN design when both D-peptides are bound to the GLP-1 receptor, according to molecular dynamics simulations. The source code and datasets are available on GitHub (https://github.com/xxiexuezhi/HelixDiff). |
Xuezhi Xie · Pedro A Valiente · Jisun Kim · Philip Kim 🔗 |
-
|
DIFFMASIF: Score-Based Diffusion Models for Protein Surfaces
(
Poster
)
>
Predicting protein-protein complexes is a central challenge of computational structural biology. However, existing state-of-the-art methods rely on co-evolution learned from large amino acid sequence datasets and thus often fall short on both transient and engineered interfaces (which are of particular interest in therapeutic applications), where co-evolutionary signals are absent or minimal. To address this, we introduce DiffMASIF, a novel score-based diffusion model for rigid protein-protein docking. Instead of sequence-based features, DiffMASIF uses a protein molecular surface-based encoder-decoder architecture trained via a novel combination of geometric pre-training tasks to effectively learn physical complementarity. The encoder uses learned geometric features extracted from protein surface point clouds, as well as geometrically pre-trained residue embeddings pooled to the surface. It directly learns binding-site complementarity through prediction of contact sites, as both a pretraining and an auxiliary loss, and also allows specification of known binding sites during inference. It is followed by a decoder predicting rotation and translation via SO(3) diffusion. We show that DiffMASIF achieves state-of-the-art performance among deep learning methods for rigid-body docking, in particular on structurally novel interfaces and those with low sequence conservation. This provides a significant advance towards accurate modelling of protein interactions with low co-evolution and their many practical applications.
|
Mehmet Akdel · Freyr Sverrisson · Dylan Abramson · Jean Feydy · Alexander Goncearenco · Yusuf Adeshina · Daniel Kovtun · Céline Marquet · Xuejin Zhang · David Baugher · Zachary Carpenter · Luca Naef · Michael Bronstein · Bruno Correia
|
-
|
FLAb: Benchmarking deep learning methods for antibody fitness prediction
(
Poster
)
>
The successful application of machine learning in therapeutic antibody design relies heavily on the ability of models to accurately represent the sequence-structure-function landscape, also known as the fitness landscape. Previous protein benchmarks (including The Critical Assessment of Function Annotation, Tasks Assessing Protein Embeddings, and FLIP) examine fitness and mutational landscapes across many protein families, but they either exclude antibody data or use very little of it. In light of this, we present the Fitness Landscape for Antibodies (FLAb), the largest therapeutic antibody design benchmark to date. FLAb currently encompasses six properties of therapeutic antibodies: (1) expression, (2) thermostability, (3) immunogenicity, (4) aggregation, (5) polyreactivity, and (6) binding affinity. We use FLAb to assess the performance of various widely adopted, pretrained, deep learning models for proteins (IgLM, AntiBERTy, ProtGPT2, ProGen2, ProteinMPNN, and ESM-IF); and compare them to physics-based Rosetta. Overall, no models are able to correlate with all properties or across multiple datasets of similar properties, indicating that more work is needed in prediction of antibody fitness. Additionally, we elucidate how wild type origin, deep learning architecture, training data composition, parameter size, and evolutionary signal affect performance, and we identify which fitness landscapes are more readily captured by each protein model. To promote an expansion on therapeutic antibody design benchmarking, all FLAb data are freely accessible and open for additional contribution at https://github.com/Graylab/FLAb. |
Michael F Chungyoun · Jeffrey Ruffolo · Jeffrey Gray 🔗 |
-
|
Parameter-Efficient Fine Tuning of Protein Language Models Improves Prediction of Protein-Protein Interactions
(
Poster
)
>
Mirroring the massive increase in the size of transformer-based models in natural language processing, proteomics too has seen increasingly large foundational protein language models. As model size increases, the computational and memory footprint of fine-tuning grows out of reach of many academic labs and small biotechs. In this work, we compare full fine-tuning of protein language models with training a classifier head on frozen representations and with the parameter-efficient fine-tuning method LoRA, on the task of predicting protein-protein interactions. We find that LoRA outperforms full fine-tuning while requiring a reduced memory footprint, and that frozen embeddings remain a viable alternative when computational resources for fine-tuning are impractical. |
Samuel Sledzieski · Meghana Kshirsagar · Rahul Dodhia · Bonnie Berger · Juan Lavista Ferres 🔗 |
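For readers unfamiliar with LoRA, the comparison above contrasts full fine-tuning with keeping each pretrained weight matrix frozen and learning only a low-rank additive update. A minimal numpy sketch of such an adapted layer (a hypothetical class for illustration, not the authors' code):

```python
import numpy as np

class LoRALinear:
    """A frozen pretrained weight W plus a trainable low-rank update
    (alpha / r) * B @ A. During fine-tuning only A and B receive gradients,
    cutting trainable parameters from d_out * d_in to r * (d_in + d_out)."""
    def __init__(self, W, r=8, alpha=16, seed=0):
        rng = np.random.default_rng(seed)
        self.W = W                                     # frozen pretrained weight
        self.A = 0.01 * rng.standard_normal((r, W.shape[1]))
        self.B = np.zeros((W.shape[0], r))             # zero init: no-op at start
        self.scale = alpha / r

    def __call__(self, x):
        return x @ self.W.T + self.scale * (x @ self.A.T) @ self.B.T
```

Because B starts at zero, the adapted layer initially reproduces the frozen model exactly; training then moves only the small adapter matrices, which is what keeps the memory footprint below full fine-tuning.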
-
|
TriFold: A New Architecture for Predicting Protein Sequences from Structural Data
(
Poster
)
>
The inverse protein folding challenge aims to identify specific amino acid sequences that fold into a predetermined protein structure. Despite advancements like AlphaFold2, it remains a complex issue in protein engineering. This paper introduces a novel architecture inspired by the self-attention mechanisms in AlphaFold2 and RoseTTAFold2, adapted for solving the inverse folding problem. Our approach, contrasted with previous graph-based models, leverages attention-based transformer architecture to efficiently integrate information across the entire protein. We combine attention mechanisms, such as invariant point attention, with those designed for sequence and pair representations, resulting in enhanced performance in the inverse protein folding task. Furthermore, we introduce a novel feature representation of protein structure used as an inductive bias in pair representation. The proposed model is trained and tested using the OpenFold codebase on the Protein Data Bank and the AlphaFold distillation dataset, achieving performance improvements over ProteinMPNN regarding sequence recovery. The model's validation on the CAMEO dataset, which comprises proteins released from October 16th, 2021 – January 16th, 2022, further substantiates its efficacy in enhanced sequence recovery across short, single, and multiple chains. |
Harish Srinivasan · Jian Zhou 🔗 |
-
|
End-to-End Sidechain Modeling in AlphaFold2: Attention May or May Not Be All That You Need
(
Poster
)
>
AlphaFold2 (AF2) has made significant strides in computational structural biology and drug discovery. However, limitations remain, particularly for downstream tasks such as molecular docking. We propose that inaccuracies in amino acid sidechain prediction could contribute to these limitations. To address this, we explored two simple and complementary strategies to improve sidechain accuracy in AF2: (1) substituting the default ResNet-based angle predictor in AF2 with a Transformer-like model, and (2) refining the angle predictor using an energy-like loss function. Our analysis indicates that ResNets and Transformers offer comparable performance. However, training with an energy-like loss can sometimes boost structural quality, especially when the entire model is finetuned. We suggest a holistic approach that looks beyond AF2's sidechain torsion angle predictor to improve sidechain modeling in future studies. |
Jonathan King · David Koes 🔗 |
-
|
Coarse-graining via reparametrization avoids force-matching and back-mapping
(
Poster
)
>
Energy minimization problems are highly non-convex problems at the heart of the physical sciences. These problems often suffer from slow convergence due to sharply falling potentials, leading to small gradients. To make them tractable, we often resort to coarse-graining (CG), a type of lossy compression. We introduce a new way to perform CG using reparametrization, which does not require the costly force-matching and back-mapping steps of traditional CG. We focus on improving the slow dynamics by using CG to project onto slow modes, and we propose a way to find robust slow modes for many physical potentials. Our method also does not require data, which is expensive to obtain for molecular systems and a bottleneck for applying machine learning methods to them. We test our method on molecular dynamics simulations of small-protein folding. We observe that our method either reaches deeper (more optimal) energies or runs in less time than the baseline non-CG simulations. |
Nima Dehmamy · Csaba Both · Subhro Das · Tommi Jaakkola 🔗 |
-
|
SE3Lig: SE(3)-equivariant CNNs for the reconstruction of cofactors and ligands in protein structures
(
Poster
)
>
Protein structure prediction algorithms such as AlphaFold2 and ESMFold have dramatically increased the availability of high-quality models of protein structures. Because these algorithms do not predict anything aside from the protein itself, there is a growing need for methods that can rapidly screen protein structures for ligands. Previous work on similar tasks has shown promise but is limited in the classes of atoms it predicts and can benefit from the recent architectural developments in convolutional neural networks (CNNs). In this work, we introduce SE3Lig, a model for semantic in-painting of small molecules in protein structures. Specifically, we report SE(3)-equivariant CNNs trained to predict the atomic densities of common classes of cofactors (hemes, flavins, etc.) and the water molecules and inorganic ions in their vicinity. While the models are trained on high-resolution crystal structures of enzymes, they perform well on structures predicted by AlphaFold2, which suggests that the algorithm correctly represents cofactor-binding cavities. |
Guillaume Lamoureux · Sid Bhadra-Lobo · Anushriya Subedy · Sagar Khare 🔗 |
-
|
Cramming Protein Language Model Training in 24 GPU Hours
(
Poster
)
>
Protein language models (pLMs) are ubiquitous across biological machine learning research, but state-of-the-art models like ESM2 take hundreds of thousands of GPU hours to pre-train on the vast protein universe. Resource requirements for scaling up pLMs prevent fundamental investigations into how optimal modeling choices might differ from those used in natural language. Here, we define a "cramming" challenge for pLMs and train performant models in 24 hours on a single GPU. By re-examining many aspects of pLM training, we are able to train a 67 million parameter model in a single day that achieves comparable performance on downstream protein fitness landscape inference tasks to ESM-3B, a model trained for over 15,000 times more GPU hours than ours. |
Nathan Frey · Taylor Joren · Aya Ismail · Allen Goodman · Stephen Ra · Kyunghyun Cho · Richard Bonneau · Vladimir Gligorijevic 🔗 |
-
|
Preparation Of Labeled Cryo-ET Datasets For Training And Evaluation Of Machine Learning Models
(
Poster
)
>
We present datasets aimed at improving the efficiency of cryo-electron tomographic data analysis. While cryo-electron tomography (cryo-ET) holds immense promise as a tool for native structural biology, it faces persistent challenges in segmentation and annotation. These challenges primarily stem from the absence of diverse ground truth datasets for efficient model training, evaluation, and benchmarking. To address these challenges, we have collected and are currently annotating datasets spanning a range of complexities. Composed of carefully selected protein mixtures and organisms with small genomes, these datasets offer a broad spectrum of structures for study. The datasets are designed to provide a robust foundation for development and evaluation of machine learning models for annotation tasks, thereby enhancing the efficacy and applicability of cryo-ET in elucidating complex native biological structures and interactions. This ongoing project will soon offer the annotated datasets publicly, encouraging further innovation and research in the community. |
Aygul Ishemgulova · Alex J. Noble · Tristan Bepler · Alex De Marco 🔗 |
-
|
EMPOT: partial alignment of density maps and rigid body fitting using unbalanced Gromov-Wasserstein divergence
(
Poster
)
>
Aligning EM density maps and fitting atomic models are essential steps in single particle cryogenic electron microscopy (cryo-EM), with recent methods leveraging various algorithms and machine learning tools. As aligning maps remains challenging in the presence of a map that only partially fits the other (e.g. one subunit), we here propose a new procedure, EMPOT (EM Partial alignment with Optimal Transport), for partial alignment of 3D maps. EMPOT first finds a coupling between 3D point-cloud representations, which is associated with their so-called unbalanced Gromov-Wasserstein divergence, and second, uses this coupling to find an optimal rigid body transformation. Upon running and benchmarking our method with experimental maps and structures, we show that EMPOT outperforms standard methods for aligning subunits of a protein complex and fitting atomic models to a density map, suggesting potential applications of partial optimal transport for improving cryo-EM pipelines. |
Aryan Tajmir Riahi · Chenwei Zhang · James Chen · Anne Condon · Khanh Dao Duc 🔗 |
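The second stage of the EMPOT procedure above, turning a soft point-cloud coupling into a rigid-body transformation, can be sketched as a weighted Kabsch fit. This is an illustrative sketch, not the authors' implementation; the coupling matrix `P` is assumed to come from an (unbalanced) Gromov-Wasserstein solver, and the function name is hypothetical.

```python
import numpy as np

def rigid_fit_from_coupling(X, Y, P):
    """Weighted Kabsch: best rigid (R, t) mapping points X onto points Y,
    minimizing sum_ij P_ij ||R x_i + t - y_j||^2 for a soft coupling P."""
    w = P.sum(axis=1)               # mass assigned to each source point
    v = P.sum(axis=0)               # mass assigned to each target point
    W = P.sum()
    xbar = (w @ X) / W              # coupling-weighted centroids
    ybar = (v @ Y) / W
    Xc, Yc = X - xbar, Y - ybar
    H = Xc.T @ P @ Yc               # 3x3 weighted cross-covariance
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))   # avoid reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = ybar - R @ xbar
    return R, t
```

With a hard one-to-one coupling this reduces to the classical Kabsch algorithm; with a partial coupling, unmatched points simply receive near-zero weight and do not bias the fit.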
-
|
Fast protein backbone generation with SE(3) flow matching
(
Poster
)
>
This work presents a method for fast protein backbone generation using SE(3) flow matching. Specifically, we adapt FrameDiff, a state-of-the-art non-pretrained diffusion model, to perform flow matching with minimal changes. We first develop the theoretical results for SE(3) flow matching, then demonstrate modifications during training to effectively learn the conditional vector field. Compared to FrameDiff, we require five times fewer timesteps to sample while achieving the same designability metrics on unconditional monomer backbone generation. Our work paves the way toward faster generative models in de novo protein design. |
Jason Yim · Andrew Campbell · Yue Kwang Foong · Sarah Lewis · Victor Satorras · Michael Gastegger · Bas Veeling · Jose Jimenez-Luna · Regina Barzilay · Tommi Jaakkola · Frank Noe
|
-
|
Frame2seq: structure-conditioned masked language modeling for protein sequence design
(
Poster
)
>
Machine learning has revolutionized computational protein design, enabling significant progress in protein backbone generation and sequence design. For protein sequence design, encoder-decoder models have achieved state-of-the-art accuracy, which has translated to experimental success. Here, we introduce Frame2seq, a structure-conditioned masked language model for protein sequence design that, in contrast to autoregressive methods, generates sequences in a single pass. On the CATH 4.2 test dataset, Frame2seq outperforms the state-of-the-art autoregressive method, ProteinMPNN, achieving 49.1% sequence recovery (2.0% improvement) with over six times faster inference. In addition, Frame2seq accurately estimates the error in its own predictions across diverse backbones. To expand design tasks beyond native-like sequence space, we use Frame2seq to generate low sequence identity designs for de novo backbones. Through experimental characterization, we show that Frame2seq successfully designs soluble, monomeric, stable proteins with low sequence identity to native proteins. The speed and accuracy of Frame2seq will accelerate exploration of novel sequence space across diverse design tasks, including challenging applications such as multi-objective optimization. |
Deniz Akpinaroglu · Kosuke Seki · Eleanor Zhu · Tanja Kortemme 🔗 |
-
|
Structure-Conditioned Generative Models for De Novo Ligand Generation: A Pharmacophore Assessment
(
Poster
)
>
Deep generative models show promise for de novo molecular design, especially pocket-conditioned conditional generation methods that output small-molecule ligands in their predicted binding pose with high shape complementarity. However, recent work demonstrates these models still fail to generate chemically valid and synthetically accessible ligands. This paper provides further insight into these methods and their generated molecules through analysis of pharmacophore features commonly used in structure-based and ligand-based drug discovery. We specifically assess the generated distribution of hydrogen bond donors, acceptors, and aromatic rings from deep generative methods on three well-studied protein targets: adenosine A2a receptor, cyclin-dependent kinase 2, and the main protease of SARS-CoV-2. Our results find autoregressive approaches better recapitulate the expected spatial distribution of pharmacophore features compared to diffusion-based models. The analysis presented here highlights current limitations in deep generative models for 3D design, while suggesting new directions to realistically aid structure-based design. |
Shannon Smith · Leo Gendelev · Kangway Chuang · Seth Harris 🔗 |
-
|
Jointly Embedding Protein Structures and Sequences through Residue Level Alignment
(
Poster
)
>
The relationships between protein sequences, structures, and their functions are determined by complex codes that scientists aim to decipher. In particular, while structures contain key information about the protein's biochemical functions, they are often experimentally difficult to obtain. In contrast, protein sequences are abundant but are a step removed from molecular function. In this paper, we propose Residue Level Alignment (RLA) — a self-supervised objective for aligning structure and sequence embedding spaces. By situating structure and sequence encoders within the same latent space, RLA allows the structure encoder to leverage large sequence databases and enriches the sequence encoder with spatial information. Moreover, our framework enables us to measure the similarity between a structure and sequence by comparing their RLA embeddings: we show how RLA similarity scores can be used for binder design by screening for appropriate docking candidates for a given protein-protein or protein-peptide interaction. |
Foster Birnbaum · Saachi Jain · Amy Keating · Aleksander Madry 🔗 |
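The residue-level alignment objective described above can be illustrated with a CLIP-style symmetric InfoNCE loss over per-residue embeddings. This is a minimal NumPy sketch under the assumption that the matched residue is the positive and all other residues are negatives; the function and parameter names (`residue_alignment_loss`, `tau`) are illustrative, not from the paper.

```python
import numpy as np

def residue_alignment_loss(seq_emb, struct_emb, tau=0.07):
    """Symmetric InfoNCE aligning residue i's sequence embedding with
    residue i's structure embedding; other residues act as negatives."""
    s = seq_emb / np.linalg.norm(seq_emb, axis=1, keepdims=True)
    z = struct_emb / np.linalg.norm(struct_emb, axis=1, keepdims=True)
    logits = (s @ z.T) / tau                     # (L, L) cosine similarities

    def xent(M):
        M = M - M.max(axis=1, keepdims=True)     # numerical stability
        logp = M - np.log(np.exp(M).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(logp))           # correct match is residue i -> i

    return 0.5 * (xent(logits) + xent(logits.T))
```

Minimizing this loss pulls the two encoders into a shared latent space, after which the dot product of RLA embeddings can serve as a structure-sequence similarity score, e.g. for screening binder candidates.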
-
|
Evaluating Representation Learning on the Protein Structure Universe
(
Poster
)
>
We introduce ProteinWorkshop, a comprehensive and rigorous benchmark suite for evaluating protein structure representation learning methods. We provide large-scale pretraining and downstream tasks comprising both experimental and predicted structures, offering a balanced challenge to representation learning algorithms. We demonstrate the utility of our benchmark by systematically evaluating state-of-the-art protein-specific and generic geometric Graph Neural Networks and the extent to which they benefit from pretraining. We find that: (1) pretraining consistently improves the performance of both rotation-invariant and equivariant geometric models; (2) equivariant models seem to benefit more from pretraining compared to invariant models. Our open-source codebase reduces the barrier to entry for working with large structure-based datasets by providing utilities for constructing new tasks directly from the entire PDB, as well as storage-efficient dataloaders from large-scale predicted structures including AlphaFoldDB and ESM Atlas. ProteinWorkshop is available at: https://anonymous.4open.science/r/ProteinWorkshop-B8F5. |
Arian Jamasb · Alex Morehead · Zuobai Zhang · Chaitanya K. Joshi · Kieran Didi · Simon Mathis · Charles Harris · Jian Tang · Jianlin Cheng · Pietro Lió · Tom Blundell
|
-
|
Enhancing Antibody Language Models with Structural Information
(
Poster
)
>
The central tenet of molecular biology is that a protein’s amino acid sequence determines its three-dimensional structure, and thus its function. However, proteins with similar sequences do not always fold into the same shape, and, vice versa, dissimilar sequences can adopt similar folds. In this work, we explore antibodies, a class of proteins in the immune system, whose local shapes are highly unpredictable, even with small variations in their sequence. Inspired by the CLIP method [1], we propose a multimodal contrastive learning approach, contrastive sequence-structure pre-training (CSSP), which amalgamates the representations of antibody sequences and structures in a mutual latent space. Integrating structural information leads both antibody and protein language models to show better correspondence with structural similarity and improves accuracy and data efficiency in downstream binding prediction tasks. We provide an optimised CSSP-trained model, AbFormer-CSSP, for non-commercial use at [HuggingFace link redacted for anonymity]. |
Justin Barton · Jacob Galson · Jinwoo Leem 🔗 |
-
|
Amortized Pose Estimation for X-Ray Single Particle Imaging
(
Poster
)
>
X-ray single particle imaging (SPI) is a nascent technique that can capture the dynamics of biomolecules at room temperature. SPI experiments will one day collect tens of millions of images of the same molecule in order to overcome the weak scattering of individual proteins. Existing reconstruction algorithms will be unable to scale to datasets of this size because they perform computationally expensive search steps to estimate the orientation of the molecule in each image. In this work, we propose a reconstruction algorithm that amortizes the estimation of pose via an autoencoder framework. Our approach consists of a convolutional encoder that maps X-ray images to predicted poses and a physics-based decoder that implicitly fuses all the 2D scattering images into a volumetric representation of the molecule. We validate our method on 6 synthetic datasets of 2 distinct proteins, showing that for the largest datasets containing 5 million images, our technique can reconstruct the electron density in a single pass. |
Jay Shenoy · Axel Levy · Frederic Poitevin · Gordon Wetzstein 🔗 |
-
|
Rethinking Performance Measures of RNA Secondary Structure Problems
(
Poster
)
>
Accurate RNA secondary structure prediction is vital for understanding cellular regulation and disease mechanisms. Deep learning (DL) methods have surpassed traditional algorithms by predicting complex features like pseudoknots and multi-interacting base pairs. However, traditional distance measures struggle to capture such tertiary interactions, and the currently used evaluation measures (F1 score, MCC) have limitations. We propose the Weisfeiler-Lehman graph kernel (WL) as an alternative metric. Embracing graph-based metrics like WL enables fair and accurate evaluation of RNA structure prediction algorithms. Further, WL provides informative guidance, as demonstrated in an RNA design experiment. |
Frederic Runge · Jörg Franke · Daniel Fertmann · Frank Hutter 🔗 |
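The Weisfeiler-Lehman subtree kernel proposed above can be sketched in a few lines: an RNA structure becomes a labeled graph (nucleotides labeled by base, with backbone and base-pair edges), nodes are iteratively relabeled by hashing their label together with their neighbours' labels, and two structures are compared via the dot product of their label-count vectors. This is a minimal sketch, assuming an adjacency-list input format; the function names and fixed iteration count are illustrative, not the authors' code.

```python
from collections import Counter

def wl_features(adj, labels, iterations=2):
    """Weisfeiler-Lehman relabeling: replace each node label with the pair
    (own label, sorted multiset of neighbour labels), repeat, count all labels."""
    feats = Counter(labels)
    for _ in range(iterations):
        labels = [
            (labels[v], tuple(sorted(labels[u] for u in adj[v])))
            for v in range(len(adj))
        ]
        feats.update(labels)
    return feats

def wl_kernel(adj1, lab1, adj2, lab2, iterations=2):
    """WL subtree kernel: dot product of the two label-count vectors."""
    f1 = wl_features(adj1, lab1, iterations)
    f2 = wl_features(adj2, lab2, iterations)
    return sum(f1[k] * f2[k] for k in f1)
```

Unlike F1 over base pairs, this comparison is sensitive to the local wiring around each nucleotide, which is what lets it account for pseudoknots and multi-interacting base pairs.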
-
|
Structure-based and leakage-free data splits for rigorous protein function evaluation
(
Poster
)
>
Datasets for protein machine learning tasks are typically constructed by splitting protein sequences between train, validation and test sets based on protein sequence similarity. For tasks largely determined by protein structure, such as protein function prediction, we hypothesize that such data splitting may cause data leakage between the sets, since proteins can be structurally and functionally similar but still have dissimilar sequences. As a result, model performance on low sequence similarity levels could be overestimated. We demonstrate that this is the case on a commonly used enzyme dataset, by introducing a novel dataset construction methodology designed to prevent leakage between sets based on i) using structure similarity instead of sequence similarity to cluster proteins, and ii) generating tight protein clusters using community detection. Additionally, we demonstrate that simple models based on protein language model representations provide powerful baselines for the task of protein function prediction. |
Charlotte Rochereau · Mohammed AlQuraishi · Arthur Valentin · Gergo Nikolenyi 🔗 |
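The cluster-level split idea above can be sketched as follows: connect proteins whose structural similarity exceeds a threshold, group them into clusters (here via simple union-find connected components, a stand-in for the paper's community detection), and assign whole clusters to either train or test so that no similar pair straddles the split. Function name and the edge-list input format are assumptions for illustration.

```python
import random

def leakage_free_split(n, similar_pairs, test_fraction=0.2, seed=0):
    """Assign whole structure-similarity clusters to train or test,
    so no pair of similar proteins is split across the two sets."""
    parent = list(range(n))

    def find(x):                      # union-find with path halving
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in similar_pairs:        # merge structurally similar proteins
        parent[find(a)] = find(b)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)

    groups = list(clusters.values())
    random.Random(seed).shuffle(groups)
    train, test = [], []
    for g in groups:                  # fill test with whole clusters first
        (test if len(test) < test_fraction * n else train).extend(g)
    return train, test
```

Splitting at the cluster level rather than the sequence level is exactly what prevents the leakage the abstract describes: two proteins with dissimilar sequences but similar structures always end up on the same side of the split.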
-
|
Uncovering sequence diversity from a known protein structure
(
Poster
)
>
We present InvMSAFold, a method for generating a diverse set of protein sequences folding into a single structure. For a given structure, it defines a probability distribution over the space of sequences. This distribution captures second-order correlations observed in multiple sequence alignments (MSAs) of homologous proteins. Our innovation lies in generating highly diverse protein sequences while preserving structural and functional integrity. This approach offers exciting prospects, particularly in directed evolution, by providing diverse starting points for protein design. |
Luca Alessandro Silva · Barthélémy Meynard · Carlo Lucibello · Christoph Feinauer 🔗 |
-
|
Exploiting language models for protein discovery with latent walk-jump sampling
(
Poster
)
>
We introduce a single-step score-based denoising framework for generative modeling of antibody protein sequences from higher-dimensional embeddings of pretrained language models. Our latent Walk-Jump Sampler (L-WJS) framework learns the manifold of a smoothed latent space of a pretrained protein language model. New sequences are generated by score-based exploration using Langevin MCMC (walk) on the smoothed latent space and denoising (jump) back to the original latent space. Our framework thus combines the attractive properties of the rich and semantically meaningful representations from pretrained protein language models trained on large corpora of sequences with the improved sample quality of score-based modeling in the latent space. We demonstrate that L-WJS is data efficient, generates novel, diverse, and natural antibody sequences, and opens up avenues for sampling (both unguided and guided) from the latent space of various pretrained models. |
Sai Pooja Mahajan · Nathan Frey · Dan Berenberg · Joseph Kleinhenz · Richard Bonneau · Vladimir Gligorijevic · Andrew Watkins · Saeed Saremi 🔗 |
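The walk-jump scheme above can be illustrated with a toy one-dimensional example, under the assumption that the score of the sigma-smoothed empirical density is available in closed form (true for this Gaussian-smoothed toy data, whereas the paper learns the score in a language-model latent space). All names and hyperparameters here are illustrative.

```python
import numpy as np

def walk_jump_sample(data, sigma=0.5, n_steps=200, step=0.01, seed=0):
    """Toy 1-D walk-jump sampler: Langevin MCMC ('walk') on the
    sigma-smoothed data density, then a Tweedie denoising step ('jump')."""
    rng = np.random.default_rng(seed)

    def score(y):  # closed-form score of the Gaussian-smoothed empirical density
        d = data - y
        w = np.exp(-0.5 * (d / sigma) ** 2)
        return (w @ d) / (w.sum() * sigma**2)

    y = rng.choice(data) + sigma * rng.normal()      # start on the smoothed manifold
    for _ in range(n_steps):                         # walk: Langevin updates
        y = y + step * score(y) + np.sqrt(2 * step) * rng.normal()
    return y + sigma**2 * score(y)                   # jump: Tweedie's formula
```

The key property, mirrored here, is that the walk only ever needs the score of the *smoothed* density, which is easier to model, while the single jump step maps the noisy sample back to the clean data manifold.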