Prediction of a molecule’s 3D conformer ensemble from the molecular graph plays a key role in cheminformatics and drug discovery. Existing generative models have several drawbacks, including failure to model important elements of molecular geometry (e.g., torsion angles), separate optimization stages prone to error accumulation, and the need for structure fine-tuning based on approximate classical force fields or computationally expensive methods. We propose GEOMOL, an end-to-end, non-autoregressive, and SE(3)-invariant machine learning approach to generate distributions of low-energy molecular 3D conformers. Leveraging the power of message passing neural networks (MPNNs) to capture local and global graph information, we predict local atomic 3D structures and torsion angles, avoiding unnecessary over-parameterization of the geometric degrees of freedom (e.g., one angle per non-terminal bond). Such local predictions suffice both for the training loss computation and for the full deterministic conformer assembly (at test time). We devise a non-adversarial, optimal-transport-based loss function to promote diverse conformer generation. GEOMOL predominantly outperforms popular open-source, commercial, or state-of-the-art machine learning (ML) models, while achieving significant speed-ups. We expect such differentiable 3D structure generators to significantly impact molecular modeling and related applications.
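To make the abstract's notion of a torsion angle concrete, here is a minimal sketch of the standard signed dihedral angle computed from four atom positions. It is not taken from GEOMOL's code: the function name `torsion_angle`, the use of NumPy, and the example coordinates are all assumptions made for illustration.

```python
import numpy as np


def torsion_angle(p0, p1, p2, p3):
    """Signed dihedral angle in radians about the p1-p2 bond.

    Hypothetical helper for illustration; not part of GEOMOL.
    """
    p0, p1, p2, p3 = (np.asarray(p, dtype=float) for p in (p0, p1, p2, p3))
    b0 = p0 - p1                 # vector from p1 to its neighbor p0
    b1 = p2 - p1                 # central bond direction
    b2 = p3 - p2                 # vector from p2 to its neighbor p3
    b1 = b1 / np.linalg.norm(b1)

    # Project the two outer vectors onto the plane perpendicular to the bond.
    v = b0 - np.dot(b0, b1) * b1
    w = b2 - np.dot(b2, b1) * b1

    # atan2 of the sine-like and cosine-like terms gives a signed angle.
    x = np.dot(v, w)
    y = np.dot(np.cross(b1, v), w)
    return np.arctan2(y, x)


# Four made-up atom positions; the angle about the middle bond is 90 degrees in magnitude.
atoms = np.array([[1.0, 0.0, 0.0],
                  [0.0, 0.0, 0.0],
                  [0.0, 0.0, 1.0],
                  [0.0, 1.0, 1.0]])
angle = torsion_angle(*atoms)

# Rigidly rotate (about the x axis) and translate all atoms: the angle is unchanged.
c, s = np.cos(0.7), np.sin(0.7)
rotation = np.array([[1.0, 0.0, 0.0],
                     [0.0, c, -s],
                     [0.0, s, c]])
moved = atoms @ rotation.T + np.array([3.0, -2.0, 5.0])
angle_moved = torsion_angle(*moved)
```

Because the angle depends only on relative positions, it is unaffected by a rigid rotation plus translation of the whole molecule. This is one simple sense in which torsion angles are SE(3)-invariant quantities.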
Author Information
Octavian Ganea (MIT)
Lagnajit Pattanaik (Massachusetts Institute of Technology)
Connor Coley (MIT)
Regina Barzilay (Massachusetts Institute of Technology)
Klavs Jensen (Massachusetts Institute of Technology)
William Green (Massachusetts Institute of Technology)
Tommi Jaakkola (MIT)
Tommi Jaakkola is a professor of Electrical Engineering and Computer Science at MIT. He received an M.Sc. degree in theoretical physics from Helsinki University of Technology, and a Ph.D. from MIT in computational neuroscience. Following a Sloan postdoctoral fellowship in computational molecular biology, he joined the MIT faculty in 1998. His research interests include statistical inference, graphical models, and large-scale modern estimation problems with predominantly incomplete data.
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Poster: GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles »
Tue. Dec 7th 04:30 -- 06:00 PM
More from the Same Authors
-
2021 : Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development »
Kexin Huang · Tianfan Fu · Wenhao Gao · Yue Zhao · Yusuf Roohani · Jure Leskovec · Connor Coley · Cao Xiao · Jimeng Sun · Marinka Zitnik -
2021 : Consistent Accelerated Inference via Confident Adaptive Transformers »
Tal Schuster · Adam Fisch · Tommi Jaakkola · Regina Barzilay -
2021 : Fragment-Based Sequential Translation for Molecular Optimization »
Benson Chen · Xiang Fu · Regina Barzilay · Tommi Jaakkola -
2021 : Bringing Atomistic Deep Learning to Prime Time »
Nathan Frey · Siddharth Samsi · Bharath Ramsundar · Connor Coley -
2021 : Scalable Geometric Deep Learning on Molecular Graphs »
Nathan Frey · Siddharth Samsi · Lin Li · Connor Coley -
2021 : Crystal Diffusion Variational Autoencoder for Periodic Material Generation »
Tian Xie · Xiang Fu · Octavian Ganea · Regina Barzilay · Tommi Jaakkola -
2022 : DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking »
Gabriele Corso · Hannes Stärk · Bowen Jing · Regina Barzilay · Tommi Jaakkola -
2022 : De novo PROTAC design using graph-based deep generative models »
Divya Nori · Connor Coley · Rocío Mercado -
2022 : Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations »
Xiang Fu · Zhenghao Wu · Wujie Wang · Tian Xie · Sinan Keten · Rafael Gomez-Bombarelli · Tommi Jaakkola -
2022 : Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem »
Jason Yim · Brian L Trippe · Doug Tischer · David Baker · Tamara Broderick · Regina Barzilay · Tommi Jaakkola -
2022 : Is Conditional Generative Modeling all you need for Decision-Making? »
Anurag Ajay · Yilun Du · Abhi Gupta · Josh Tenenbaum · Tommi Jaakkola · Pulkit Agrawal -
2022 : Molecular Docking with Diffusion Generative Models »
Gabriele Corso · Hannes Stärk · Bowen Jing · Regina Barzilay · Tommi Jaakkola -
2022 Spotlight: Poisson Flow Generative Models »
Yilun Xu · Ziming Liu · Max Tegmark · Tommi Jaakkola -
2022 Spotlight: Lightning Talks 6B-1 »
Yushun Zhang · Duc Nguyen · Jiancong Xiao · Wei Jiang · Yaohua Wang · Yilun Xu · Zhen LI · Anderson Ye Zhang · Ziming Liu · Fangyi Zhang · Gilles Stoltz · Congliang Chen · Gang Li · Yanbo Fan · Ruoyu Sun · Naichen Shi · Yibo Wang · Ming Lin · Max Tegmark · Lijun Zhang · Jue Wang · Ruoyu Sun · Tommi Jaakkola · Senzhang Wang · Zhi-Quan Luo · Xiuyu Sun · Zhi-Quan Luo · Tianbao Yang · Rong Jin -
2022 : A High-Throughput Platform for Efficient Exploration of Polypeptides Chemical Space via Automation and Machine Learning »
Guangqi Wu · Connor Coley · Hua Lu -
2022 : Automated Materials Synthesis Keynote »
Connor Coley -
2022 : MolPAL: Software for Sample Efficient High-Throughput Virtual Screening »
David Graff · Connor Coley -
2022 : Invited Talk: Tommi Jaakkola »
Tommi Jaakkola -
2022 Poster: Reinforced Genetic Algorithm for Structure-based Drug Design »
Tianfan Fu · Wenhao Gao · Connor Coley · Jimeng Sun -
2022 Poster: Torsional Diffusion for Molecular Conformer Generation »
Bowen Jing · Gabriele Corso · Jeffrey Chang · Regina Barzilay · Tommi Jaakkola -
2022 Poster: Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization »
Wenhao Gao · Tianfan Fu · Jimeng Sun · Connor Coley -
2022 Poster: Poisson Flow Generative Models »
Yilun Xu · Ziming Liu · Max Tegmark · Tommi Jaakkola -
2021 : Data Opportunities: unsolved medical problems and where new data can help »
Bin Yu · Regina Barzilay · Marzyeh Ghassemi · Emma Pierson -
2021 : Session 2 Keynote 1 »
Regina Barzilay -
2021 : Invited Talk 5: Regina Barzilay: Infusing biology into molecular models for property prediction »
Regina Barzilay -
2021 : AI X Chemistry »
Connor Coley -
2021 Poster: Learning Graph Models for Retrosynthesis Prediction »
Vignesh Ram Somnath · Charlotte Bunne · Connor Coley · Andreas Krause · Regina Barzilay -
2021 Poster: Understanding Interlocking Dynamics of Cooperative Rationalization »
Mo Yu · Yang Zhang · Shiyu Chang · Tommi Jaakkola -
2020 : Spotlight Talk: Message Passing Networks for Molecules with Tetrahedral Chirality - Lagnajit Pattanaik, Octavian Ganea, Ian Coley, Klavs Jensen, William Green and Connor Coley. »
Lagnajit Pattanaik -
2019 Poster: Solving graph compression via optimal transport »
Vikas Garg · Tommi Jaakkola -
2019 Poster: Generative Models for Graph-Based Protein Design »
John Ingraham · Vikas Garg · Regina Barzilay · Tommi Jaakkola -
2019 Poster: Direct Optimization through $\arg \max$ for Discrete Variational Auto-Encoder »
Guy Lorberbom · Andreea Gane · Tommi Jaakkola · Tamir Hazan -
2019 Poster: Retrosynthesis Prediction with Conditional Graph Logic Network »
Hanjun Dai · Chengtao Li · Connor Coley · Bo Dai · Le Song -
2019 Poster: Tight Certificates of Adversarial Robustness for Randomly Smoothed Classifiers »
Guang-He Lee · Yang Yuan · Shiyu Chang · Tommi Jaakkola -
2019 Poster: A Game Theoretic Approach to Class-wise Selective Rationalization »
Shiyu Chang · Yang Zhang · Mo Yu · Tommi Jaakkola -
2018 : Invited Talk Session 3 »
Alexandre Tkatchenko · Tommi Jaakkola · Jennifer Wei -
2018 Poster: Hyperbolic Neural Networks »
Octavian Ganea · Gary Becigneul · Thomas Hofmann -
2018 Spotlight: Hyperbolic Neural Networks »
Octavian Ganea · Gary Becigneul · Thomas Hofmann -
2018 Poster: Towards Robust Interpretability with Self-Explaining Neural Networks »
David Alvarez-Melis · Tommi Jaakkola -
2017 : Machine Learning in Organic Synthesis Planning And Execution »
Klavs Jensen -
2017 Poster: Local Aggregative Games »
Vikas Garg · Tommi Jaakkola -
2017 Poster: Style Transfer from Non-Parallel Text by Cross-Alignment »
Tianxiao Shen · Tao Lei · Regina Barzilay · Tommi Jaakkola -
2017 Spotlight: Style Transfer from Non-parallel Text by Cross-Alignment »
Tianxiao Shen · Tao Lei · Regina Barzilay · Tommi Jaakkola -
2017 Poster: Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network »
Wengong Jin · Connor Coley · Regina Barzilay · Tommi Jaakkola -
2016 Poster: Learning Tree Structured Potential Games »
Vikas Garg · Tommi Jaakkola -
2015 Poster: From random walks to distances on unweighted graphs »
Tatsunori Hashimoto · Yi Sun · Tommi Jaakkola -
2015 Poster: Principal Differences Analysis: Interpretable Characterization of Differences between Distributions »
Jonas Mueller · Tommi Jaakkola -
2014 Poster: Controlling privacy in recommender systems »
Yu Xin · Tommi Jaakkola -
2013 Poster: Learning Efficient Random Maximum A-Posteriori Predictors with Non-Decomposable Loss Functions »
Tamir Hazan · Subhransu Maji · Joseph Keshet · Tommi Jaakkola -
2013 Poster: On Sampling from the Gibbs Distribution with Random Maximum A-Posteriori Perturbations »
Tamir Hazan · Subhransu Maji · Tommi Jaakkola -
2012 Workshop: Machine Learning Approaches to Mobile Context Awareness »
Katherine Ellis · Gert Lanckriet · Tommi Jaakkola · Lenny Grokop -
2012 Poster: Convergence Rate Analysis of MAP Coordinate Minimization Algorithms »
Ofer Meshi · Tommi Jaakkola · Amir Globerson -
2011 Tutorial: Linear Programming Relaxations for Graphical Models »
Amir Globerson · Tommi Jaakkola -
2010 Spotlight: More data means less inference: A pseudo-max approach to structured learning »
David Sontag · Ofer Meshi · Tommi Jaakkola · Amir Globerson -
2010 Poster: More data means less inference: A pseudo-max approach to structured learning »
David Sontag · Ofer Meshi · Tommi Jaakkola · Amir Globerson -
2008 Workshop: Approximate inference - how far have we come? »
Amir Globerson · David Sontag · Tommi Jaakkola -
2008 Poster: Clusters and Coarse Partitions in LP Relaxations »
David Sontag · Amir Globerson · Tommi Jaakkola -
2008 Spotlight: Clusters and Coarse Partitions in LP Relaxations »
David Sontag · Amir Globerson · Tommi Jaakkola -
2007 Oral: New Outer Bounds on the Marginal Polytope »
David Sontag · Tommi Jaakkola -
2007 Poster: New Outer Bounds on the Marginal Polytope »
David Sontag · Tommi Jaakkola -
2007 Poster: Fixing Max-Product: Convergent Message Passing Algorithms for MAP LP-Relaxations »
Amir Globerson · Tommi Jaakkola -
2006 Talk: Approximate inference using planar graph decomposition »
Amir Globerson · Tommi Jaakkola -
2006 Poster: Approximate inference using planar graph decomposition »
Amir Globerson · Tommi Jaakkola -
2006 Poster: Game Theoretic Algorithms for Protein-DNA binding »
Luis Perez-Breva · Luis E Ortiz · Chen-Hsiang Yeang · Tommi Jaakkola -
2006 Spotlight: Game Theoretic Algorithms for Protein-DNA binding »
Luis Perez-Breva · Luis E Ortiz · Chen-Hsiang Yeang · Tommi Jaakkola -
2006 Poster: Parameter Expanded Variational Bayesian Methods »
Yuan (Alan) Qi · Tommi Jaakkola