Timezone: »
Multiple Sequence Alignments (MSAs) of homologous sequences contain information on structural and functional constraints and their evolutionary histories. Despite their importance for many downstream tasks, such as structure prediction, MSA generation is often treated as a separate pre-processing step, without any guidance from the application it will be used for. Here, we implement a smooth and differentiable version of the Smith-Waterman pairwise alignment algorithm that enables jointly learning an MSA and a downstream machine learning system in an end-to-end fashion. To demonstrate its utility, we introduce SMURF (Smooth Markov Unaligned Random Field), a new method that jointly learns an alignment and the parameters of a Markov Random Field for unsupervised contact prediction. We find that SMURF mildly improves contact prediction on a diverse set of protein and RNA families. In another application, we demonstrate that connecting our differentiable alignment module to AlphaFold2 and optimizing the learnable alignments leads to improved structure predictions with a higher confidence. This work highlights the potential of differentiable dynamic programming to improve neural network pipelines that rely on an alignment.
Author Information
Samantha Petti (Harvard University)
Nicholas Bhattacharya (UC Berkeley)
Roshan Rao (UC Berkeley)
Justas Dauparas (University of Washington)
Neil Thomas (UC Berkeley)
Juannan Zhou (University of Florida)
Alexander Rush (Cornell University)
Peter Koo (Cold Spring Harbor Laboratory)
I am an Assistant Professor in the Simons Center for Quantitative Biology at Cold Spring Harbor Laboratory. My lab's research explores how representations of biological sequences learned by deep neural networks can provide novel insights into biological processes.
Sergey Ovchinnikov (Harvard)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 : End-to-end learning of multiple sequence alignmentswith differentiable Smith-Waterman »
Mon. Dec 13th 07:30 -- 07:40 PM Room
More from the Same Authors
-
2021 : FLIP: Benchmark tasks in fitness landscape inference for proteins »
Christian Dallago · Jody Mou · Kadina Johnston · Bruce Wittmann · Nicholas Bhattacharya · Samuel Goldman · Ali Madani · Kevin Yang -
2021 : Towards trustworthy explanations with gradient-based attribution methods »
Ethan Labelson · Rohit Tripathy · Peter Koo -
2022 : Predicting conformational landscapes of known and putative fold-switching proteins using AlphaFold2 »
Hannah Wayment-Steele · Sergey Ovchinnikov · Lucy Colwell · Dorothee Kern -
2022 : SWAMPNN: End-to-end protein structures alignment »
Jeanne Trinquier · Samantha Petti · Shihao Feng · Johannes Soeding · Martin Steinegger · Sergey Ovchinnikov -
2022 : Tuned Quadratic Landscapes for Benchmarking Model-Guided Protein Design »
Neil Thomas · Atish Agarwala · David Belanger · Yun Song · Lucy Colwell -
2022 : SWAMPNN: End-to-end protein structures alignment »
Jeanne Trinquier · Samantha Petti · Shihao Feng · Johannes Soeding · Martin Steinegger · Sergey Ovchinnikov -
2022 : Predicting conformational landscapes of known and putative fold-switching proteins using AlphaFold2 »
Hannah Wayment-Steele · Sergey Ovchinnikov · Lucy Colwell · Dorothee Kern -
2021 : Differential Inference: A Criminally Underused Tool. - Alexander Rush - Cornell University »
Alexander Rush -
2021 : Differentiable Programming for Protein Sequences and Structure »
Sergey Ovchinnikov -
2021 : Uncovering motif interactions from convolutional-attention networks for genomics »
Rohan Ghotra · Peter Koo -
2021 Poster: Low-Rank Constraints for Fast Inference in Structured Models »
Justin Chiu · Yuntian Deng · Alexander Rush -
2020 : The structure-fitness landscape of pairwise relations in generative sequence models »
dylan marshall · Peter Koo · Sergey Ovchinnikov -
2020 : Chaok Seok intro »
Sergey Ovchinnikov -
2020 : Afternoon Poster Session »
Roshan Rao -
2020 : Contributed talks intro »
Roshan Rao -
2020 : Possu Huang intro »
Roshan Rao -
2020 Workshop: Machine Learning for Structural Biology »
Raphael Townshend · Stephan Eismann · Ron Dror · Ellen Zhong · Namrata Anand · John Ingraham · Wouter Boomsma · Sergey Ovchinnikov · Roshan Rao · Per Greisen · Rachel Kolodny · Bonnie Berger -
2020 Poster: Latent Template Induction with Gumbel-CRFs »
Yao Fu · Chuanqi Tan · Bin Bi · Mosha Chen · Yansong Feng · Alexander Rush -
2020 Poster: Cascaded Text Generation with Markov Transformers »
Yuntian Deng · Alexander Rush -
2020 Poster: Movement Pruning: Adaptive Sparsity by Fine-Tuning »
Victor Sanh · Thomas Wolf · Alexander Rush -
2019 Poster: Evaluating Protein Transfer Learning with TAPE »
Roshan Rao · Nicholas Bhattacharya · Neil Thomas · Yan Duan · Peter Chen · John Canny · Pieter Abbeel · Yun Song -
2019 Spotlight: Evaluating Protein Transfer Learning with TAPE »
Roshan Rao · Nicholas Bhattacharya · Neil Thomas · Yan Duan · Peter Chen · John Canny · Pieter Abbeel · Yun Song