Timezone: »
A major goal of computational genomics is to understand how sequence patterns, called motifs, interact to regulate gene expression. In principle, convolution-attention networks (CANs) should provide an inductive bias to infer motif interactions; convolutions can capture motifs while self-attention learns their interactions. However, it is unclear the extent to which this is true in practice. Here we perform an empirical study on synthetic data to test the efficacy of uncovering motif interactions in CANs. We find that irrespective of design choice, interpreting local attention (i.e. on an individual sequence basis) is noisy, leading to many false positive motif interactions. To address this issue, we propose Global Interactions via Filter Activity Correlations (GLIFAC). GLIFAC robustly uncovers motif interactions across a wide spectrum of model choices. This work provides guidance on design choices for CANs that lead to better interpretability for regulatory genomics without sacrificing generalization performance.
Author Information
Rohan Ghotra (Cold Spring Harbor Laboratory)
Peter Koo (Cold Spring Harbor Laboratory)
I am an Assistant Professor in the Simons Center for Quantitative Biology at Cold Spring Harbor Laboratory. My lab's research explores how representations of biological sequences learned by deep neural networks can provide novel insights into biological processes.
More from the Same Authors
-
2021 : End-to-end learning of multiple sequence alignmentswith differentiable Smith-Waterman »
Samantha Petti · Nicholas Bhattacharya · Roshan Rao · Justas Dauparas · Neil Thomas · Juannan Zhou · Alexander Rush · Peter Koo · Sergey Ovchinnikov -
2021 : Towards trustworthy explanations with gradient-based attribution methods »
Ethan Labelson · Rohit Tripathy · Peter Koo -
2021 : End-to-end learning of multiple sequence alignmentswith differentiable Smith-Waterman »
Samantha Petti · Nicholas Bhattacharya · Roshan Rao · Justas Dauparas · Neil Thomas · Juannan Zhou · Alexander Rush · Peter Koo · Sergey Ovchinnikov -
2020 : The structure-fitness landscape of pairwise relations in generative sequence models »
dylan marshall · Peter Koo · Sergey Ovchinnikov