Timezone: »
Attribute extrapolation in sample generation is challenging for deep neural networks operating beyond the training distribution. We formulate a new task for extrapolation in sequence generation, focusing on natural language and proteins, and propose GENhance, a generative framework that enhances attributes through a learned latent space. Trained on movie reviews and a computed protein stability dataset, GENhance can generate strongly-positive text reviews and highly stable protein sequences without being exposed to similar data during training. We release our benchmark tasks and models to contribute to the study of generative modeling extrapolation and data-driven design in biology and chemistry.
Author Information
Alvin Chan (Nanyang Technological University)
Ali Madani (Salesforce Research)
Ben Krause (Salesforce)
Nikhil Naik (Salesforce Research)
More from the Same Authors
-
2021 : FLIP: Benchmark tasks in fitness landscape inference for proteins »
Christian Dallago · Jody Mou · Kadina Johnston · Bruce Wittmann · Nicholas Bhattacharya · Samuel Goldman · Ali Madani · Kevin Yang -
2021 Poster: Self-Instantiated Recurrent Units with Dynamic Soft Recursion »
Aston Zhang · Yi Tay · Yikang Shen · Alvin Chan · SHUAI ZHANG -
2020 : Profile Prediction: An Alignment-Based Pre-Training Task for Protein Sequence Models »
Jesse Vig · Ali Madani -
2020 : Contributed Talk - ProGen: Language Modeling for Protein Generation »
Ali Madani · Bryan McCann · Nikhil Naik · · Possu Huang · Richard Socher -
2018 Poster: Maximum-Entropy Fine Grained Classification »
Abhimanyu Dubey · Otkrist Gupta · Ramesh Raskar · Nikhil Naik