Timezone: »

 
De novo PROTAC design using graph-based deep generative models
Divya Nori · Connor Coley · Rocío Mercado

PROteolysis TArgeting Chimeras (PROTACs) are an emerging therapeutic modality that degrade a protein of interest (POI) by marking it for degradation by the proteasome. They often take on a three-component structure consisting of two binding domains and a linker. While a promising modality, it can be challenging to predict whether a new PROTAC will lead to protein degradation as that is dependent on the cooperation of all subunits to form a successful ternary structure. As such, PROTACs remain a laborious and unpredictable modality to design because the functionalities of each component are highly interdependent. Recent developments in artificial intelligence suggest that deep generative models can assist with de novo design of molecules displaying desired properties, yet their application to PROTAC design remains largely unexplored. Additionally, while previous AI-based approaches have optimized the linker component given two active domains, generative models have not yet been applied to optimization of the other two – the warhead and E3 ligand. Here, we show that a graph-based deep generative model (DGM) can be used to propose novel PROTAC structures. The DGM follows the approach of GraphINVENT, a gated-graph neural network which iteratively samples an action space and formulates a sequence of steps to build a new molecular graph. We also demonstrate that this model can be guided towards the generation of PROTACs that are predicted to degrade a POI through policy gradient reinforcement learning. Rewards during RL are applied based on a boosted tree surrogate model that predicts a PROTAC's degradation potential for a POI, showing that a nonlinear scoring function can fine-tune a deep molecular generative model towards desired properties. Using this approach, we achieve a model where activity against IRAK3 (a pseudokinase implicated in oncologic signaling) is predicted for >80% of sampled PROTACs after RL, compared to 50% predicted activity before any fine-tuning.

Author Information

Divya Nori (Massachusetts Institute of Technology)
Divya Nori

Divya is an undergraduate at MIT, studying computer science and mathematics with a minor in biology. As a researcher at the Coley Group in MIT's College of Computing and the Uhler Group at Broad Institute, she works on deep learning for drug development and kernel methods for precision medicine.

Connor Coley (MIT)
Rocío Mercado (Massachusetts Institute of Technology)
Rocío Mercado

Rocío is currently a Postdoctoral Associate in the Coley group at MIT. She received her PhD in Chemistry from UC Berkeley in 2018, after which she did an industrial postdoc in the Molecular AI team at AstraZeneca. Her research interests lie at the intersection of chemistry and artificial intelligence for molecular discovery, with a particular focus on deep generative models and how these can be used to traverse chemical space in search of promising new therapeutics.

More from the Same Authors