DeepRank

Scoring 3D protein-protein interaction models using deep learning

Image by: NIH Image Gallery

Interactions between biomolecules control all cellular processes. Understanding those interactions requires adding a three dimensional structural dimension. Next to experimental structural biology techniques, this can be done by docking, a complementary and high-throughput computational method allowing to model complexes from their known components.

A challenge in docking is scoring – the identification of correct (near-native) models from a large pool of docked models – due to our still limited knowledge of interaction rules. We will tackle this challenge by training deep networks (dNNs) to learn complex interaction patterns from the huge amount of experimental data in the Protein Data Bank (a valuable source of information not yet fully exploited). Our innovative strategy is to treat this problem as a 3D image classification problem: The interfaces of docked models will be represented as 3D images and dNNs will be trained to classify whether they are near-native or not. Unlike other machine learning techniques, dNNs are now able to learn from millions of data without reaching a performance plateau quickly, which is computationally tractable by harvesting GPU and Hadoop technologies.

The resulting scoring function, DeepRank, will markedly enhance our capability to reliably model biomolecular complexes, assisting the scientific community to gain insights into macromolecular aspects of life. It will be implemented in our HADDOCK modelling platform and freely distributed through GitHub and eStep repositories, ensuring a wide dissemination. The impact will be broad since 3D image-based dNNs have applications in many other domains, such as medical diagnoses (MRI), cryo-electron microscopy and computer vision.

Participating organisations

Netherlands eScience Center
Utrecht University
Life Sciences
Life Sciences

Impact

Output

Team

AB
Alexandre M.J.J. Bonvin
Principal investigator
Utrecht University
Cunliang Geng
Research Software Engineer
Netherlands eScience Center
Lars Ridder
Lars Ridder
Research Software Engineer
Netherlands eScience Center
LX
Li Xue
Principal investigator
Radboud University Medical Center
MR
Manon Réau
Research Software Engineer
Utrecht University
Nicolas Renaud
eScience Research Engineer
Netherlands eScience Center
Sonja Georgievska
Sonja Georgievska
eScience Research Engineer
Netherlands eScience Center

Related projects

3D-Vac

Personalized cancer vaccine design through 3D modelling boosted geometric learning

Updated 7 months ago
In progress

Ivresse

Virtual Research Environment for Integrative Modelling of Biomolecular Complexes

Updated 6 days ago
In progress

FEDMix

Fusible evolutionary deep neural network mixture learning from distributed data for robust medical...

Updated 20 months ago
Finished

Integrated omics analysis for small molecule-mediated host-microbiome interactions

Advancing our understanding of molecular mechanisms of health and disease

Updated 24 months ago
Finished

Googling the cancer genome

Identification and prioritization of cancer-causing structural variations in whole genomes

Updated 2 days ago
Finished

3D Printing of human body parts

Deep learning algorithms for more accurate implants

Updated 19 months ago
Finished

Data quality in a distributed learning environment

Vast amounts of data to improve cancer treatment decisions

Updated 24 months ago
Finished

Enhancing Protein-Drug Binding Prediction

Combining molecular simulation and eScience technologies

Updated 7 days ago
Finished

Related software

DeepRank

DE

Deep learning framework for data mining protein-protein interactions using CNN

Updated 12 months ago
79 8

DeepRank2

DE

DeepRank2 is an open-source deep learning framework for data mining of protein-protein interfaces or single-residue missense variants. This package is an improved and unified version of three previously developed packages: DeepRank, DeepRank-GNN and DeepRank-Mut.

Updated 7 months ago
16 11

DeepRank GNN

DE

DeepRank-GNN is the graph neural network of our DeepRank package. DeepRank GNN allows to train graph neural networks to classify protein-protein interface

Updated 18 hours ago
80 2

iScore

IS

A framework and predictor based on support vector machine and random walk graph kernel for scoring protein-protein interfaces.

Updated 12 months ago
73 4

pdb2sql

PD

Fast and versatile Python package that leverages SQL queries to parse, manipulate and process biomolecular structure files. The structure files should be in the PDB format and are available on www.rcsb.org.

Updated 28 months ago
8 2

PSSMGen

PS

Generates consistent PSSM and PDB files for protein-protein complexes

Updated 28 months ago
6 2