Sign in

DeepRank

Scoring 3D protein-protein interaction models using deep learning

Image by: NIH Image Gallery

Interactions between biomolecules control all cellular processes. Understanding those interactions requires adding a three dimensional structural dimension. Next to experimental structural biology techniques, this can be done by docking, a complementary and high-throughput computational method allowing to model complexes from their known components.

A challenge in docking is scoring – the identification of correct (near-native) models from a large pool of docked models – due to our still limited knowledge of interaction rules. We will tackle this challenge by training deep networks (dNNs) to learn complex interaction patterns from the huge amount of experimental data in the Protein Data Bank (a valuable source of information not yet fully exploited). Our innovative strategy is to treat this problem as a 3D image classification problem: The interfaces of docked models will be represented as 3D images and dNNs will be trained to classify whether they are near-native or not. Unlike other machine learning techniques, dNNs are now able to learn from millions of data without reaching a performance plateau quickly, which is computationally tractable by harvesting GPU and Hadoop technologies.

The resulting scoring function, DeepRank, will markedly enhance our capability to reliably model biomolecular complexes, assisting the scientific community to gain insights into macromolecular aspects of life. It will be implemented in our HADDOCK modelling platform and freely distributed through GitHub and eStep repositories, ensuring a wide dissemination. The impact will be broad since 3D image-based dNNs have applications in many other domains, such as medical diagnoses (MRI), cryo-electron microscopy and computer vision.

Participating organisations

Netherlands eScience Center
Utrecht University

Impact

Output

Team

AB
Alexandre M.J.J. Bonvin
Principal investigator
Utrecht University
Cunliang Geng
Cunliang Geng
Research Software Engineer
Netherlands eScience Center
Lars Ridder
Lars Ridder
Research Software Engineer
Netherlands eScience Center
LX
Li Xue
Principal investigator
Radboud University Medical Center
MR
Manon Réau
Research Software Engineer
Utrecht University
Nicolas Renaud
Nicolas Renaud
eScience Research Engineer
Netherlands eScience Center
Sonja Georgievska
Sonja Georgievska
eScience Research Engineer
Netherlands eScience Center

Related projects

FEDMix

Fusible evolutionary deep neural network mixture learning from distributed data for robust medical...

Updated 7 days ago
Finished

Googling the cancer genome

Identification and prioritization of cancer-causing structural variations in whole genomes

Updated 7 days ago
Finished

Data quality in a distributed learning environment

Vast amounts of data to improve cancer treatment decisions

Updated 7 days ago
Finished

Enhancing Protein-Drug Binding Prediction

Combining molecular simulation and eScience technologies

Updated 7 days ago
Finished

Related tools

DeepRank

DE

Deep learning framework for data mining protein-protein interactions using CNN

Updated 5 months ago
4 8

deeprank-core

DE

deeprank-core is the refactorized version of DeepRank GNN, the graph neural network of our DeepRank package. It allows to train graph neural networks to classify protein-protein interface with a greater flexibility for the user.

Updated 5 months ago
2

DeepRank GNN

DE

DeepRank-GNN is the graph neural network of our DeepRank package. DeepRank GNN allows to train graph neural networks to classify protein-protein interface

Updated 5 months ago
1 2

iScore

IS

A framework and predictor based on support vector machine and random walk graph kernel for scoring protein-protein interfaces.

Updated 5 months ago
3 4

pdb2sql

PD

Fast and versatile Python package that leverages SQL queries to parse, manipulate and process biomolecular structure files. The structure files should be in the PDB format and are available on www.rcsb.org.

Updated 5 months ago
3 2

PSSMGen

PS

Generates consistent PSSM and PDB files for protein-protein complexes

Updated 5 months ago
2 2