Googling the cancer genome

Identification and prioritization of cancer-causing structural variations in whole genomes

Image: WallpaperUP

Structural variants (SVs) are a class of mutations that have been associated with cancer formation and progression. Cancer-associated SVs can be used to determine the cancer subtype, to monitor its progression, and to develop novel targeted treatments. However, SV analysis for personalized medicine poses many computational and algorithmic challenges. To address these challenges, we developed a suite of methods for SV simulation, detection, and filtering.

sv-callers is a computational workflow that enables highly reproducible, portable, and scalable deployment and execution of state-of-the-art SV detection algorithms across multiple high-performance computing architectures. sv-gen is a workflow that generates artificial read alignment data from genomes in which multiple types of SVs have been introduced at known positions. These data can be used to study how SV signals arise at SV breakpoint positions and to benchmark SV calling methods. sv-channels is a novel deep learning-based approach for SV calling and filtering that uses one-dimensional convolutional neural networks to distinguish true SVs from regions that do not contain SVs.

These methods aim to improve the accuracy and cost-efficiency of SV analysis in clinical studies, which will help realize the potential of cancer genomics for personalized medicine.
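
To make the idea of introducing SVs at known positions concrete, here is a minimal sketch in Python. It is not taken from sv-gen and does not reflect how that workflow is implemented. The names `StructuralVariant` and `apply_variants` are hypothetical, the toy reference sequence is invented, and only two SV types (deletion and inversion) are handled. The point is simply that the list of variants doubles as a truth set against which an SV caller could later be benchmarked.

```python
from dataclasses import dataclass

# Translation table used to complement a DNA string (assumes only A, C, G, T).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


@dataclass(frozen=True)
class StructuralVariant:
    """Hypothetical truth record for one simulated SV."""
    kind: str   # "DEL" (deletion) or "INV" (inversion)
    start: int  # 0-based, inclusive, in reference coordinates
    end: int    # 0-based, exclusive, in reference coordinates


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def apply_variants(reference: str, variants: list[StructuralVariant]) -> str:
    """Apply non-overlapping SVs to a reference and return the altered sequence."""
    pieces = []
    cursor = 0  # next reference position that has not been copied yet
    for sv in sorted(variants, key=lambda v: v.start):
        if sv.start < cursor or sv.start >= sv.end or sv.end > len(reference):
            raise ValueError(f"invalid or overlapping variant: {sv}")
        # Copy the unchanged stretch before the variant.
        pieces.append(reference[cursor:sv.start])
        if sv.kind == "INV":
            # An inversion replaces the segment with its reverse complement.
            pieces.append(reverse_complement(reference[sv.start:sv.end]))
        elif sv.kind != "DEL":
            raise ValueError(f"unsupported SV type: {sv.kind}")
        # A deletion contributes nothing; in both cases skip past the segment.
        cursor = sv.end
    pieces.append(reference[cursor:])
    return "".join(pieces)


# Toy example: the truth set records exactly where each SV was introduced.
reference = "ACGTACGTAAGGGATT"
truth = [StructuralVariant("DEL", 2, 6), StructuralVariant("INV", 10, 14)]
mutated = apply_variants(reference, truth)
print(mutated)
```

In a real benchmark the altered genome would be passed to a read simulator and an aligner before SV calling; those steps are outside the scope of this sketch.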

Participating organisations

Netherlands eScience Center
University Medical Center Utrecht
Life Sciences

Impact

Output

Portable HPC workflows with Snakemake, Conda, and Xenon

Author(s): Jurriaan H. Spaaks
Published in 2018

Teaching machines to recognize cancer

Author(s): Netherlands eScience Center
Published in 2017

Team

Arnold Kuzniar
eScience Research Engineer
Netherlands eScience Center
Jeroen de Ridder
Principal investigator
University Medical Center Utrecht
Lars Ridder
eScience Coordinator
Netherlands eScience Center
Luca Santuari
PhD student
University Medical Center Utrecht
Sonja Georgievska
eScience Research Engineer
Netherlands eScience Center

Related projects

TADPOLE-SHARE

Sharing TADPOLE’s algorithms for reuse and evaluation

Updated 20 months ago
Finished

Integrated omics analysis for small molecule-mediated host-microbiome interactions

Advancing our understanding of molecular mechanisms of health and disease

Updated 24 months ago
Finished

DeepRank

Scoring 3D protein-protein interaction models using deep learning

Updated 20 months ago
Finished

3D Printing of human body parts

Deep learning algorithms for more accurate implants

Updated 19 months ago
Finished

Data quality in a distributed learning environment

Vast amounts of data to improve cancer treatment decisions

Updated 24 months ago
Finished

Classifying activity types

Gaining insights from wearable movement sensors

Updated 20 months ago
Finished

Diagnosis of active epilepsy in resource-poor setting

Prediction models based on EEG characteristics

Updated 20 months ago
Finished

Enhancing Protein-Drug Binding Prediction

Combining molecular simulation and eScience technologies

Updated 7 days ago
Finished

Biomarker Boosting

Better biomarkers through data sharing

Updated 20 months ago
Finished

TraIT

A sustainable infrastructure for translational medical research

Updated 20 months ago
Finished

Related software

mcfly

Helps you find a suitable neural network configuration for deep learning on time series.

Updated 23 months ago

sv-callers

Highly portable parallel workflow to detect structural variants in cancer genomes.

Updated 13 months ago

sv-channels

Genome-wide detection of structural variants using deep learning

Updated 28 months ago

sv-gen

Highly portable parallel workflow to generate artificial genomes with structural variants.

Updated 28 months ago

Xenon

If you are using remote machines to do your computations, and don’t feel like learning and implementing many different APIs, Xenon is the tool for you.

Updated 13 months ago

Xenon command line interface

A command line interface for the Xenon library that allows you to use remote machines to do your computations.

Updated 13 months ago

yatiml

Python library for YAML type inference, schema checking and syntactic sugar.

Updated 28 months ago