
Integrated omics analysis for small molecule-mediated host-microbiome interactions

Advancing our understanding of molecular mechanisms of health and disease

The microbes in our bodies are fundamental to our health. At the molecular level, many of their interactions with human tissues are mediated by microbial specialized metabolites.

While metabolomics is a powerful technique for profiling these metabolites, most microbial molecules have unknown structures; hence, over 95% of detected masses cannot be functionally interpreted or linked to their producers. This currently thwarts efforts to understand important disease states of our microbiome.

Many innovative computational workflows have recently been designed to predict molecular (sub)structures from genomic or metabolomic data; however, these efforts have remained largely unconnected. Integrating these data will allow the partial information provided by each field to complement the other, yielding much better functional predictions.

Moreover, it will connect vital information from both data types: while metabolomics informs about in vivo relevance, genomics informs about biological origin. Here, we propose to design a novel algorithm to connect molecular substructures identified in tandem mass-spectrometric data to sets of genes within biosynthetic gene clusters (BGCs) detected in (meta)genomic data. Subsequently, we will integrate this algorithm with our previous methods for metabolome (spectral networking, substructure detection) and genome analysis (BGC identification and clustering) in one comprehensive eScience workflow.

Finally, we will demonstrate its potential by identifying molecules prominent during periods of relapse in a longitudinal study of inflammatory bowel disease (IBD) and connecting them to their producers. Ultimately, our workflow will illuminate the vast unknown metabolic space within the human microbial metabolome, and greatly advance our understanding of molecular mechanisms of health and disease.

Participating organisations

Life Sciences
Netherlands eScience Center
Wageningen University & Research




Florian Huber
eScience Research Engineer
Netherlands eScience Center
Justin J. J. van der Hooft
Principal investigator
Wageningen University and Research
Lars Ridder
eScience Coordinator
Netherlands eScience Center
Stefan Verhoeven
Senior eScience Research Engineer
Netherlands eScience Center

Related projects


A community-supported workflow connecting microbial genes and organisms to their molecular products

Updated 2 months ago
In progress


Fusible evolutionary deep neural network mixture learning from distributed data for robust medical...

Updated 14 months ago


Scoring 3D protein-protein interaction models using deep learning

Updated 14 months ago

Googling the cancer genome

Identification and prioritization of cancer-causing structural variations in whole genomes

Updated 3 months ago

Classifying activity types

Gaining insights from wearable movement sensors

Updated 14 months ago

Enhancing Protein-Drug Binding Prediction

Combining molecular simulation and eScience technologies

Updated 14 months ago

Related software



Python library for fuzzy comparison of mass spectrum data and other Python objects

Updated 8 months ago



Deep learning based similarity measure of mass spectrometry data.

Updated 23 months ago

Paired omics data platform


If you do metabolomics experiments with mass spectra and have sequenced the genomes of the samples, then the platform can help you link them.

Updated 23 months ago



spec2vec is a novel similarity measure for comparing mass spectrometry data, which learns peak representations using Word2Vec.

Updated 2 weeks ago