spec2vec

spec2vec is a novel similarity measure for comparing mass spectrometry data, which learns peak representations using Word2Vec.

164 mentions, 10 contributors

What spec2vec can do for you

  • Learns abstract representations of mass spectra from large mass spectral data sets (unsupervised learning).
  • Computes mass spectra similarities that show a high correlation with actual molecular similarity.

Spec2vec is a novel spectral similarity score inspired by Word2Vec, a natural language processing algorithm. Where Word2Vec learns relationships between words in sentences, spec2vec learns relationships between mass fragments and neutral losses in MS/MS spectra. The spectral similarity score is based on spectral embeddings learnt from the relationships between fragments within a large set of spectral data.
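
The idea can be illustrated with a minimal sketch in plain Python. It is not the spec2vec implementation: the function names, the "peak@"/"loss@" word format, the intensity weighting and the two-dimensional word vectors below are all assumptions made for illustration, and in practice the word vectors would be learnt by Word2Vec from a large spectral data set rather than written by hand.

```python
import math

# Hypothetical, hand-written word vectors. A real model would learn these
# from many spectra with Word2Vec; the values here are illustrative only.
TOY_WORD_VECTORS = {
    "peak@100.00": [1.0, 0.0],
    "peak@150.00": [0.8, 0.6],
    "loss@100.00": [0.0, 1.0],
    "loss@50.00": [-0.6, 0.8],
}


def spectrum_to_document(peaks, precursor_mz, n_decimals=2):
    """Turn (m/z, intensity) peaks into weighted word-like tokens.

    Each peak yields a "peak@..." word and, if it lies below the
    precursor m/z, a "loss@..." word for the neutral loss.
    """
    document = []
    for mz, intensity in peaks:
        document.append((f"peak@{mz:.{n_decimals}f}", intensity))
        loss = precursor_mz - mz
        if loss > 0:
            document.append((f"loss@{loss:.{n_decimals}f}", intensity))
    return document


def embed(document, word_vectors):
    """Sum the word vectors of a document, weighted by intensity."""
    dim = len(next(iter(word_vectors.values())))
    vector = [0.0] * dim
    for word, weight in document:
        if word in word_vectors:  # words unknown to the model are skipped
            for i, value in enumerate(word_vectors[word]):
                vector[i] += weight * value
    return vector


def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Two toy spectra sharing one fragment and one neutral loss.
doc_a = spectrum_to_document([(100.0, 1.0), (150.0, 0.5)], precursor_mz=200.0)
doc_b = spectrum_to_document([(150.0, 1.0)], precursor_mz=200.0)

similarity = cosine(embed(doc_a, TOY_WORD_VECTORS), embed(doc_b, TOY_WORD_VECTORS))
print(f"toy similarity: {similarity:.3f}")
```

Because the similarity is computed between embedding vectors rather than by matching peaks one to one, two spectra can score highly even when few of their peaks coincide, provided their fragments and losses tend to occur in similar contexts in the training data.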

Programming languages
  • Python 98%
  • Batchfile 1%
  • Makefile 1%

Participating organisations

Wageningen University & Research
Netherlands eScience Center
Hochschule Düsseldorf University of Applied Sciences
Life Sciences
University of Glasgow

Mentions

Build a mass spectrometry analysis pipeline in Python using matchms — part II: Spec2Vec

Author(s): Florian Huber
Published in 2021

Build a mass spectrometry analysis pipeline in Python using matchms — part III: molecular…

Author(s): Florian Huber
Published in 2021

Contributors

Adam Belloum
Faruk Diblen
Florian Huber
Hanno Spreeuw
Jurriaan H. Spaaks
Justin J. J. van der Hooft
SR

Related projects

Integrated omics analysis for small molecule-mediated host-microbiome interactions

Advancing our understanding of molecular mechanisms of health and disease

Updated 24 months ago
Finished

Related software

matchms

Python library for fuzzy comparison of mass spectrum data and other Python objects

Updated 13 months ago
90 14