Identification of metabolites from tandem mass spectra with a machine learning approach utilizing structural features
release_ix4gmusqkndbvolrr4x6twavni

Published in Bioinformatics by Oxford University Press (OUP).

2019, Volume 36, Issue 4, pp. 1213-1218

Abstract

Untargeted mass spectrometry (MS/MS) is a powerful method for detecting metabolites in biological samples. However, fast and accurate identification of the metabolites' structures from MS/MS spectra is still a great challenge. We present a new analysis method, called SubFragment-Matching (SF-Matching), which is based on the hypothesis that molecules with similar structural features will exhibit similar fragmentation patterns. We combine information on fragmentation patterns of molecules with shared substructures and then use random forest models to predict whether a given structure can yield a certain fragmentation pattern. These models can then be used to score candidate molecules for a given mass spectrum. For rapid identification, we pre-compute such scores for common biological molecular structure databases. Using benchmarking datasets, we find that our method has similar performance to CSI:FingerID and that very high accuracies can be achieved by combining our method with CSI:FingerID. Rarefaction analysis of the training dataset shows that the performance of our method will increase as more experimental data become available. SF-Matching is available from http://www.bork.embl.de/Docu/sf_matching. Supplementary data are available at Bioinformatics online.
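
The candidate-scoring step described in the abstract can be illustrated with a minimal sketch. It is not the SF-Matching implementation: the feature representation, the pattern names, the model interface and the log-probability sum are all assumptions made for illustration. In the paper, trained random forest models estimate whether a structure can yield a fragmentation pattern; here simple hand-written functions stand in for them.

```python
import math
from typing import Callable, Dict, FrozenSet, List, Tuple

# Hypothetical representation: a candidate structure is a set of substructure labels.
Features = FrozenSet[str]
# Hypothetical model interface: returns P(pattern is observed | structure).
PatternModel = Callable[[Features], float]


def score_candidate(features: Features,
                    observed_patterns: List[str],
                    models: Dict[str, PatternModel],
                    eps: float = 1e-6) -> float:
    """Sum the log-probabilities that the candidate yields each observed pattern."""
    total = 0.0
    for pattern in observed_patterns:
        model = models.get(pattern)
        if model is None:
            continue  # no model available for this pattern, so it is skipped
        # Clamp to avoid log(0) when a model is fully confident.
        p = min(max(model(features), eps), 1.0 - eps)
        total += math.log(p)
    return total


def rank_candidates(candidates: Dict[str, Features],
                    observed_patterns: List[str],
                    models: Dict[str, PatternModel]) -> List[Tuple[str, float]]:
    """Return (name, score) pairs sorted from best to worst score."""
    scored = [(name, score_candidate(feats, observed_patterns, models))
              for name, feats in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy stand-ins for trained models (assumed values, not taken from the paper).
toy_models: Dict[str, PatternModel] = {
    "loss_H2O": lambda f: 0.9 if "hydroxyl" in f else 0.1,
    "loss_CO2": lambda f: 0.8 if "carboxyl" in f else 0.05,
}

# Two hypothetical candidates with the same mass but different substructures.
toy_candidates: Dict[str, Features] = {
    "candidate_A": frozenset({"hydroxyl", "carboxyl"}),
    "candidate_B": frozenset({"hydroxyl"}),
}

ranking = rank_candidates(toy_candidates, ["loss_H2O", "loss_CO2"], toy_models)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Summing log-probabilities treats the patterns as independent, which is a simplification chosen for this sketch. Because the score depends only on the candidate and the pattern, such scores could be computed ahead of time for a whole structure database, which is the kind of pre-computation the abstract mentions.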

Archived Files and Locations

application/pdf  1.7 MB
file_c3jjmrasnvbj3hb7sxt4v5b7uu
edoc.mdc-berlin.de (web)
web.archive.org (webarchive)
application/pdf  3.0 MB
file_q6yg2je2sveebgp5zb5hezblua
archive-ouverte.unige.ch (web)
web.archive.org (webarchive)
Type  article-journal
Stage   published
Date   2020-02-15
Language   en
Container Metadata
Not in DOAJ
In Keepers Registry
ISSN-L:  1367-4803
Catalog Record
Revision: 6a840686-52c2-440e-95c6-d3ada8cc5772