Reproducible mass spectrometry data processing and compound annotation in MZmine 3Show others and affiliations
2024 (English)In: Nature Protocols, ISSN 1754-2189, E-ISSN 1750-2799, Vol. 19, no 9, p. 2597-2641Article, review/survey (Refereed) Published
Abstract [en]
Untargeted mass spectrometry (MS) experiments produce complex, multidimensional data that are practically impossible to investigate manually. For this reason, computational pipelines are needed to extract relevant information from raw spectral data and convert it into a more comprehensible format. Depending on the sample type and/or goal of the study, a variety of MS platforms can be used for such analysis. MZmine is an open-source software for the processing of raw spectral data generated by different MS platforms. Examples include liquid chromatography-MS, gas chromatography-MS and MS-imaging. These data might typically be associated with various applications including metabolomics and lipidomics. Moreover, the third version of the software, described herein, supports the processing of ion mobility spectrometry (IMS) data. The present protocol provides three distinct procedures to perform feature detection and annotation of untargeted MS data produced by different instrumental setups: liquid chromatography-(IMS-)MS, gas chromatography-MS and (IMS-)MS imaging. For training purposes, example datasets are provided together with configuration batch files (i.e., list of processing steps and parameters) to allow new users to easily replicate the described workflows. Depending on the number of data files and available computing resources, we anticipate this to take between 2 and 24 h for new MZmine users and nonexperts. Within each procedure, we provide a detailed description for all processing parameters together with instructions/recommendations for their optimization. The main generated outputs are represented by aligned feature tables and fragmentation spectra lists that can be used by other third-party tools for further downstream analysis.
Place, publisher, year, edition, pages
Nature Publishing Group, 2024. Vol. 19, no 9, p. 2597-2641
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:oru:diva-113749DOI: 10.1038/s41596-024-00996-yISI: 001228380000001PubMedID: 38769143Scopus ID: 2-s2.0-85193731416OAI: oai:DiVA.org:oru-113749DiVA, id: diva2:1859683
Funder
EU, Horizon 2020
Note
Study Protocol
T.P. is supported by the Czech Science Foundation (GA CR) grant 21-11563M and by the European Union's Horizon 2020 research and innovation programme under Marie Sklodowska-Curie grant agreement no. 891397. T.D. is supported by the European Regional Development Fund, Programme Johannes Amos Comenius project 'IOCB MSCA PF Mobility' no. CZ.02.01.01/00/22_010/0002733. C.B. is supported by the Czech Academy of Sciences Program to Support Prospective Human Resources. A.S. and X.D. are supported by the National Institutes of Health grant U01CA235507. P.C.D. is supported by R01GM107550, R03OD034493, R01DK136117 and NSF 2152526.
2024-05-222024-05-222025-01-20Bibliographically approved