A computational framework to integrate high-throughput '-omics' datasets for the identification of potential mechanistic linksShow others and affiliations
2018 (English)In: Nature Protocols, ISSN 1754-2189, E-ISSN 1750-2799, Vol. 13, no 12, p. 2781-2800Article in journal (Refereed) Published
Abstract [en]
We recently presented a three-pronged association study that integrated human intestinal microbiome data derived from shotgun-based sequencing with untargeted serum metabolome data and measures of host physiology. Metabolome and microbiome data are high dimensional, posing a major challenge for data integration. Here, we present a step-by-step computational protocol that details and discusses the dimensionality-reduction techniques used and methods for subsequent integration and interpretation of such heterogeneous types of data. Dimensionality reduction was achieved through a combination of data normalization approaches, binning of co-abundant genes and metabolites, and integration of prior biological knowledge. The use of prior knowledge to overcome functional redundancy across microbiome species is one central advance of our method over available alternative approaches. Applying this framework, other investigators can integrate various '-omics' readouts with variables of host physiology or any other phenotype of interest (e.g., connecting host and microbiome readouts to disease severity or treatment outcome in a clinical cohort) in a three-pronged association analysis to identify potential mechanistic links to be tested in experimental settings. Although we originally developed the framework for a human metabolome-microbiome study, it is generalizable to other organisms and environmental metagenomes, as well as to studies including other -omics domains such as transcriptomics and proteomics. The provided R code runs in ~1 h on a standard PC.
Place, publisher, year, edition, pages
Nature Publishing Group, 2018. Vol. 13, no 12, p. 2781-2800
National Category
Bioinformatics and Computational Biology Medical Biotechnology (with a focus on Cell Biology (including Stem Cell Biology), Molecular Biology, Microbiology, Biochemistry or Biopharmacy)
Identifiers
URN: urn:nbn:se:oru:diva-69996DOI: 10.1038/s41596-018-0064-zISI: 000451343400004PubMedID: 30382244Scopus ID: 2-s2.0-85055979835OAI: oai:DiVA.org:oru-69996DiVA, id: diva2:1261532
Funder
Novo Nordisk, NNF14CC0001
Note
Funding Agencies:
European Community HEALTH-F4-2007-201052
MetaCardis HEALTH-2012-305312
Innovative Medicines Initiative Joint Undertaking 115317
Agence Nationale de la Recherche MetaGenoPolis grant 'Investissements d'avenir' ANR-11-DPBS-0001
Lundbeck Foundation R218-2016-1367
European Union
EFPIA
2018-11-072018-11-072025-02-05Bibliographically approved