oru.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Biomarker discovery in heterogeneous tissue samples: taking the in-silico deconfounding approach
Department of Genetics and Biometry, Research Institute for the Biology of Farm Animals, Dummerstorf, Germany.ORCID iD: 0000-0002-7173-5579
Bioinformatics Chair, Institute for Biochemistry and Biology at the University of Potsdam, Potsdam-Golm, Germany.
Department of Genetics and Biometry, Research Institute for the Biology of Farm Animals, Dummerstorf, Germany .
Molecular Biology and Human Genetics, University of Stellenbosch, Tygerberg, Cape Town, South Africa .
Show others and affiliations
2010 (English)In: BMC Bioinformatics, ISSN 1471-2105, E-ISSN 1471-2105, Vol. 11, article id 27Article in journal (Refereed) Published
Abstract [en]

Background: For heterogeneous tissues, such as blood, measurements of gene expression are confounded by relative proportions of cell types involved. Conclusions have to rely on estimation of gene expression signals for homogeneous cell populations, e.g. by applying micro-dissection, fluorescence activated cell sorting, or in-silico deconfounding. We studied feasibility and validity of a non-negative matrix decomposition algorithm using experimental gene expression data for blood and sorted cells from the same donor samples. Our objective was to optimize the algorithm regarding detection of differentially expressed genes and to enable its use for classification in the difficult scenario of reversely regulated genes. This would be of importance for the identification of candidate biomarkers in heterogeneous tissues.

Results: Experimental data and simulation studies involving noise parameters estimated from these data revealed that for valid detection of differential gene expression, quantile normalization and use of non-log data are optimal. We demonstrate the feasibility of predicting proportions of constituting cell types from gene expression data of single samples, as a prerequisite for a deconfounding-based classification approach.Classification cross-validation errors with and without using deconfounding results are reported as well as sample-size dependencies. Implementation of the algorithm, simulation and analysis scripts are available.

Conclusions: The deconfounding algorithm without decorrelation using quantile normalization on non-log data is proposed for biomarkers that are difficult to detect, and for cases where confounding by varying proportions of cell types is the suspected reason. In this case, a deconfounding ranking approach can be used as a powerful alternative to, or complement of, other statistical learning approaches to define candidate biomarkers for molecular diagnosis and prediction in biomedicine, in realistically noisy conditions and with moderate sample sizes.

Place, publisher, year, edition, pages
London, UK: BioMed Central, 2010. Vol. 11, article id 27
National Category
Bioinformatics and Systems Biology
Identifiers
URN: urn:nbn:se:oru:diva-40628DOI: 10.1186/1471-2105-11-27ISI: 000275199300001PubMedID: 20070912Scopus ID: 2-s2.0-77249157887OAI: oai:DiVA.org:oru-40628DiVA, id: diva2:778009
Available from: 2015-01-09 Created: 2015-01-09 Last updated: 2018-01-30Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records BETA

Repsilber, Dirk

Search in DiVA

By author/editor
Repsilber, Dirk
In the same journal
BMC Bioinformatics
Bioinformatics and Systems Biology

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 60 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf