Till Örebro universitet

oru.seÖrebro universitets publikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Multitask Modeling with Confidence Using Matrix Factorization and Conformal Prediction
Swetox, Karolinska Institute, Unit of Toxicology Sciences, Södertälje, Sweden; Department of Computer and Systems Sciences, Stockholm University, Kista, Sweden.ORCID-id: 0000-0003-3107-331X
Alzheimer's Research UK UCL Drug Discovery Institute, University College, London, England; Francis Crick Institute, London, England.ORCID-id: 0000-0002-5556-8133
2019 (Engelska)Ingår i: Journal of Chemical Information and Modeling, ISSN 1549-9596, E-ISSN 1549-960X, Vol. 59, nr 4, s. 1598-1604Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Multitask prediction of bioactivities is often faced with challenges relating to the sparsity of data and imbalance between different labels. We propose class conditional (Mondrian) conformal predictors using underlying Macau models as a novel approach for large scale bioactivity prediction. This approach handles both high degrees of missing data and label imbalances while still producing high quality predictive models. When applied to ten assay end points from PubChem, the models generated valid models with an efficiency of 74.0-80.1% at the 80% confidence level with similar performance both for the minority and majority class. Also when deleting progressively larger portions of the available data (0-80%) the performance of the models remained robust with only minor deterioration (reduction in efficiency between 5 and 10%). Compared to using Macau without conformal prediction the method presented here significantly improves the performance on imbalanced data sets.

Ort, förlag, år, upplaga, sidor
Washington: American Chemical Society (ACS), 2019. Vol. 59, nr 4, s. 1598-1604
Nationell ämneskategori
Farmakologi och toxikologi Bioinformatik och beräkningsbiologi Datavetenskap (datalogi)
Identifikatorer
URN: urn:nbn:se:oru:diva-83145DOI: 10.1021/acs.jcim.9b00027ISI: 000465644500030PubMedID: 30908915Scopus ID: 2-s2.0-85064354183OAI: oai:DiVA.org:oru-83145DiVA, id: diva2:1440452
Anmärkning

Forskningsfinansiärer:

Alzheimer's Research UK, Grant Number: 1077089, SC042474

Cancer Research UK, Grant Number: FC001002

UK Medical Research Council, Grant Number: FC001002

Wellcome Trust, Grant Number: FC001002

Tillgänglig från: 2019-05-27 Skapad: 2020-06-15 Senast uppdaterad: 2025-02-05Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextPubMedScopus

Person

Norinder, Ulf

Sök vidare i DiVA

Av författaren/redaktören
Norinder, UlfSvensson, Fredrik
I samma tidskrift
Journal of Chemical Information and Modeling
Farmakologi och toxikologiBioinformatik och beräkningsbiologiDatavetenskap (datalogi)

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetricpoäng

doi
pubmed
urn-nbn
Totalt: 89 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf