Predicting the Skin Sensitization Potential of Small Molecules with Machine Learning Models Trained on Biologically Meaningful DescriptorsShow others and affiliations
2021 (English)In: Pharmaceuticals, E-ISSN 1424-8247, Vol. 14, no 8, article id 790Article in journal (Refereed) Published
Abstract [en]
In recent years, a number of machine learning models for the prediction of the skin sensitization potential of small organic molecules have been reported and become available. These models generally perform well within their applicability domains but, as a result of the use of molecular fingerprints and other non-intuitive descriptors, the interpretability of the existing models is limited. The aim of this work is to develop a strategy to replace the non-intuitive features by predicted outcomes of bioassays. We show that such replacement is indeed possible and that as few as ten interpretable, predicted bioactivities are sufficient to reach competitive performance. On a holdout data set of 257 compounds, the best model ("Skin Doctor CP:Bio") obtained an efficiency of 0.82 and an MCC of 0.52 (at the significance level of 0.20). Skin Doctor CP:Bio is available free of charge for academic research. The modeling strategies explored in this work are easily transferable and could be adopted for the development of more interpretable machine learning models for the prediction of the bioactivity and toxicity of small organic compounds.
Place, publisher, year, edition, pages
MDPI, 2021. Vol. 14, no 8, article id 790
Keywords [en]
Bioactivity descriptors, conformal prediction, in silico prediction, machine learning, random forest, skin sensitization, toxicity prediction
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:oru:diva-93959DOI: 10.3390/ph14080790ISI: 000689893400001PubMedID: 34451887Scopus ID: 2-s2.0-85113182766OAI: oai:DiVA.org:oru-93959DiVA, id: diva2:1589332
Note
Funding agencies:
Trond Mohn Foundation BFS2017TMT01
German Research Foundation (DFG)KI 2085/1-1
Beiersdorf AG through HITeC e.V.
2021-08-312021-08-312024-01-16Bibliographically approved