To Örebro University

oru.seÖrebro University Publications
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 51) Show all publications
Hinojosa, M., Johanson, G., Norinder, U. & Forsby, A. (2026). Classification of industrial chemicals for respiratory chemosensory irritation using the TRPV1-expressing neuronal SH-SY5Y cell model and machine learning. Archives of Toxicology, 100, 1301-1320
Open this publication in new window or tab >>Classification of industrial chemicals for respiratory chemosensory irritation using the TRPV1-expressing neuronal SH-SY5Y cell model and machine learning
2026 (English)In: Archives of Toxicology, ISSN 0340-5761, E-ISSN 1432-0738, Vol. 100, p. 1301-1320Article in journal (Refereed) Published
Abstract [en]

Respiratory sensory irritation is the basis for many occupational exposure limits. Irritation thresholds have hitherto mainly been identified in humans by questionnaires and in mice by measuring the inhaled concentration causing 50% respiratory depression (RD50). Both methods are ethically questionable. We investigated an alternative New Approach Methodology (NAM) approach, namely the neuronal SH-SY5Y cell model expressing the sensory receptor TRPV1 in combination with random forest-based machine learning. The intracellular Ca2+ concentration was monitored during acute exposure to different concentrations of 34 organic chemicals. Potency and efficacy were determined with and without the TRPV1 antagonist capsazepine (CZ). Fifteen of the chemicals induced TRPV1 activation at some concentrations, however only phenol appeared as a true TRPV1 agonist. Using machine learning, the parameters EC20, Emax, concentration at Emax, the two first components from principal component analyses, and pH were analysed against previously published RD50 data to classify each chemical as non-irritant/irritant or as non-irritant/ intermediate/irritant. The best 2-class model (accuracy 0.90, outlier frequency 4.8%) was the one using experiments with CZ present, suggesting that the irritancy was not mediated by TRPV1 activation. The best 3-class model (accuracy 0.77, outlier frequency 5.7%) was the one using data without CZ, indicating that TRPV1 activation may play a role for intermediate irritation. The three false negative chemicals, as predicted by the NAMs, were the most irritating chemicals according to RD50 determined in vivo, indicating that other processes may also be important.

Place, publisher, year, edition, pages
Springer, 2026
Keywords
Calcium influx, RD50, Random forest, Respiratory sensory irritation, SH-SY5Y, TRPV1
National Category
Neurosciences
Identifiers
urn:nbn:se:oru:diva-126514 (URN)10.1007/s00204-025-04288-6 (DOI)001667106400001 ()41566048 (PubMedID)
Funder
Karolinska InstituteSwedish Research Council, 2020-02039
Available from: 2026-01-22 Created: 2026-01-22 Last updated: 2026-06-02Bibliographically approved
Kreutzer, Y., Rahu, I., Norinder, U. & Kruve, A. (2026). Molecular networking, conformal predictions and revised fingerprint-based models for discovering endocrine disruptors in mixtures. Analytical and Bioanalytical Chemistry, 418, 1445-1457
Open this publication in new window or tab >>Molecular networking, conformal predictions and revised fingerprint-based models for discovering endocrine disruptors in mixtures
2026 (English)In: Analytical and Bioanalytical Chemistry, ISSN 1618-2642, E-ISSN 1618-2650, Vol. 418, p. 1445-1457Article in journal (Refereed) Published
Abstract [en]

Prioritizing high-risk features is a key step to reduce workload in non-targeted screening (NTS) when identifying environmental contaminants. Machine learning models from the MS2Tox toolbox have shown promise for feature prioritization, but rely heavily on the accuracy of molecular formulas and fingerprint features provided by SIRIUS + CSI:FingerID. In this study, we introduce and evaluate two new approaches-molecular networking (MN) and conformal predictions-to discover unidentified compounds potentially posing endocrine-disrupting activity based on tandem mass spectral similarity. Furthermore, we revised the previously published MS2Tox models, leveraging molecular fingerprints for seven Tox21 Data Challenge endpoints. The fingerprint-based MS2Tox models achieved the lowest false positive rate, 0.35, at 90% recall on the test set, while MN and CP yielded 0.82 and 0.68, respectively. In a case study of transformation products and persistent chemicals in wastewater, these three approaches prioritized 29 features in influent and effluent samples as potentially associated with AhR agonism among 189 LC/HRMS features corresponding to transformation products and persistent chemicals. All candidate structures for prioritized features showed scaffolds related to AhR binding affinity. Three features were identified on level 1, showcasing potential in using combined feature prioritization strategies.

Place, publisher, year, edition, pages
Springer, 2026
Keywords
Hazard, High-resolution mass spectrometry, Machine learning, Toxicity, Untargeted screening
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:oru:diva-126511 (URN)10.1007/s00216-025-06303-2 (DOI)001666416500001 ()41566047 (PubMedID)
Funder
Stockholm UniversitySwedish Research Council, 2022-01353Carl Tryggers foundation , 22:2336EU, Horizon 2020, 101124488
Note

Funding Agencies:

Open access funding provided by Stockholm University. The authors thank Stockholm University for providing open access funding 

Y.K. thanks for financial support from the Swedish Research Council, grant number 2022-01353 “MS2Tox: Deep Learning for Automated Prediction of the Endocrine Disruptive Potency of Chemicals in Complex Mixtures.” I.R. thanks the Carl Trygger Foundation project 22:2336 for financial support. The project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program grant agreement No. 101124488, LearningStructurE: Machine Learning and Mass Spectrometry for Structural Elucidation of Novel Toxic Chemicals.

Available from: 2026-01-22 Created: 2026-01-22 Last updated: 2026-06-02Bibliographically approved
Norinder, U., Zheng, Z. & Cotgreave, I. (2025). Prediction of the classification, labelling and packaging regulation H-statements with confidence using conformal prediction with N-grams and molecular fingerprints. Current research in toxicology, 8, Article ID 100242.
Open this publication in new window or tab >>Prediction of the classification, labelling and packaging regulation H-statements with confidence using conformal prediction with N-grams and molecular fingerprints
2025 (English)In: Current research in toxicology, ISSN 2666-027X, Vol. 8, article id 100242Article in journal (Refereed) Published
Abstract [en]

Effective chemical hazard labelling systems are essential for safeguarding human health and the environment as a result of widespread chemical use, and machine-learning models can be used to predict hazard labels efficiently and reduce the use of animal tests. This investigation shows the utility of N-grams and other fingerprint featurization procedures for predicting classification, labelling and packaging (CLP). Regulation H-statements, particularly in an ensemble (consensus) setting. Consensus modelling by class or Conformal Prediction median pvalues seems to be particularly advantageous in order to obtain both high conformal prediction validity and efficiency as well as good balanced accuracy, sensitivity and specificity. Utilization of the N-grams allows handling of all symbols in SMILES strings including those related to metals and salts that may be important for the compounds to exhibit their experimental determined toxicities. The models developed in this study are efficient tools to access hazard classification H-statements of chemicals, which can be useful for chemical hazard assessment, read-across as well as risk management.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
CLP Regulation, Conformal prediction, Consensus modeling, H -statements, Molecular fingerprints, N -grams, Random forest
National Category
Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:oru:diva-121603 (URN)10.1016/j.crtox.2025.100242 (DOI)001503552100002 ()40519565 (PubMedID)2-s2.0-105006900609 (Scopus ID)
Funder
Swedish Foundation for Strategic Research, 2018/11
Available from: 2025-06-16 Created: 2025-06-16 Last updated: 2025-06-17Bibliographically approved
Geylan, G., De Maria, L., Engkvist, O., David, F. & Norinder, U. (2024). A methodology to correctly assess the applicability domain of cell membrane permeability predictors for cyclic peptides. Digital Discovery, 3(9), 1761-1775
Open this publication in new window or tab >>A methodology to correctly assess the applicability domain of cell membrane permeability predictors for cyclic peptides
Show others...
2024 (English)In: Digital Discovery, E-ISSN 2635-098X, Vol. 3, no 9, p. 1761-1775Article in journal (Refereed) Published
Abstract [en]

Being able to predict the cell permeability of cyclic peptides is essential for unlocking their potential as a drug modality for intracellular targets. With a wide range of studies of cell permeability but a limited number of data points, the reliability of the machine learning (ML) models to predict previously unexplored chemical spaces becomes a challenge. In this work, we systemically investigate the predictive capability of ML models from the perspective of their extrapolation to never-before-seen applicability domains, with a particular focus on the permeability task. Four predictive algorithms, namely Support-Vector Machine, Random Forest, LightGBM and XGBoost, jointly with a conformal prediction framework were employed to characterize and evaluate the applicability through uncertainty quantification. Efficiency and validity of the models' predictions with multiple calibration strategies were assessed with respect to several external datasets from different parts of the chemical space through a set of experiments. The experiments showed that the predictors generalizing well to the applicability domain defined by the training data, can fail to achieve similar model performance on other parts of the chemical spaces. Our study proposes an approach to overcome such limitations by the means of improving the efficiency of models without sacrificing the validity. The trade-off between the reliability and informativeness was balanced when the models were calibrated with a subset of the data from the new targeted domain. This study outlines an approach to enable the extrapolation of predictive power and restore the models' reliability via a recalibration strategy without the need for retraining the underlying model. This work outlines peptide predictive model methodology with conformal prediction, focusing on extrapolation task. Calibrating on the unseen chemical space recovers efficiency and validity enabling reliable predictions without retraining the models.

Place, publisher, year, edition, pages
Royal Society of Chemistry, 2024
National Category
Chemical Sciences Computer Sciences
Identifiers
urn:nbn:se:oru:diva-115374 (URN)10.1039/d4dd00056k (DOI)001279737000001 ()2-s2.0-85200371105 (Scopus ID)
Funder
Swedish Foundation for Strategic Research
Note

This work has been partially funded by the Swedish Foundation for Strategic Research (SSF) through an industrial PhD studentship for GG.

Available from: 2024-08-16 Created: 2024-08-16 Last updated: 2025-01-07Bibliographically approved
Arvidsson McShane, S., Norinder, U., Alvarsson, J., Ahlberg, E., Carlsson, L. & Spjuth, O. (2024). CPSign: conformal prediction for cheminformatics modeling. Journal of Cheminformatics, 16(1), Article ID 75.
Open this publication in new window or tab >>CPSign: conformal prediction for cheminformatics modeling
Show others...
2024 (English)In: Journal of Cheminformatics, E-ISSN 1758-2946, Vol. 16, no 1, article id 75Article in journal (Refereed) Published
Abstract [en]

Conformal prediction has seen many applications in pharmaceutical science, being able to calibrate outputs of machine learning models and producing valid prediction intervals. We here present the open source software CPSign that is a complete implementation of conformal prediction for cheminformatics modeling. CPSign implements inductive and transductive conformal prediction for classification and regression, and probabilistic prediction with the Venn-ABERS methodology. The main chemical representation is signatures but other types of descriptors are also supported. The main modeling methodology is support vector machines (SVMs), but additional modeling methods are supported via an extension mechanism, e.g. DeepLearning4J models. We also describe features for visualizing results from conformal models including calibration and efficiency plots, as well as features to publish predictive models as REST services. We compare CPSign against other common cheminformatics modeling approaches including random forest, and a directed message-passing neural network. The results show that CPSign produces robust predictive performance with comparative predictive efficiency, with superior runtime and lower hardware requirements compared to neural network based models. CPSign has been used in several studies and is in production-use in multiple organizations. The ability to work directly with chemical input files, perform descriptor calculation and modeling with SVM in the conformal prediction framework, with a single software package having a low footprint and fast execution time makes CPSign a convenient and yet flexible package for training, deploying, and predicting on chemical data. CPSign can be downloaded from GitHub at https://github.com/arosbio/cpsign.

Scientific contribution: CPSign provides a single software that allows users to perform data preprocessing, modeling and make predictions directly on chemical structures, using conformal and probabilistic prediction. Building and evaluating new models can be achieved at a high abstraction level, without sacrificing flexibility and predictive performance-showcased with a method evaluation against contemporary modeling approaches, where CPSign performs on par with a state-of-the-art deep learning based model.

Place, publisher, year, edition, pages
Springer Nature, 2024
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:oru:diva-114520 (URN)10.1186/s13321-024-00870-9 (DOI)001258657400001 ()38943219 (PubMedID)2-s2.0-85197657994 (Scopus ID)
Funder
Uppsala UniversitySwedish Research Council, 2020-03731; 2020-01865Swedish Cancer Society, 22 2412Swedish Research Council Formas, 2022-00940EU, Horizon Europe, 101057014 (PARC)
Available from: 2024-07-01 Created: 2024-07-01 Last updated: 2024-07-29Bibliographically approved
Béquignon, O. J. M., Gómez-Tamayo, J. C., Lenselink, E. B., Wink, S., Hiemstra, S., Lam, C. C., . . . van Westen, G. J. P. (2023). Collaborative SAR Modeling and Prospective In Vitro Validation of Oxidative Stress Activation in Human HepG2 Cells. Journal of Chemical Information and Modeling, 63(17), 5433-5445
Open this publication in new window or tab >>Collaborative SAR Modeling and Prospective In Vitro Validation of Oxidative Stress Activation in Human HepG2 Cells
Show others...
2023 (English)In: Journal of Chemical Information and Modeling, ISSN 1549-9596, E-ISSN 1549-960X, Vol. 63, no 17, p. 5433-5445Article in journal (Refereed) Published
Abstract [en]

Oxidative stress is the consequence of an abnormal increase of reactive oxygen species (ROS). ROS are generated mainly during the metabolism in both normal and pathological conditions as well as from exposure to xenobiotics. Xenobiotics can, on the one hand, disrupt molecular machinery involved in redox processes and, on the other hand, reduce the effectiveness of the antioxidant activity. Such dysregulation may lead to oxidative damage when combined with oxidative stress overpassing the cell capacity to detoxify ROS. In this work, a green fluorescent protein (GFP)-tagged nuclear factor erythroid 2-related factor 2 (NRF2)-regulated sulfiredoxin reporter (Srxn1-GFP) was used to measure the antioxidant response of HepG2 cells to a large series of drug and drug-like compounds (2230 compounds). These compounds were then classified as positive or negative depending on cellular response and distributed among different modeling groups to establish structure-activity relationship (SAR) models. A selection of models was used to prospectively predict oxidative stress induced by a new set of compounds subsequently experimentally tested to validate the model predictions. Altogether, this exercise exemplifies the different challenges of developing SAR models of a phenotypic cellular readout, model combination, chemical space selection, and results interpretation.

Place, publisher, year, edition, pages
American Chemical Society (ACS), 2023
National Category
Biochemistry Molecular Biology
Identifiers
urn:nbn:se:oru:diva-107840 (URN)10.1021/acs.jcim.3c00220 (DOI)001063562000001 ()37616385 (PubMedID)2-s2.0-85169891068 (Scopus ID)
Funder
EU, Horizon 2020, 681002 964537 777365
Note

Funding agencies:

EFPIA

Netherlands Organization for Scientific Research (NWO) NWA-ORC 1292.19.272

 

Available from: 2023-08-25 Created: 2023-08-25 Last updated: 2025-02-20Bibliographically approved
Sapounidou, M., Norinder, U. & Andersson, P. L. (2023). Predicting Endocrine Disruption Using Conformal Prediction: A Prioritization Strategy to Identify Hazardous Chemicals with Confidence. Chemical Research in Toxicology, 36(1), 53-65
Open this publication in new window or tab >>Predicting Endocrine Disruption Using Conformal Prediction: A Prioritization Strategy to Identify Hazardous Chemicals with Confidence
2023 (English)In: Chemical Research in Toxicology, ISSN 0893-228X, E-ISSN 1520-5010, Vol. 36, no 1, p. 53-65Article in journal (Refereed) Published
Abstract [en]

Receptor-mediated molecular initiating events (MIEs) and their relevance in endocrine activity (EA) have been highlighted in literature. More than 15 receptors have been associated with neurodevelopmental adversity and metabolic disruption. MIEs describe chemical interactions with defined biological outcomes, a relationship that could be described with quantitative structure-activity relationship (QSAR) models. QSAR uncertainty can be assessed using the conformal prediction (CP) framework, which provides similarity (i.e., nonconformity) scores relative to the defined classes per prediction. CP calibration can indirectly mitigate data imbalance during model development, and the nonconformity scores serve as intrinsic measures of chemical applicability domain assessment during screening. The focus of this work was to propose an in silico predictive strategy for EA. First, 23 QSAR models for MIEs associated with EA were developed using high-throughput data for 14 receptors. To handle the data imbalance, five protocols were compared, and CP provided the most balanced class definition. Second, the developed QSAR models were applied to a large data set (∼55,000 chemicals), comprising chemicals representative of potential risk for human exposure. Using CP, it was possible to assess the uncertainty of the screening results and identify model strengths and out of domain chemicals. Last, two clustering methods, t-distributed stochastic neighbor embedding and Tanimoto similarity, were used to identify compounds with potential EA using known endocrine disruptors as reference. The cluster overlap between methods produced 23 chemicals with suspected or demonstrated EA potential. The presented models could be utilized for first-tier screening and identification of compounds with potential biological activity across the studied MIEs.

Place, publisher, year, edition, pages
American Chemical Society (ACS), 2023
National Category
Pharmacology and Toxicology
Identifiers
urn:nbn:se:oru:diva-102811 (URN)10.1021/acs.chemrestox.2c00267 (DOI)000903383200001 ()36534483 (PubMedID)2-s2.0-85144410434 (Scopus ID)
Funder
European Commission, 825759 825489Swedish Foundation for Strategic Research, DIA 2018/11
Available from: 2022-12-20 Created: 2022-12-20 Last updated: 2024-01-16Bibliographically approved
Norinder, U. & Lowry, S. (2023). Predicting Larch Casebearer damage with confidence using Yolo network models and conformal prediction. Remote Sensing Letters, 14(10), 1023-1035
Open this publication in new window or tab >>Predicting Larch Casebearer damage with confidence using Yolo network models and conformal prediction
2023 (English)In: Remote Sensing Letters, ISSN 2150-704X, E-ISSN 2150-7058, Vol. 14, no 10, p. 1023-1035Article in journal (Refereed) Published
Abstract [en]

This investigation shows that successful forecasting models for monitoring forest health status with respect to Larch Casebearer damages can be derived using a combination of a confidence predictor framework (Conformal Prediction) in combination with a deep learning architecture (Yolo v5). A confidence predictor framework can predict the current types of diseases used to develop the model and also provide indication of new, unseen, types or degrees of disease. The user of the models is also, at the same time, provided with reliable predictions and a well-established applicability domain for the model where such reliable predictions can and cannot be expected. Furthermore, the framework gracefully handles class imbalances without explicit over- or under-sampling or category weighting which may be of crucial importance in cases of highly imbalanced datasets. The present approach also provides indication of when insufficient information has been provided as input to the model at the level of accuracy (reliability) need by the user to make subsequent decisions based on the model predictions.

Place, publisher, year, edition, pages
Taylor & Francis, 2023
Keywords
Yolo network, Larch Casebearer moth, conformal prediction, forest health, tree damage
National Category
Computer Sciences
Identifiers
urn:nbn:se:oru:diva-108845 (URN)10.1080/2150704X.2023.2258460 (DOI)001071044000001 ()2-s2.0-85171885925 (Scopus ID)
Funder
Swedish Research Council, 2018-03807
Available from: 2023-10-10 Created: 2023-10-10 Last updated: 2024-01-16Bibliographically approved
Alijagic, A., Scherbak, N., Kotlyar, O., Karlsson, P., Persson, A., Hedbrant, A., . . . Engwall, M. (2022). Cell Painting unveils cell response signatures to (nano)particles formed in additive manufacturing. Paper presented at XVIth International Congress of Toxicology (ICT 2022) - UNITING IN TOXICOLOGY, Maastricht, The Netherlands, September 18-21, 2022. Toxicology Letters, 368(Suppl. 1), S226-S227, Article ID P17-01.
Open this publication in new window or tab >>Cell Painting unveils cell response signatures to (nano)particles formed in additive manufacturing
Show others...
2022 (English)In: Toxicology Letters, ISSN 0378-4274, E-ISSN 1879-3169, P17-01, Vol. 368, no Suppl. 1, p. S226-S227, article id P17-01Article in journal, Meeting abstract (Other academic) Published
Place, publisher, year, edition, pages
Elsevier, 2022
National Category
Environmental Sciences Production Engineering, Human Work Science and Ergonomics
Identifiers
urn:nbn:se:oru:diva-101516 (URN)10.1016/j.toxlet.2022.07.611 (DOI)000853725600549 ()
Conference
XVIth International Congress of Toxicology (ICT 2022) - UNITING IN TOXICOLOGY, Maastricht, The Netherlands, September 18-21, 2022
Available from: 2022-09-29 Created: 2022-09-29 Last updated: 2024-03-05Bibliographically approved
Tuerkova, A., Bongers, B. J., Norinder, U., Ungvári, O., Székely, V., Tarnovskiy, A., . . . Zdrazil, B. (2022). Identifying Novel Inhibitors for Hepatic Organic Anion Transporting Polypeptides by Machine Learning-Based Virtual Screening. Journal of Chemical Information and Modeling, 62(24), 6323-6335
Open this publication in new window or tab >>Identifying Novel Inhibitors for Hepatic Organic Anion Transporting Polypeptides by Machine Learning-Based Virtual Screening
Show others...
2022 (English)In: Journal of Chemical Information and Modeling, ISSN 1549-9596, E-ISSN 1549-960X, Vol. 62, no 24, p. 6323-6335Article in journal (Refereed) Published
Abstract [en]

Integration of statistical learning methods with structure-based modeling approaches is a contemporary strategy to identify novel lead compounds in drug discovery. Hepatic organic anion transporting polypeptides (OATP1B1, OATP1B3, and OATP2B1) are classical off-targets, and it is well recognized that their ability to interfere with a wide range of chemically unrelated drugs, environmental chemicals, or food additives can lead to unwanted adverse effects like liver toxicity and drug-drug or drug-food interactions. Therefore, the identification of novel (tool) compounds for hepatic OATPs by virtual screening approaches and subsequent experimental validation is a major asset for elucidating structure-function relationships of (related) transporters: they enhance our understanding about molecular determinants and structural aspects of hepatic OATPs driving ligand binding and selectivity. In the present study, we performed a consensus virtual screening approach by using different types of machine learning models (proteochemometric models, conformal prediction models, and XGBoost models for hepatic OATPs), followed by molecular docking of preselected hits using previously established structural models for hepatic OATPs. Screening the diverse REAL drug-like set (Enamine) shows a comparable hit rate for OATP1B1 (36% actives) and OATP1B3 (32% actives), while the hit rate for OATP2B1 was even higher (66% actives). Percentage inhibition values for 44 selected compounds were determined using dedicated in vitro assays and guided the prioritization of several highly potent novel hepatic OATP inhibitors: six (strong) OATP2B1 inhibitors (IC50 values ranging from 0.04 to 6 μM), three OATP1B1 inhibitors (2.69 to 10 μM), and five OATP1B3 inhibitors (1.53 to 10 μM) were identified. Strikingly, two novel OATP2B1 inhibitors were uncovered (C7 and H5) which show high affinity (IC50 values: 40 nM and 390 nM) comparable to the recently described estrone-based inhibitor (IC50 = 41 nM). A molecularly detailed explanation for the observed differences in ligand binding to the three transporters is given by means of structural comparison of the detected binding sites and docking poses.

Place, publisher, year, edition, pages
American Chemical Society (ACS), 2022
National Category
Pharmacology and Toxicology
Identifiers
urn:nbn:se:oru:diva-98150 (URN)10.1021/acs.jcim.1c01460 (DOI)000840962400001 ()35274943 (PubMedID)2-s2.0-85126619146 (Scopus ID)
Available from: 2022-03-21 Created: 2022-03-21 Last updated: 2024-01-16Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-3107-331x

Search in DiVA

Show all publications