To Örebro University

oru.seÖrebro University Publications
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 204) Show all publications
Kalidindi, S. S., Banaee, H., Karlsson, H. & Loutfi, A. (2025). District heating optimization in residential buildings using reinforcement learning with adaptive context-aware predictive environment. Energy and AI, 22, Article ID 100603.
Open this publication in new window or tab >>District heating optimization in residential buildings using reinforcement learning with adaptive context-aware predictive environment
2025 (English)In: Energy and AI, E-ISSN 2666-5468, Vol. 22, article id 100603Article in journal (Refereed) Published
Abstract [en]

As district heating networks evolve to meet climate-neutral objectives, optimizing their control under heterogeneous building characteristics and dynamic environmental conditions remains a significant challenge. Traditional control strategies often lack the adaptability necessary to account for building-specific dynamics and to ensure real-time adherence to operational safety constraints. In this work, we present an integrated machine learning-based framework that combines an adaptive context-aware transformer model with deep reinforcement learning to address these limitations. The proposed approach introduces an adaptive context-aware transformer as a predictive environment within a Deep Q-Network (DQN) framework, enabling data-driven, building-specific control of district heating systems. Utilizing real-world data from 148 residential buildings across Sweden and Finland, the model incorporates contextual embeddings and temporal features to predict indoor temperature trajectories with high accuracy, achieving root mean square error values between 0.18-0.24 degrees C for Swedish buildings and 0.26-0.32 degrees C for Finnish buildings. The DQN agent leverages these predictions to optimize heating control while ensuring compliance with operational safety limits and preserving occupant comfort. Experimental results demonstrate significant energy savings, with mid-rise buildings achieving up to 14.85% reduction in energy consumption, and peak seasonal savings exceeding 20% during spring months. This integrated approach illustrates the potential for substantial energy optimization and reliable indoor climate management in future district heating networks.

Place, publisher, year, edition, pages
Elsevier, 2025
Keywords
Residential buildings, Adaptive context-aware transformer, District heating, Energy optimization, Reinforcement learning (RL)
National Category
Artificial Intelligence
Identifiers
urn:nbn:se:oru:diva-124092 (URN)10.1016/j.egyai.2025.100603 (DOI)001577905300001 ()
Funder
Knowledge Foundation, 20190128
Note

This work has been supported by the Industrial Graduate School Collaborative AI & Robotics funded by the Swedish Knowledge Foundation Dnr: 20190128 and in collaboration with industrial partner EcoGuard AB.

Available from: 2025-10-02 Created: 2025-10-02 Last updated: 2025-10-02Bibliographically approved
Neelakantan, S., Längkvist, M. & Loutfi, A. (2025). DR-SCAN: AN INTERPRETABLE DUAL-BRANCHRESIDUAL SPATIAL AND CHANNEL ATTENTIONNETWORK FOR REMOTE SENSING AND GEOSCIENCEIMAGE SUPER-RESOLUTION. In: : . Paper presented at The Thirteenth International Conference on Learning Representations (ICLR 2025), Singapore, April 24-28, 2025.
Open this publication in new window or tab >>DR-SCAN: AN INTERPRETABLE DUAL-BRANCHRESIDUAL SPATIAL AND CHANNEL ATTENTIONNETWORK FOR REMOTE SENSING AND GEOSCIENCEIMAGE SUPER-RESOLUTION
2025 (English)Conference paper, Poster (with or without abstract) (Refereed)
Abstract [en]

High-resolution imaging is essential in remote sensing and geoscience for precise environmental and geological analysis. DR-SCAN (Dual-Branch Residual Spatial and Channel Attention Networks), a neural network architecture for image super-resolution across these domains, is introduced. Evaluated on the UCMerced Land Use and DeepRock-SR datasets, DR-SCAN demonstrates adaptability to diverse remote sensing landscapes and effectiveness in resolving pore-scale geological features. Feature map visualizations highlight the model’s ability to prioritize crit-ical spatial features, enhancing interpretability for domain-specific applications.

National Category
Artificial Intelligence
Identifiers
urn:nbn:se:oru:diva-121021 (URN)
Conference
The Thirteenth International Conference on Learning Representations (ICLR 2025), Singapore, April 24-28, 2025
Funder
Knowledge Foundation
Available from: 2025-05-13 Created: 2025-05-13 Last updated: 2025-05-13Bibliographically approved
Mironenko, O., Banaee, H. & Loutfi, A. (2025). Evaluation of Coordination Strategies for Underground Automated Vehicle Fleets in Mixed Traffic. In: 2025 IEEE Intelligent Vehicles Symposium (IV): Proceedings. Paper presented at 36th IEEE Intelligent Vehicles Symposium (IEEE IV2025), Cluj-Napoca, Romania, June 22-25, 2025 (pp. 1020-1026). IEEE
Open this publication in new window or tab >>Evaluation of Coordination Strategies for Underground Automated Vehicle Fleets in Mixed Traffic
2025 (English)In: 2025 IEEE Intelligent Vehicles Symposium (IV): Proceedings, IEEE, 2025, p. 1020-1026Conference paper, Published paper (Refereed)
Abstract [en]

This study investigates the efficiency and safety outcomes of implementing different adaptive coordination models for automated vehicle (AV) fleets, managed by a centralized coordinator that dynamically responds to human-controlled vehicle behavior. The simulated scenarios replicate an underground mining environment characterized by narrow tunnels with limited connectivity. To address the unique challenges of such settings, we propose a novel metric — Path Overlap Density (POD) — to predict efficiency and potentially the safety performance of AV fleets. The study also explores the impact of map features on AV fleets performance. The results demonstrate that both AV fleet coordination strategies and underground tunnel network characteristics significantly influence overall system performance. While map features are critical for optimizing efficiency, adaptive coordination strategies are essential for ensuring safe operations. 

Place, publisher, year, edition, pages
IEEE, 2025
Series
IEEE Intelligent Vehicles Symposium (IV), ISSN 1931-0587, E-ISSN 2642-7214
Keywords
Mixed traffic with fleets of automated vehicles, path overlap density (POD), human behavior in driving, centralized coordination, coordination strategies, underground mining
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:oru:diva-122686 (URN)10.1109/IV64158.2025.11097685 (DOI)001556907500146 ()9798331538033 (ISBN)9798331538040 (ISBN)
Conference
36th IEEE Intelligent Vehicles Symposium (IEEE IV2025), Cluj-Napoca, Romania, June 22-25, 2025
Funder
Knowledge Foundation, 20190128
Note

This work has been supported by Sustainable Underground Mining (SUM) Academy, Project SP-12 2021-2024, and the Industrial Graduate School Collaborative AI & Robotics funded by the Swedish Knowledge Foundation Dnr:20190128. 

Available from: 2025-08-07 Created: 2025-08-07 Last updated: 2025-10-02Bibliographically approved
Sygkounas, A., Athanasiadis, I., Persson, A., Felsberg, M. & Loutfi, A. (2025). Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving. In: 2025 IEEE Intelligent Vehicles Symposium (IV): Proceedings. Paper presented at 36th Intelligent Vehicles Symposium-IV-Annual, Cluj-Napoca, Romania, June 22-25, 2025 (pp. 2325-2332). IEEE
Open this publication in new window or tab >>Interactive Double Deep Q-network: Integrating Human Interventions and Evaluative Predictions in Reinforcement Learning of Autonomous Driving
Show others...
2025 (English)In: 2025 IEEE Intelligent Vehicles Symposium (IV): Proceedings, IEEE, 2025, p. 2325-2332Conference paper, Published paper (Refereed)
Abstract [en]

Integrating human expertise with machine learning is crucial for applications demanding high accuracy and safety, such as autonomous driving. This study introduces Interactive Double Deep Q-network (iDDQN), a Human-in-the-Loop (HITL) approach that enhances Reinforcement Learning (RL) by merging human insights directly into the RL training process, improving model performance. Our proposed iDDQN method modifies the Q-value update equation to integrate human and agent actions, establishing a collaborative approach for policy development. Additionally, we present an offline evaluative framework that simulates the agent's trajectory as if no human intervention to assess the effectiveness of human interventions. Empirical results in simulated autonomous driving scenarios demonstrate that iDDQN outperforms established approaches, including Behavioral Cloning (BC), HG-DAgger, Deep Q-Learning from Demonstrations (DQfD), and vanilla DRL in leveraging human expertise for improving performance and adaptability.

Place, publisher, year, edition, pages
IEEE, 2025
Series
IEEE Intelligent Vehicles Symposium (IV), ISSN 1931-0587, E-ISSN 2642-7214
National Category
Computer Sciences
Identifiers
urn:nbn:se:oru:diva-124089 (URN)10.1109/IV64158.2025.11097638 (DOI)001556907500332 ()9798331538040 (ISBN)9798331538033 (ISBN)
Conference
36th Intelligent Vehicles Symposium-IV-Annual, Cluj-Napoca, Romania, June 22-25, 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2025-10-02 Created: 2025-10-02 Last updated: 2025-10-02Bibliographically approved
Hazra, R., Sygkounas, A., Persson, A., Loutfi, A. & Zuidberg dos Martires, P. (2025). REvolve: Reward Evolution with Large Language Models using Human Feedback. In: 13th International Conference on Learning Representations (ICLR 2025): Proceedings. Paper presented at 13th International Conference on Learning Representations (ICLR 2025), Singapore, April 24-28, 2025 (pp. 25710-25751). International Conference on Learning Representations, ICLR
Open this publication in new window or tab >>REvolve: Reward Evolution with Large Language Models using Human Feedback
Show others...
2025 (English)In: 13th International Conference on Learning Representations (ICLR 2025): Proceedings, International Conference on Learning Representations, ICLR , 2025, p. 25710-25751Conference paper, Published paper (Refereed)
Abstract [en]

Designing effective reward functions is crucial to training reinforcement learning (RL) algorithms. However, this design is non-trivial, even for domain experts, due to the subjective nature of certain tasks that are hard to quantify explicitly. In recent works, large language models (LLMs) have been used for reward generation from natural language task descriptions, leveraging their extensive instruction tuning and commonsense understanding of human behavior. In this work, we hypothesize that LLMs, guided by human feedback, can be used to formulate reward functions that reflect human implicit knowledge. We study this in three challenging settings - autonomous driving, humanoid locomotion, and dexterous manipulation - wherein notions of “good” behavior are tacit and hard to quantify. To this end, we introduce REvolve, a truly evolutionary framework that uses LLMs for reward design in RL. REvolve generates and refines reward functions by utilizing human feedback to guide the evolution process, effectively translating implicit human knowledge into explicit reward functions for training (deep) RL agents. Experimentally, we demonstrate that agents trained on REvolve-designed rewards outperform other state-of-the-art baselines. 

Place, publisher, year, edition, pages
International Conference on Learning Representations, ICLR, 2025
National Category
Computer Sciences
Identifiers
urn:nbn:se:oru:diva-123277 (URN)10.48550/arXiv.2406.01309 (DOI)9798331320850 (ISBN)
Conference
13th International Conference on Learning Representations (ICLR 2025), Singapore, April 24-28, 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg Foundation
Available from: 2025-09-01 Created: 2025-09-01 Last updated: 2025-09-01Bibliographically approved
Kalidindi, S. S., Banaee, H., Karlsson, H. & Loutfi, A. (2024). Adaptive Context Embedding for Temperature Prediction in Residential Buildings. In: Ulle Endriss; Francisco S. Melo; Kerstin Bach; Alberto Bugarín-Diz; José M. Alonso-Moral; Senén Barro; Fredrik Heintz (Ed.), 27th European Conference on Artificial Intelligence, 19–24 October 2024, Santiago de Compostela, Spain – Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024): . Paper presented at 27th European Conference on Artificial Intelligence (ECAI 2024), Santiago de Compostela, Spain, October 19-24, 2024 (pp. 4727-4733). IOS Press, 392
Open this publication in new window or tab >>Adaptive Context Embedding for Temperature Prediction in Residential Buildings
2024 (English)In: 27th European Conference on Artificial Intelligence, 19–24 October 2024, Santiago de Compostela, Spain – Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024) / [ed] Ulle Endriss; Francisco S. Melo; Kerstin Bach; Alberto Bugarín-Diz; José M. Alonso-Moral; Senén Barro; Fredrik Heintz, IOS Press, 2024, Vol. 392, p. 4727-4733Conference paper, Published paper (Refereed)
Abstract [en]

Transformer-based models have gained increasing popularity for time-series prediction; however, in specific applications such as residential heating systems, static contextual data of buildings is crucial to effectively capture and learn complex environmental dynamics. This paper presents a novel transformer-based model that adapts the contextual meta-data of residential buildings, generalizing across diverse environments. The model integrates temporal data with adaptive embedding of building-specific contextual meta-data such as geographic locations and building characteristics to dynamically learn and adapt to the variations. These adaptive context embeddings allow the model to comprehensively understand how different buildings respond to environmental changes over time. Initial results show improved accuracy and reliability in indoor temperature predictions of residential buildings, demonstrating the model’s potential to optimize district heating systems across a diverse array of residential buildings. This proposed model provides a basis for developing proactive heat management systems in buildings.

Place, publisher, year, edition, pages
IOS Press, 2024
Series
Frontiers in Artificial Intelligence and Applications, ISSN 0922-6389, E-ISSN 1879-8314 ; 392
Keywords
Time series Prediction, Transformer model, Temperature Prediction, Residential buildings, Context aware
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:oru:diva-119317 (URN)10.3233/FAIA241070 (DOI)2-s2.0-85216620102 (Scopus ID)9781643685489 (ISBN)
Conference
27th European Conference on Artificial Intelligence (ECAI 2024), Santiago de Compostela, Spain, October 19-24, 2024
Funder
Knowledge Foundation, 20190128
Note

This work has been supported by the Industrial Graduate School Collaborative AI & Robotics funded by the Swedish Knowledge Foundation Dnr:20190128 and in collaboration with industrial partner EcoGuard AB.

Available from: 2025-02-17 Created: 2025-02-17 Last updated: 2025-02-18Bibliographically approved
Aregbede, V., Abraham, S. S., Persson, A., Längkvist, M. & Loutfi, A. (2024). Affordance-Based Goal Imagination for Embodied AI Agents. In: 2024 IEEE International Conference on Development and Learning (ICDL): . Paper presented at IEEE International Conference on Development and Learning (ICDL 2024), Austin, Texas, USA, May 20-23, 2024 (pp. 1-6). IEEE
Open this publication in new window or tab >>Affordance-Based Goal Imagination for Embodied AI Agents
Show others...
2024 (English)In: 2024 IEEE International Conference on Development and Learning (ICDL), IEEE, 2024, p. 1-6Conference paper, Published paper (Refereed)
Abstract [en]

Goal imagination in robotics is an emerging concept and involves the capability to automatically generate realistic goals, which, in turn, requires the assessment of the feasibility of transitioning from the current conditions of an initial scene to thedesired goal state. Existing research has explored the utilization of diverse image-generative models to create images depicting potential goal states based on the current state and instructions. In this paper, we illustrate the limitations of current state-of-the-art image generative models in accurately assessing the feasibility of specific actions in particular situations. Consequently, we present how integrating large language models, which possess profound knowledge of real-world objects and affordances, can enhance the performance of image-generative models in discerning plausible from implausible actions and simulating the outcomes of actions in a given context. This will be a step towards achieving the pragmatic goal of imagination in robotics.

Place, publisher, year, edition, pages
IEEE, 2024
Keywords
Embodiment, Affordance
National Category
Computer graphics and computer vision
Research subject
Computer Science
Identifiers
urn:nbn:se:oru:diva-118193 (URN)10.1109/ICDL61372.2024.10644764 (DOI)001338553000023 ()2-s2.0-85203835311 (Scopus ID)9798350348552 (ISBN)9798350348569 (ISBN)
Conference
IEEE International Conference on Development and Learning (ICDL 2024), Austin, Texas, USA, May 20-23, 2024
Funder
Swedish Research Council, 2021-05229
Available from: 2025-01-09 Created: 2025-01-09 Last updated: 2025-02-07Bibliographically approved
Morillo-Mendez, L., Schrooten, M. G. S., Loutfi, A. & Martinez Mozos, O. (2024). Age-Related Differences in the Perception of Robotic Referential Gaze in Human-Robot Interaction. International Journal of Social Robotics, 16(6), 1069-1081
Open this publication in new window or tab >>Age-Related Differences in the Perception of Robotic Referential Gaze in Human-Robot Interaction
2024 (English)In: International Journal of Social Robotics, ISSN 1875-4791, E-ISSN 1875-4805, Vol. 16, no 6, p. 1069-1081Article in journal (Refereed) Published
Abstract [en]

There is an increased interest in using social robots to assist older adults during their daily life activities. As social robots are designed to interact with older users, it becomes relevant to study these interactions under the lens of social cognition. Gaze following, the social ability to infer where other people are looking at, deteriorates with older age. Therefore, the referential gaze from robots might not be an effective social cue to indicate spatial locations to older users. In this study, we explored the performance of older adults, middle-aged adults, and younger controls in a task assisted by the referential gaze of a Pepper robot. We examined age-related differences in task performance, and in self-reported social perception of the robot. Our main findings show that referential gaze from a robot benefited task performance, although the magnitude of this facilitation was lower for older participants. Moreover, perceived anthropomorphism of the robot varied less as a result of its referential gaze in older adults. This research supports that social robots, even if limited in their gazing capabilities, can be effectively perceived as social entities. Additionally, this research suggests that robotic social cues, usually validated with young participants, might be less optimal signs for older adults.

Supplementary Information: The online version contains supplementary material available at 10.1007/s12369-022-00926-6.

Place, publisher, year, edition, pages
Springer, 2024
Keywords
Aging, Gaze following, Human-robot interaction, Non-verbal cues, Referential gaze, Social cues
National Category
Gerontology, specialising in Medical and Health Sciences Robotics and automation
Identifiers
urn:nbn:se:oru:diva-101615 (URN)10.1007/s12369-022-00926-6 (DOI)000857896500001 ()36185773 (PubMedID)2-s2.0-85138680591 (Scopus ID)
Funder
European Commission, 754285Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

Funding agency:

RobWell project - Spanish Ministerio de Ciencia, Innovacion y Universidades RTI2018-095599-A-C22

Available from: 2022-10-04 Created: 2022-10-04 Last updated: 2025-02-05Bibliographically approved
Rahaman, G. M., Längkvist, M. & Loutfi, A. (2024). Deep learning based automated estimation of urban green space index from satellite image: A case study. Urban Forestry & Urban Greening, 97, Article ID 128373.
Open this publication in new window or tab >>Deep learning based automated estimation of urban green space index from satellite image: A case study
2024 (English)In: Urban Forestry & Urban Greening, ISSN 1618-8667, E-ISSN 1610-8167, Vol. 97, article id 128373Article in journal (Refereed) Published
Abstract [en]

The green area factor model is a crucial tool for conserving and creating urban greenery and ecosystem services within neighborhood land. This model serves as a valuable index, streamlining the planning, assessment, and comparison of local-scale green infrastructures. However, conventional on-site measurements of the green area factor are resource intensive. In response, this study pioneers a computational approach that integrates ecological and social dimensions to estimate the green area factor. Employing satellite remote sensing and advanced deep learning techniques, the methodology utilizes satellite orthophotos of urban areas subjected to semantic segmentation, identifying and categorizing diverse green elements. Ground truths are established through on-site measurements of green area factors and satellite orthophotos from benchmarking sites in <spacing diaeresis>Orebro, Sweden. Results reveal an 82.0% average F1-score for semantic segmentations, signifying a favourable correlation between computationally estimated and measured green area factors. The proposed methodology is potential for adapting to various urban settings. In essence, this research introduces a promising, cost-effective solution for assessing urban greenness, particularly beneficial for urban administrators and planners aiming for insightful and comprehensive green strategies in city planning.

Place, publisher, year, edition, pages
Elsevier, 2024
Keywords
Deep convolutional neural networks (CNN), Green infrastructure, Green index, Semantic segmentation, Urban greenery, Urban planning
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:oru:diva-114996 (URN)10.1016/j.ufug.2024.128373 (DOI)001247062200001 ()2-s2.0-85194227352 (Scopus ID)
Funder
Region Örebro County, 20294202
Available from: 2024-07-25 Created: 2024-07-25 Last updated: 2024-07-25Bibliographically approved
Mironenko, O., Banaee, H. & Loutfi, A. (2024). Evaluation of Human Interaction with Fleets of Automated Vehicles in Dynamic Underground Mining Environments. In: Angelo Ferrando; Rafael C. Cardoso (Ed.), Agents and Robots for reliable Engineered Autonomy: 4th Workshop, AREA 2024, Santiago de Compostela, Spain, October 19, 2024, Proceedings. Paper presented at Agents and Robots for reliable Engineered Autonomy (AREA 2024), in conjunction with ECAI 2024, Santiago de Compostela, Spain, October 19, 2024 (pp. 54-72). Springer
Open this publication in new window or tab >>Evaluation of Human Interaction with Fleets of Automated Vehicles in Dynamic Underground Mining Environments
2024 (English)In: Agents and Robots for reliable Engineered Autonomy: 4th Workshop, AREA 2024, Santiago de Compostela, Spain, October 19, 2024, Proceedings / [ed] Angelo Ferrando; Rafael C. Cardoso, Springer, 2024, p. 54-72Conference paper, Published paper (Refereed)
Abstract [en]

This study investigates the complexities of Mixed Traffic with Fleets of Automated Vehicles (MTF-AVs) in underground mining environments characterized by confined spaces, limited visibility, and strict navigation requirements. The research focuses on integrating human-controlled vehicles into coordinated AV fleets, addressing the unpredictable interactions that arise from human behaviour. The ORU coordination framework, originally designed for a fully autonomous system, is adapted for mixed traffic scenarios to evaluate the impact of human behaviour on system efficiency and safety. Through a series of simulations, the study explores how fleet coordination algorithms adapt to human driver behaviour. These simulations demonstrate that human error and rule violations significantly reduce performance, increasing safety risks and decreasing efficiency. Findings emphasize the need for advanced coordination algorithms that dynamically adapt to unpredictable human behaviour in MTF-AVs. Such algorithms would optimize interactions between automated and human-controlled vehicles, enhancing both safety and efficiency in these complex and dynamic environments. Future research will further explore the influence of human behaviour on the coordination system and develop advanced coordination algorithms with methods to evaluate these interactions effectively.

Place, publisher, year, edition, pages
Springer, 2024
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937
Keywords
Human behaviour in driving, Mixed traffic with fleets of automated vehicles, Centralised coordination, Underground mining
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:oru:diva-117072 (URN)10.1007/978-3-031-73180-8_4 (DOI)001437042500004 ()2-s2.0-85207572427 (Scopus ID)9783031731808 (ISBN)9783031731792 (ISBN)
Conference
Agents and Robots for reliable Engineered Autonomy (AREA 2024), in conjunction with ECAI 2024, Santiago de Compostela, Spain, October 19, 2024
Funder
Knowledge Foundation, 20190128
Note

This work has been supported by Sustainable Underground Mining (SUM) Academy, Project SP-12 2021-2024, and the Industrial Graduate School Collaborative AI & Robotics funded by the Swedish Knowledge Foundation Dnr:20190128.

Available from: 2024-10-30 Created: 2024-10-30 Last updated: 2025-04-15Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-3122-693X

Search in DiVA

Show all publications