To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning Interpretable Robot Policies from Demonstration
Örebro University, School of Science and Technology.ORCID iD: 0000-0003-2279-9418
2025 (English)Doctoral thesis, monograph (Other academic)
Abstract [en]

The vision of robots assisting with our daily life tasks hinges on a fundamental challenge: most robot behaviors require programming by experts, creating a barrier for the many. Learning from Demonstration (LfD) is a family of methods with the potential to democratize robot programming by enabling laypeople to teach robots new skills simply by showing them. However, most LfD methods operate as black-box systems, making it difficult for humans to interpret, adapt, or reuse — key factors for human-robot collaboration and understanding.

In this thesis, we instead build upon Behavior Trees (BTs) — a transparent and manageable robot programming framework, whose reactive, modular design and potential for interoperability make them well-suited for constructing sophisticated robot behaviors. To bridge the gap between high-level planning and low-level control, engineers often embed entire behaviors within the BT’s leaf nodes. While functional, this approach encapsulates complete skills, obscuring intermediate subgoals and undermining the transparency and modularity that BTs are meant to provide — ultimately limiting skill reuse and adaptability.

The structure of a BT plays a crucial role in effective behavior design. While several approaches have aimed to learn BT structures from demonstrations, they often rely on predefined action sets and state spaces. This necessity for expert-curated inputs constrains the robot’s learning flexibility and reintroduces a degree of expert dependency. Compounding these challenges is a more foundational issue: the BT community lacks universally accepted definitions and rigorous evaluation methods for key properties such as interpretability and modularity. This absence leads to inconsistent claims and makes meaningful comparisons across different studies a significant challenge.

To address these gaps, this dissertation proposes a path toward more intuitive and effective robot learning, articulated through three key contributions:

First, this thesis formalizes core BT properties for robotics and introduces metrics for the systematic evaluation and comparison of learned policies. Building on this foundation, it investigates how different BT structures impact the interpretability of the control policy, identifying design patterns that best align with human intuition and understanding.

Second, this thesis presents a unified control framework that integrates BTs with a high-frequency Stack-of-Tasks (SoT) control strategy, enhancing transparency of BT policies by explicitly revealing the underlying subgoals. This approach allows BT nodes to function as hierarchical, high-frequency control objectives. Moreover, the resulting system achieves rapid reactivity in dynamic environments while supporting the coexistence of heterogeneous controllers and preserving a clean, modular decomposition of complex tasks.

Finally, this thesis introduces an end-to-end, label-free LfD pipeline that simultaneously learns the global BT structure and the underlying actions —modeled as Dynamic Movement Primitives—directly from raw demonstration data. By leveraging vision-language models to automatically extract and annotate state representations, this method eliminates the need for handcrafted action sets, predefined state spaces, and time-consuming manual labeling.

In summary, this thesis provides a comprehensive framework for learning interpretable, modular, and adaptable robot control policies from demonstration, bridging the gap between transparent policy representation and practical, high-frequency robot control, which marks a significant step toward making robot programming more accessible, robust, and understandable for both experts and non-experts.

Place, publisher, year, edition, pages
Örebro: Örebro University , 2025. , p. 121
Series
Örebro Studies in Technology, ISSN 1650-8580 ; 110
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:oru:diva-124258ISBN: 9789175297187 (print)OAI: oai:DiVA.org:oru-124258DiVA, id: diva2:2004429
Public defence
2025-12-05, Örebro universitet, Teknikhuset, Hörsal T, Fakultetsgatan 1, Örebro, 09:00 (English)
Opponent
Supervisors
Available from: 2025-10-07 Created: 2025-10-07 Last updated: 2026-01-20Bibliographically approved

Open Access in DiVA

Cover(293 kB)33 downloads
File information
File name COVER01.pdfFile size 293 kBChecksum SHA-512
1bad6c8965900a5593274e511502d7ffe8d38c1708788da57391de9b5d5b6bc281f948de35006be2fa48f47718ca72845c5c9d14d565b998776ae726ac8e6acd
Type coverMimetype application/pdf
Learning Interpretable Robot Policies from Demonstration(5125 kB)86 downloads
File information
File name FULLTEXT01.pdfFile size 5125 kBChecksum SHA-512
b748e521fdd89c0f2d0ecf0e4eddde03d1ed11724fd1312b4d67300bcbece755ee27bfb10faba545afe9f134599e1678b7aaabc47b42b9d58a920aa58082a83c
Type fulltextMimetype application/pdf
Spikblad(131 kB)35 downloads
File information
File name SPIKBLAD01.pdfFile size 131 kBChecksum SHA-512
3e2abed43789e7c30a57ac048a7c06452d5037a0ef10d08789d93062833b60edfb8ba3906932ce6165475e830b6d2bc011da28f1f3da9c3cf1fa10d35ed65be0
Type spikbladMimetype application/pdf

Authority records

Caceres Dominguez, David

Search in DiVA

By author/editor
Caceres Dominguez, David
By organisation
School of Science and Technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 8706 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf