To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Holding Form in a Shifting World: Flexible and Reliable Robot Manipulation through Behavior Trees
Örebro University, School of Science and Technology.ORCID iD: 0000-0002-2142-6516
2025 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The increasing complexity and unpredictability of manipulation tasks in modern industrial and service robotics have highlighted the limitations of pre-programmed robot solutions. Robots operating under changing object positions, variable obstacles, and unforeseen perturbations must adjust their actions online to reliably satisfy the high-performance requirements of real-world deployment scenarios.

A promising direction for enabling such flexible and reliable manipulation lies in the use of Behavior Trees (BTs), a formalism for transparent decision-making that structures robot behavior hierarchically through modular, reusable components. BTs are a well-suited solution because their inherent reactivity allows the system to respond effectively to high-level disturbances, such as perception or grasping failures. At the same time, their modular design facilitates the reuse of sub-behaviors across different scenarios, enabling automation systems to be easily reconfigured to meet varying operational demands. However, existing BT-based approaches fall short in scenarios in which more advanced forms of robustness to local perturbations and task variations are required. This thesis contributes novel solutions to address these limitations and enhance the applicability of BTs as control policies in real-world manipulation settings.

To rigorously assess the solutions proposed in this thesis, we first need to formalize the terminology and evaluation criteria associated with BT-based robot control. We begin by identifying a subset of properties that are most relevant to our scope, such as reactivity, modularity, and robustness, and clarifying their definitions by resolving ambiguities found in prior work. For each of these properties, we examine how they have been evaluated in the literature and propose additional metrics to address identified gaps in existing evaluation practices.

The first technical contribution addresses the reactivity of BT policies and their way of handling simultaneous control objectives. While BTs effectivel ymanage global, high-level disturbances, flexible manipulation also requires rapid response to local, low-level perturbations that do not warrant changes to the high-level plan. Furthermore, when BTs are coupled with convention allow-level controllers for redundant manipulators, they often struggle to satisfy multiple, potentially competing objectives in a coherent and reliable manner. To address these limitations, we integrate BTs with a prioritized control strategy that decomposes each manipulation skill, such as grasping, into multiple control objectives with defined priorities, distributed across the BT nodes and executed concurrently. This integration introduces an additional layer of low-level reactivity, ensures the reliable satisfaction of multiple objectives, and reinforces the modularity of the BT policy by assigning distinct goals to separate leaf nodes.

Although the proposed framework provides robustness to both high- and low-level disturbances during execution, it still relies on manually specified parameters, which often need adjustment to specific task variations, such as minor changes of object positions or obstacle configurations. The second technical contribution is a data-driven approach based on Reinforcement Learning that augments the BT with a context-based adaptation policy. This module observes task-relevant features, referred to as a context, and selects appropriate BT parameters at execution time. The result is a policy that adapts its behavior on the fly to previously unseen variations, without manual intervention.

Despite its benefits, the proposed framework remains limited to adapting only to directly observable task variations and requires training procedures that, when performed on a physical robot, are often unsafe and timeconsuming. The last technical contribution addresses both limitations by introducing a context estimator that infers latent dynamics parameters — such as friction coefficients or object mass — from recent interaction data. Conditioning the context-based adaptation policy on this latent estimate enables the BT-based policy to operate robustly even under partial observability. Moreover, because these latent parameters often underlie the discrepancies between simulation and reality, the very same mechanism also provides a principled way to bridge the sim-to-real gap: policies are trained in simulation with domain randomization, while the estimated context compensates for the mismatched dynamics, improving robustness at deployment.

Place, publisher, year, edition, pages
Örebro: Örebro University , 2025. , p. 143
Series
Örebro Studies in Technology, ISSN 1650-8580 ; 111
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:oru:diva-124067ISBN: 9789175297217 (print)OAI: oai:DiVA.org:oru-124067DiVA, id: diva2:2002317
Public defence
2025-12-09, Örebro universitet, Långhuset, Hörsal L2, Fakultetsgatan 1, Örebro, 09:00 (English)
Opponent
Supervisors
Available from: 2025-09-30 Created: 2025-09-30 Last updated: 2025-11-28Bibliographically approved

Open Access in DiVA

Cover(407 kB)33 downloads
File information
File name COVER01.pdfFile size 407 kBChecksum SHA-512
c8cc54e921c10cbf2db9960004c7b288f7933b21be6dafc1ba8b625b6887494116f73ce8a65be567cf30c130487532e1462a7e7ba4ef18d0992691dc6611cdda
Type coverMimetype application/pdf
Holding Form in a Shifting World: Flexible and Reliable Robot Manipulation through Behavior Trees(3648 kB)263 downloads
File information
File name FULLTEXT01.pdfFile size 3648 kBChecksum SHA-512
091b62fdf75bc7fdaaf31f6fc889c6cc93960ccaf365a11ef044073d2a53a12ca82696bd0923c21f53d1d55087793544e1c726600ed79770d4e9607038749f9e
Type fulltextMimetype application/pdf
Spikblad(125 kB)42 downloads
File information
File name SPIKBLAD01.pdfFile size 125 kBChecksum SHA-512
2f3af236f7db860197d6388af9e4961543b37b35a1439d9eb5b9093f4b5e4b413253440771a6fc728a5c3f4fa111ed516374e70e0868000ad5f62566d9c8fe30
Type spikbladMimetype application/pdf

Authority records

Iannotta, Marco

Search in DiVA

By author/editor
Iannotta, Marco
By organisation
School of Science and Technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 10405 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf