To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards Task-Prioritized Policy Composition
Örebro University, School of Science and Technology.ORCID iD: 0000-0001-8151-4692
Örebro University, School of Science and Technology.ORCID iD: 0000-0002-0804-8637
Örebro University, School of Science and Technology.ORCID iD: 0000-0002-6013-4874
Örebro University, School of Science and Technology.ORCID iD: 0000-0003-3958-6179
2022 (English)Conference paper, Oral presentation with published abstract (Refereed)
Abstract [en]

Combining learned policies in a prioritized, ordered manner is desirable because it allows for modular design and facilitates data reuse through knowledge transfer. In control theory, prioritized composition is realized by null-space control, where low-priority control actions are projected into the null-space of high-priority control actions. Such a method is currently unavailable for Reinforcement Learning. We propose a novel, task-prioritized composition framework for Reinforcement Learning, which involves a novel concept: The indifferent-space of Reinforcement Learning policies. Our framework has the potential to facilitate knowledge transfer and modular design while greatly increasing data efficiency and data reuse for Reinforcement Learning agents. Further, our approach can ensure high-priority constraint satisfaction, which makes it promising for learning in safety-critical domains like robotics. Unlike null-space control, our approach allows learning globally optimal policies for the compound task by online learning in the indifference-space of higher-level policies after initial compound policy construction. 

Place, publisher, year, edition, pages
2022.
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:oru:diva-102120OAI: oai:DiVA.org:oru-102120DiVA, id: diva2:1709450
Conference
35th IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022), Kyoto, Japan, October 24-26, 2022
Available from: 2022-11-08 Created: 2022-11-08 Last updated: 2024-01-03Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Rietz, FinnSchaffernicht, ErikStoyanov, TodorStork, Johannes Andreas

Search in DiVA

By author/editor
Rietz, FinnSchaffernicht, ErikStoyanov, TodorStork, Johannes Andreas
By organisation
School of Science and Technology
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 178 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf