To Örebro University

oru.seÖrebro University Publications
System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Diversity for Contingency: Learning Diverse Behaviors for Efficient Adaptation and Transfer
Örebro University, School of Science and Technology. (Adaptive and Interpretable Learning Systems Lab)ORCID iD: 0000-0001-8151-4692
Örebro University, School of Science and Technology. (Adaptive and Interpretable Learning Systems Lab)ORCID iD: 0000-0003-3958-6179
2023 (English)Conference paper, Oral presentation with published abstract (Refereed)
Abstract [en]

Discovering all useful solutions for a given task is crucial for transferable RL agents, to account for changes in the task or transition dynamics. This is not considered by classical RL algorithms that are only concerned with finding the optimal policy, given the current task and dynamics. We propose a simple method for discovering all possible solutions of a given task, to obtain an agent that performs well in the transfer setting and adapts quickly to changes in the task or transition dynamics. Our method iteratively learns a set of policies, while each subsequent policy is constrained to yield a solution that is unlikely under all previous policies. Unlike prior methods, our approach does not require learning additional models for novelty detection and avoids balancing task and novelty reward signals, by directly incorporating the constraint into the action selection and optimization steps. 

Place, publisher, year, edition, pages
2023.
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:oru:diva-112199DOI: 10.48550/arXiv.2310.07493OAI: oai:DiVA.org:oru-112199DiVA, id: diva2:1842996
Conference
2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023), Detroit, MI, USA, October 1-5, 2023
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

Presented at the third RL-Conform workshop at IROS 2023.

Available from: 2024-03-07 Created: 2024-03-07 Last updated: 2024-03-11Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Rietz, FinnStork, Johannes Andreas

Search in DiVA

By author/editor
Rietz, FinnStork, Johannes Andreas
By organisation
School of Science and Technology
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 49 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf