To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Planning in hybrid relational MDPs
Department of Computer Science, KU Leuven, Leuven, Belgium.
School of Informatics, University of Edinburgh, Edinburgh, England.
Faculty of Engineering Science, KU Leuven, Leuven, Belgium.
Department of Computer Science, KU Leuven, Leuven, Belgium.ORCID iD: 0000-0002-6860-6303
2017 (English)In: Machine Learning, ISSN 0885-6125, E-ISSN 1573-0565, Vol. 106, no 12, p. 1905-1932Article in journal (Refereed) Published
Abstract [en]

We study planning in relational Markov decision processes involving discrete and continuous states and actions, and an unknown number of objects. This combination of hybrid relational domains has so far not received a lot of attention. While both relational and hybrid approaches have been studied separately, planning in such domains is still challenging and often requires restrictive assumptions and approximations. We propose HYPE: a sample-based planner for hybrid relational domains that combines model-based approaches with state abstraction. HYPE samples episodes and uses the previous episodes as well as the model to approximate the Q-function. In addition, abstraction is performed for each sampled episode, this removes the complexity of symbolic approaches for hybrid relational domains. In our empirical evaluations, we show that HYPE is a general and widely applicable planner in domains ranging from strictly discrete to strictly continuous to hybrid ones, handles intricacies such as unknown objects and relational models. Moreover, empirical results showed that abstraction provides significant improvements.

Place, publisher, year, edition, pages
Springer, 2017. Vol. 106, no 12, p. 1905-1932
National Category
Mechanical Engineering
Identifiers
URN: urn:nbn:se:oru:diva-84420DOI: 10.1007/s10994-017-5669-xISI: 000415881500003Scopus ID: 2-s2.0-85029575014OAI: oai:DiVA.org:oru-84420DiVA, id: diva2:1452290
Conference
25th International Conference on Inductive Logic Programming (ILP), Kyoto Univ, Kyoto, JAPAN, August, 20-22, 2015.
Available from: 2020-07-06 Created: 2020-07-06 Last updated: 2020-08-21Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

De Raedt, Luc

Search in DiVA

By author/editor
De Raedt, Luc
In the same journal
Machine Learning
Mechanical Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 75 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf