Open this publication in new window or tab >>2024 (English)In: Proceedings of the 38th AAAI Conference on Artificial Intelligence / [ed] Michael Wooldridge; Jennifer Dy; Sriraam Natarajan, AAAI Press, 2024, Vol. 38, p. 20123-20133Conference paper, Published paper (Refereed)
Abstract [en]
Large Language Models (LLMs) have demonstrated impressive planning abilities due to their vast "world knowledge". Yet, obtaining plans that are both feasible (grounded in affordances) and cost-effective (in plan length), remains a challenge, despite recent progress. This contrasts with heuristic planning methods that employ domain knowledge (formalized in action models such as PDDL) and heuristic search to generate feasible, optimal plans. Inspired by this, we propose to combine the power of LLMs and heuristic planning by leveraging the world knowledge of LLMs and the principles of heuristic search. Our approach, SayCanPay, employs LLMs to generate actions (Say) guided by learnable domain knowledge, that evaluates actions' feasibility (Can) and long-term reward/payoff (Pay), and heuristic search to select the best sequence of actions. Our contributions are (1) a novel framing of the LLM planning problem in the context of heuristic planning, (2) integrating grounding and cost-effective elements into the generated plans, and (3) using heuristic search over actions. Our extensive evaluations show that our model surpasses other LLM planning approaches.
Place, publisher, year, edition, pages
AAAI Press, 2024
Series
Proceedings of the AAAI Conference on Artificial Intelligence, ISSN 2159-5399, E-ISSN 2374-3468 ; 38:18
National Category
Computer Sciences
Identifiers
urn:nbn:se:oru:diva-115501 (URN)10.1609/aaai.v38i18.29991 (DOI)001241509500037 ()2-s2.0-85189544071 (Scopus ID)9781577358879 (ISBN)
Conference
38th AAAI Conference on Artificial Intelligence (AAAI) / 36th Conference on Innovative Applications of Artificial Intelligence / 14th Symposium on Educational Advances in Artificial Intelligence, Vancouver, Canada, February 20-27, 2024
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationEU, Horizon 2020, 952215
Note
This work was supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation, and is also part of the EU H2020 ICT48 project “TAILOR” under contract 952215, and the KU Leuven Research Fund (C14/18/062).
2024-08-212024-08-212025-09-01Bibliographically approved