
Title: Planning with Large Language Models via Corrective Re-prompting

Abstract: Extracting the commonsense knowledge present in Large Language Models (LLMs) offers a path to designing intelligent, embodied agents. Related works have queried LLMs with a wide range of contextual information, such as goals, sensor observations and scene descriptions, to generate high-level action plans for specific tasks; however, these approaches often involve human intervention or additional machinery to enable sensor-motor interactions. In this work, we propose a prompting-based strategy for extracting executable plans from an LLM, which leverages a novel and readily accessible source of information: precondition errors. Our approach assumes that actions are only afforded execution in certain contexts, i.e., implicit preconditions must be met for an action to execute (e.g., a door must be unlocked to open it), and that the embodied agent can determine whether an action is executable in the current context (e.g., detect whether a precondition error is present). When an agent is unable to execute an action, our approach re-prompts the LLM with precondition error information to extract an executable corrective action that achieves the intended goal in the current context. We evaluate our approach in the VirtualHome simulation environment on 88 different tasks and 7 scenes. We evaluate different prompt templates and compare against methods that naively re-sample actions from the LLM. Our approach, using precondition errors, improves the executability and semantic correctness of plans, while also reducing the number of re-prompts required when querying actions.
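
The re-prompting loop described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: `ToyEnv`, `stub_llm`, `plan_with_reprompting`, and the prompt wording are hypothetical and are not the paper's prompt templates, the VirtualHome API, or a real LLM client. The sketch only shows the control flow: query for an action, try to execute it, and on a precondition error feed the error text back into the next prompt.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple


@dataclass
class ToyEnv:
    """Hypothetical one-door world (an assumption, not the VirtualHome API)."""
    door_unlocked: bool = False
    door_open: bool = False

    def try_execute(self, action: str) -> Optional[str]:
        """Return None on success, otherwise a precondition error message."""
        if action == "unlock door":
            self.door_unlocked = True
            return None
        if action == "open door":
            if not self.door_unlocked:
                return "door is locked"
            self.door_open = True
            return None
        return "unknown action"


def stub_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call; it reads only the last prompt line."""
    last_line = prompt.strip().splitlines()[-1]
    if last_line.startswith("Error:") and "door is locked" in last_line:
        return "unlock door"  # corrective action for the reported error
    return "open door"


def plan_with_reprompting(
    goal: str,
    env: ToyEnv,
    llm: Callable[[str], str],
    is_done: Callable[[ToyEnv], bool],
    max_queries: int = 10,
) -> Tuple[List[str], int]:
    """Build an executable plan, re-prompting with error text after each failure."""
    history = [f"Task: {goal}"]
    plan: List[str] = []
    reprompts = 0
    for _ in range(max_queries):
        if is_done(env):
            break
        action = llm("\n".join(history))
        error = env.try_execute(action)
        if error is None:
            plan.append(action)
            history.append(f"Executed: {action}")
        else:
            # Precondition error: add it to the prompt and query again.
            reprompts += 1
            history.append(f"Error: {action} failed because {error}")
    return plan, reprompts


plan, reprompts = plan_with_reprompting(
    "open the door", ToyEnv(), stub_llm, is_done=lambda e: e.door_open
)
print(plan, reprompts)
```

In this toy run the first proposal ("open door") fails on the locked-door precondition, the error is appended to the prompt, and the stub then returns the corrective "unlock door" step before retrying. A naive re-sampling baseline would instead repeat the query without the error text.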
Comments: 21 pages, 7 figures, Accepted to Foundation Models for Decision Making Workshop at Neural Information Processing Systems 2022
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Robotics (cs.RO)
MSC classes: 68T20, 68T50
ACM classes: I.2.7; I.2.8; I.2.2; I.2.4
Cite as: arXiv:2211.09935 [cs.AI]
  (or arXiv:2211.09935v1 [cs.AI] for this version)

Submission history

From: Shreyas Sundara Raman
[v1] Thu, 17 Nov 2022 23:14:51 GMT (4624kb,D)
[v2] Mon, 23 Oct 2023 03:56:16 GMT (18486kb,D)
[v3] Sat, 9 Mar 2024 13:53:47 GMT (18749kb,D)
