References & Citations
Computer Science > Computation and Language
Title: Memory-assisted prompt editing to improve GPT-3 after deployment
(Submitted on 16 Jan 2022 (v1), revised 25 Feb 2022 (this version, v2), latest version 18 Feb 2023 (v7))
Abstract: Large LMs such as GPT-3, while powerful, are not immune to mistakes, but are prohibitively costly to retrain. One failure mode is misinterpreting a user's instruction (e.g., GPT-3 interpreting "What word is similar to good?" to mean a homonym, while the user intended a synonym). Our goal is to allow users to correct such errors directly through interaction -- without retraining. Our approach pairs GPT-3 with a growing memory of cases where the model misunderstood the user's intent and was provided with feedback, clarifying the instruction. Given a new query, our memory-enhanced GPT-3 uses feedback from similar, prior queries to enrich the prompt. Through simple proof-of-concept experiments, we show how a (simulated) user can interactively teach a deployed GPT-3, doubling its accuracy on basic lexical tasks (e.g., generate a synonym) where users query in different, novel (often misunderstood) ways. In such scenarios, memory helps avoid repeating similar past mistakes. Our simple idea is a first step towards strengthening deployed models, potentially broadening their utility. All the code and data is available at this https URL
Submission history
From: Aman Madaan [view email][v1] Sun, 16 Jan 2022 10:11:37 GMT (3993kb,D)
[v2] Fri, 25 Feb 2022 14:05:09 GMT (4068kb,D)
[v3] Fri, 4 Mar 2022 16:47:00 GMT (4073kb,D)
[v4] Wed, 16 Mar 2022 05:49:45 GMT (5239kb,D)
[v5] Mon, 24 Oct 2022 04:52:42 GMT (7038kb,D)
[v6] Sun, 30 Oct 2022 21:02:16 GMT (7038kb,D)
[v7] Sat, 18 Feb 2023 05:49:01 GMT (7038kb,D)
Link back to: arXiv, form interface, contact.