We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Memory-assisted prompt editing to improve GPT-3 after deployment

Abstract: Large LMs such as GPT-3, while powerful, are not immune to mistakes, but are prohibitively costly to retrain. One failure mode is misinterpreting a user's instruction (e.g., GPT-3 interpreting "What word is similar to good?" to mean a homonym, while the user intended a synonym). Our goal is to allow users to correct such errors directly through interaction -- without retraining. Our approach pairs GPT-3 with a growing memory of cases where the model misunderstood the user's intent and was provided with feedback, clarifying the instruction. Given a new query, our memory-enhanced GPT-3 uses feedback from similar, prior queries to enrich the prompt. Through simple proof-of-concept experiments, we show how a (simulated) user can interactively teach a deployed GPT-3, doubling its accuracy on basic lexical tasks (e.g., generate a synonym) where users query in different, novel (often misunderstood) ways. In such scenarios, memory helps avoid repeating similar past mistakes. Our simple idea is a first step towards strengthening deployed models, potentially broadening their utility. All the code and data is available at this https URL
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2201.06009 [cs.CL]
  (or arXiv:2201.06009v2 [cs.CL] for this version)

Submission history

From: Aman Madaan [view email]
[v1] Sun, 16 Jan 2022 10:11:37 GMT (3993kb,D)
[v2] Fri, 25 Feb 2022 14:05:09 GMT (4068kb,D)
[v3] Fri, 4 Mar 2022 16:47:00 GMT (4073kb,D)
[v4] Wed, 16 Mar 2022 05:49:45 GMT (5239kb,D)
[v5] Mon, 24 Oct 2022 04:52:42 GMT (7038kb,D)
[v6] Sun, 30 Oct 2022 21:02:16 GMT (7038kb,D)
[v7] Sat, 18 Feb 2023 05:49:01 GMT (7038kb,D)

Link back to: arXiv, form interface, contact.