P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts

Newman, Benjamin; Choubey, Prafulla Kumar; Rajani, Nazneen

(Submitted on 14 Oct 2021 (this version), latest version 19 Apr 2022 (v2))

Abstract: Recent work (e.g., LAMA (Petroni et al., 2019)) has found that the quality of the factual information extracted from Large Language Models (LLMs) depends on the prompts used to query them. This inconsistency is problematic because different users will query LLMs for the same information using different wording, but should receive the same, accurate responses regardless. In this work we aim to address this shortcoming by introducing P-Adapters: lightweight models that sit between the embedding layer and first attention layer of LLMs. They take LLM embeddings as input and output continuous prompts that are used to query the LLM. Additionally, we investigate Mixture of Experts (MoE) models that learn a set of continuous prompts ("experts") and select one to query the LLM. They require a separate classifier trained on human-annotated data to map natural language prompts to the continuous ones. P-Adapters perform comparably to the more complex MoE models in extracting factual information from BERT and RoBERTa while eliminating the need for additional annotations. P-Adapters show a 12-26% absolute improvement in precision and a 36-50% absolute improvement in consistency over a baseline of only using natural language queries. Finally, we investigate what makes a P-Adapter successful and conclude that access to the LLM's embeddings of the original natural language prompt, particularly the subject of the entity pair being asked about, is a significant factor.
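
The data flow described in the abstract (embedding layer, then a P-Adapter, then the rest of the LLM) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `ToyPAdapter` is a hypothetical per-token residual linear map, not one of the architectures evaluated in the paper, and `embed` and `encoder` are stand-ins for a frozen LLM's embedding layer and its remaining layers.

```python
import random
from typing import Callable, List

Vector = List[float]


def matvec(weights: List[Vector], x: Vector) -> Vector:
    """Multiply a (dim x dim) matrix by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


class ToyPAdapter:
    """Hypothetical adapter: maps each token embedding e to e + W e + b.

    The output has the same shape as the input, so it can be passed to the
    frozen model in place of the original embeddings (a "continuous prompt").
    """

    def __init__(self, dim: int, seed: int = 0, scale: float = 0.01) -> None:
        rng = random.Random(seed)
        # Small random weights so the adapter starts close to the identity.
        self.W = [[rng.gauss(0.0, scale) for _ in range(dim)] for _ in range(dim)]
        self.b = [0.0] * dim

    def __call__(self, embeddings: List[Vector]) -> List[Vector]:
        out = []
        for e in embeddings:
            delta = matvec(self.W, e)
            out.append([ei + di + bi for ei, di, bi in zip(e, delta, self.b)])
        return out


def query_with_adapter(
    embed: Callable[[List[int]], List[Vector]],  # stand-in: LLM embedding layer
    adapter: Callable[[List[Vector]], List[Vector]],
    encoder: Callable[[List[Vector]], object],   # stand-in: rest of the LLM
    token_ids: List[int],
):
    """Embed the natural language prompt, rewrite it, then query the model."""
    embeddings = embed(token_ids)
    continuous_prompt = adapter(embeddings)
    return encoder(continuous_prompt)
```

In this sketch only the adapter's `W` and `b` would be trained while the LLM stays frozen; the training loop, the loss, and the real model are deliberately left out.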

Comments:	15 pages, 6 figures, 4 tables
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2110.07280 [cs.CL]
	(or arXiv:2110.07280v1 [cs.CL] for this version)

Submission history

From: Benjamin Newman
[v1] Thu, 14 Oct 2021 11:32:22 GMT (213kb,D)
[v2] Tue, 19 Apr 2022 07:12:44 GMT (174kb,D)
