Title: ReadOnce Transformers: Reusable Representations of Text for Transformers

Abstract: We present ReadOnce Transformers, an approach to convert a transformer-based model into one that can build an information-capturing, task-independent, and compressed representation of text. The resulting representation is reusable across different examples and tasks, thereby requiring a document shared across many examples or tasks to be read only once. This leads to faster training and evaluation of models. Additionally, we extend standard text-to-text transformer models to Representation+Text-to-text models, and evaluate on multiple downstream tasks: multi-hop QA, abstractive QA, and long-document summarization. Our one-time computed representation results in a 2x-5x speedup compared to standard text-to-text models, while the compression also allows existing language models to handle longer documents without the need for designing new pre-trained models.
Comments: Accepted to ACL 2021 (Camera Ready)
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as: arXiv:2010.12854 [cs.CL]
  (or arXiv:2010.12854v2 [cs.CL] for this version)

Submission history

From: Tushar Khot
[v1] Sat, 24 Oct 2020 09:53:16 GMT (359kb,D)
[v2] Tue, 3 Aug 2021 18:33:45 GMT (624kb,D)