We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model

Abstract: As large dialogue models become commonplace in practice, the problems surrounding high compute requirements for training, inference and larger memory footprint still persists. In this work, we present AUTODIAL, a multi-task dialogue model that addresses the challenges of deploying dialogue model. AUTODIAL utilizes parallel decoders to perform tasks such as dialogue act prediction, domain prediction, intent prediction, and dialogue state tracking. Using classification decoders over generative decoders allows AUTODIAL to significantly reduce memory footprint and achieve faster inference times compared to existing generative approach namely SimpleTOD. We demonstrate that AUTODIAL provides 3-6x speedups during inference while having 11x fewer parameters on three dialogue tasks compared to SimpleTOD. Our results show that extending current dialogue models to have parallel decoders can be a viable alternative for deploying them in resource-constrained environments.
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2303.06245 [cs.CL]
  (or arXiv:2303.06245v2 [cs.CL] for this version)

Submission history

From: Prajjwal Bhargava [view email]
[v1] Fri, 10 Mar 2023 23:34:14 GMT (6691kb,D)
[v2] Tue, 2 May 2023 00:24:51 GMT (6693kb,D)

Link back to: arXiv, form interface, contact.