Understanding BLOOM: An empirical study on diverse NLP tasks

Dakle, Parag Pravin; Rallabandi, SaiKrishna; Raghavan, Preethi

(Submitted on 27 Nov 2022 (this version), latest version 15 Mar 2023 (v2))

Abstract: In this work, we present an evaluation of smaller BLOOM model variants (350m/560m and 1b3/1b7) on various natural language processing tasks. The tasks include GLUE language understanding, prompt-based zero-shot and few-shot text classification and extraction, question answering, prompt-based text generation, and multilingual text classification, chosen to understand model strengths, weaknesses, and behavior. Empirical results show that BLOOM variants under-perform on all GLUE tasks (except WNLI), question answering, and text generation. The variants bloom for WNLI, with an accuracy of 56.3%, and for prompt-based few-shot text extraction on the MIT Movies and ATIS datasets. The BLOOM variants on average have 7% greater accuracy than GPT-2 and GPT-Neo models on Director and Airline Name extraction from the MIT Movies and ATIS datasets, respectively.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2211.14865 [cs.CL]
	(or arXiv:2211.14865v1 [cs.CL] for this version)

Submission history

From: Sai Krishna Rallabandi
[v1] Sun, 27 Nov 2022 15:48:14 GMT (7230kb,D)
[v2] Wed, 15 Mar 2023 03:54:14 GMT (10186kb,D)
