We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Digital Libraries

Title: The State of NLP Literature: A Diachronic Analysis of the ACL Anthology

Abstract: The ACL Anthology (AA) is a digital repository of tens of thousands of articles on Natural Language Processing (NLP). This paper examines the literature as a whole to identify broad trends in productivity, focus, and impact. It presents the analyses in a sequence of questions and answers. The goal is to record the state of the AA literature: who and how many of us are publishing? what are we publishing on? where and in what form are we publishing? and what is the impact of our publications? The answers are usually in the form of numbers, graphs, and inter-connected visualizations. Special emphasis is laid on the demographics and inclusiveness of NLP publishing. Notably, we find that only about 30% of first authors are female, and that this percentage has not improved since the year 2000. We also show that, on average, female first authors are cited less than male first authors, even when controlling for experience. We hope that recording citation and participation gaps across demographic groups will encourage more inclusiveness and fairness in research.
Subjects: Digital Libraries (cs.DL); Computation and Language (cs.CL)
Cite as: arXiv:1911.03562 [cs.DL]
  (or arXiv:1911.03562v1 [cs.DL] for this version)

Submission history

From: Saif Mohammad Dr. [view email]
[v1] Fri, 8 Nov 2019 22:15:32 GMT (8403kb,D)

Link back to: arXiv, form interface, contact.