New submissions for Tue, 7 Dec 21

[1]  arXiv:2112.02175 [pdf]
Title: The h-index
Comments: In book: Handbook Bibliometrics | Edition: De Gruyter Reference | Chapter: 3.4 | Publisher: De Gruyter Saur
Subjects: Digital Libraries (cs.DL); Social and Information Networks (cs.SI)

The h-index is a mainstream bibliometric indicator, widely used in academia, research management and research policy. While its advantages, such as its simple calculation, have been highlighted, it has also received widespread criticism. The criticism is mainly based on the negative effects it may have on scholars when the index is used to describe the quality of a scholar. The "h" means "highly-cited" and "high achievement", and should not be confused with the last name of its inventor, Hirsch. Put simply, the h-index combines a measure of quantity and impact in a single indicator. Several initiatives try to provide alternatives to the h-index to counter some of its shortcomings.
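
The abstract does not spell out the calculation. As a minimal sketch, assuming the standard definition (the largest h such that h of a scholar's papers have at least h citations each), it could be computed as follows. The function name and the citation counts are hypothetical and chosen only for illustration.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            # The top `rank` papers all have at least `rank` citations.
            h = rank
        else:
            # Counts only decrease from here, so no larger h is possible.
            break
    return h


# Hypothetical citation counts for two scholars with five papers each.
print(h_index([10, 8, 5, 4, 3]))
print(h_index([25, 8, 5, 3, 3]))
```

The second list has more total citations than the first but a lower h-index, which shows how the indicator blends quantity (number of papers) with impact (citations per paper) rather than rewarding either one alone.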

[2]  arXiv:2112.02183 [pdf]
Title: International Conferences of Bibliometrics
Comments: In book: Handbook Bibliometrics | Edition: De Gruyter Reference | Chapter: 1.6 | Publisher: De Gruyter Saur
Subjects: Digital Libraries (cs.DL); Social and Information Networks (cs.SI)

Conferences are deeply connected to research fields, in this case bibliometrics. As such, they are a venue to present and discuss current and innovative research, and play an important role in the scholarly community. In this article, we provide an overview of the history of conferences in bibliometrics. We conduct an analysis to list the most prominent conferences that were announced in the newsletter of ISSI, the International Society for Scientometrics and Informetrics. Furthermore, we describe how conferences are connected to learned societies and journals. Finally, we provide an outlook on how conferences might change in the future.

[3]  arXiv:2112.02471 [pdf]
Title: Grappling with the Scale of Born-Digital Government Publications: Toward Pipelines for Processing and Searching Millions of PDFs
Comments: 22 pages, 4 figures
Subjects: Digital Libraries (cs.DL); Information Retrieval (cs.IR)

Official government publications are key sources for understanding the history of societies. Web publishing has fundamentally changed the scale and processes by which governments produce and disseminate information. Significantly, a range of web archiving programs have captured massive troves of government publications. For example, hundreds of millions of unique U.S. Government documents posted to the web in PDF form have been archived by libraries to date. Yet these PDFs remain largely unutilized and understudied, in part due to the challenges surrounding the development of scalable pipelines for searching and analyzing them. This paper uses a Library of Congress dataset of 1,000 government PDFs to offer initial approaches for searching and analyzing these PDFs at scale. In addition to demonstrating the utility of PDF metadata, this paper offers computationally efficient machine learning approaches to search and discovery that also use the PDFs' textual and visual features. We conclude by detailing how these methods can be operationalized at scale to support systems for navigating millions of PDFs.

[4]  arXiv:2112.02672 [pdf]
Title: Globalization of Scientific Communication: Evidence from authors in academic journals by country of origin
Authors: Vít Macháček
Subjects: Digital Libraries (cs.DL); General Economics (econ.GN)

This study measures the tendency to publish in international scientific journals. For each of nearly 35 thousand Scopus-indexed journals, we derive seven globalization indicators based on the composition of authors by country of origin and other characteristics. These are subsequently scaled up to the level of 174 countries and 27 disciplines between 2005 and 2017. The results indicate that advanced countries maintain high globalization of scientific communication that does not vary across disciplines. Social sciences and health sciences are less globalized than physical and life sciences. Countries of the former Soviet bloc score far lower on the globalization measures, especially in social sciences or health sciences. Russia remains among the least globalized during the whole period, with no upward trend. In contrast, China has profoundly globalized its science system, gradually moving from the lowest globalization figures to the world average. The paper concludes with reflections on measurement issues and policy implications.

Replacements for Tue, 7 Dec 21

[5]  arXiv:2112.01181 (replaced) [pdf, other]
Title: LDA2Net: Digging under the surface of COVID-19 topics in scientific literature
Subjects: Digital Libraries (cs.DL); Information Retrieval (cs.IR)
[6]  arXiv:2108.06503 (replaced) [pdf]
Title: Packaging research artefacts with RO-Crate
Comments: 44 pages. Accepted for Data Science
Subjects: Digital Libraries (cs.DL)
