We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

Abstract: Midrash collections are complex rabbinic works that consist of text in multiple languages, which evolved through long processes of unstable oral and written transmission. Determining the origin of a given passage in such a compilation is not always straightforward and is often a matter of dispute among scholars, yet it is essential for scholars' understanding of the passage and its relationship to other texts in the rabbinic corpus.
To help solve this problem, we propose a system for classification of rabbinic literature based on its style, leveraging recently released pretrained Transformer models for Hebrew. Additionally, we demonstrate how our method can be applied to uncover lost material from Midrash Tanhuma.
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2211.09710 [cs.CL]
  (or arXiv:2211.09710v1 [cs.CL] for this version)

Submission history

From: Shlomo Tannor [view email]
[v1] Thu, 17 Nov 2022 17:45:59 GMT (7341kb,D)
[v2] Wed, 24 May 2023 04:58:05 GMT (976kb,D)
[v3] Mon, 24 Jul 2023 05:39:27 GMT (1927kb,D)

Link back to: arXiv, form interface, contact.