Title: Zero-Shot Text Matching for Automated Auditing using Sentence Transformers

Abstract: Natural language processing methods have several applications in automated auditing, including document or passage classification, information retrieval, and question answering. However, training such models requires large amounts of annotated data, which are scarce in industrial settings. At the same time, techniques like zero-shot and unsupervised learning allow models pre-trained on general-domain data to be applied to unseen domains.
In this work, we study the efficiency of unsupervised text matching using Sentence-BERT, a transformer-based model, by applying it to the semantic similarity of financial passages. Experimental results show that this model is robust to documents from in- and out-of-domain data.
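
As an illustration of the general idea of unsupervised text matching by semantic similarity, the following is a minimal sketch and not the authors' implementation. It ranks candidate passages against a query by cosine similarity of their vector representations. The function names (`toy_embed`, `cosine`, `best_match`) and the example texts are hypothetical; in particular, `toy_embed` is a bag-of-words stand-in used only to keep the sketch self-contained, and it would be replaced by embeddings from a pre-trained sentence encoder such as Sentence-BERT.

```python
import math
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for a sentence encoder (assumption, not Sentence-BERT).

    Returns a sparse bag-of-words count vector keyed by lowercase token.
    """
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as token-count mappings."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def best_match(query, passages, embed=toy_embed):
    """Return (index, score) of the passage most similar to the query.

    No labeled training data is used: matching relies only on the embeddings.
    """
    q = embed(query)
    scores = [cosine(q, embed(p)) for p in passages]
    best = max(range(len(passages)), key=scores.__getitem__)
    return best, scores[best]


# Hypothetical example: match an audit requirement to report passages.
requirement = "disclose the depreciation method for fixed assets"
passages = [
    "The company employs 120 people in three offices.",
    "Fixed assets are depreciated using the straight-line depreciation method.",
]
index, score = best_match(requirement, passages)
print(index, round(score, 3))
```

Because the encoder is passed in through the `embed` parameter, the matching logic stays the same when the toy vectors are swapped for dense sentence embeddings; only `cosine` would need to operate on dense vectors instead of token counts.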
Comments: To be published in the proceedings of the IEEE International Conference on Machine Learning and Applications (IEEE ICMLA 2022)
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2211.07716 [cs.CL]
  (or arXiv:2211.07716v1 [cs.CL] for this version)

Submission history

From: David Biesner
[v1] Fri, 28 Oct 2022 11:52:16 GMT (184kb,D)
