Title: A Simple yet Brisk and Efficient Active Learning Platform for Text Classification

Abstract: In this work, we propose a fully managed machine learning service that uses active learning to build models directly from unstructured data. With this tool, business users can quickly build machine learning models and deploy them directly into a production-ready hosted environment with little involvement from data scientists. Our approach combines state-of-the-art text representations, such as OpenAI's GPT-2, with a fast implementation of the active learning workflow based on a simple construction of incremental learning with linear models, giving users a brisk and efficient labeling experience. Experiments on both publicly available and real-life insurance datasets empirically show why our choices of simple and fast classification algorithms are ideal for the task at hand.
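
The workflow described in the abstract (fixed text representations, a linear model updated incrementally, and an active learning loop) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the query strategy (uncertainty sampling), the model (binary logistic regression trained with SGD), the hypothetical names `IncrementalLogReg`, `most_uncertain`, and `run_demo`, and the synthetic 2-D vectors that stand in for precomputed GPT-2 embeddings.

```python
import math
import random


def sigmoid(z):
    # Clamp the score to avoid overflow in math.exp.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


class IncrementalLogReg:
    """Binary logistic regression updated one labeled example at a time.

    Hypothetical stand-in for the "incremental learning using linear
    models" mentioned in the abstract; not the authors' implementation.
    """

    def __init__(self, dim, lr=0.5):
        self.w = [0.0] * dim
        self.b = 0.0
        self.lr = lr

    def predict_proba(self, x):
        return sigmoid(sum(wi * xi for wi, xi in zip(self.w, x)) + self.b)

    def partial_fit(self, x, y, epochs=5):
        # A few SGD steps on the new example only; no retraining from scratch.
        for _ in range(epochs):
            err = self.predict_proba(x) - y
            self.w = [wi - self.lr * err * xi for wi, xi in zip(self.w, x)]
            self.b -= self.lr * err


def most_uncertain(model, pool):
    # Uncertainty sampling (assumed strategy): choose the unlabeled item
    # whose predicted probability is closest to 0.5.
    return min(pool, key=lambda i: abs(model.predict_proba(pool[i]) - 0.5))


def run_demo(n_points=200, n_queries=30, seed=0):
    rng = random.Random(seed)
    # Synthetic 2-D features standing in for precomputed text embeddings.
    feats, labels = [], []
    for _ in range(n_points):
        y = rng.randint(0, 1)
        center = 1.5 if y == 1 else -1.5
        feats.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
        labels.append(y)

    pool = dict(enumerate(feats))  # index -> features of unlabeled items
    model = IncrementalLogReg(dim=2)
    for _ in range(n_queries):
        i = most_uncertain(model, pool)
        x = pool.pop(i)
        # labels[i] simulates the answer a human annotator would give.
        model.partial_fit(x, labels[i])

    correct = sum(
        (model.predict_proba(x) >= 0.5) == bool(y)
        for x, y in zip(feats, labels)
    )
    return correct / n_points, len(pool)


if __name__ == "__main__":
    accuracy, remaining = run_demo()
    print(f"accuracy={accuracy:.3f}, unlabeled items left={remaining}")
```

Because each new label triggers only a few cheap gradient steps rather than a full retrain, the next query can be selected almost immediately, which is one plausible reading of the "brisk" labeling experience the abstract refers to.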
Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Information Retrieval (cs.IR)
Cite as: arXiv:2102.00426 [cs.LG]
  (or arXiv:2102.00426v1 [cs.LG] for this version)

Submission history

From: Teja Kanchinadam
[v1] Sun, 31 Jan 2021 10:44:04 GMT (4186kb,D)
