Model Stability with Continuous Data Updates

Liu, Huiting; S., Avinesh P. V.; Patwardhan, Siddharth; Grasch, Peter; Agarwal, Sachin

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2201

Change to browse by:

Computer Science > Computation and Language

Title: Model Stability with Continuous Data Updates

Authors: Huiting Liu, Avinesh P.V.S., Siddharth Patwardhan, Peter Grasch, Sachin Agarwal

(Submitted on 14 Jan 2022)

Abstract: In this paper, we study the "stability" of machine learning (ML) models within the context of larger, complex NLP systems with continuous training data updates. For this study, we propose a methodology for the assessment of model stability (which we refer to as jitter under various experimental conditions. We find that model design choices, including network architecture and input representation, have a critical impact on stability through experiments on four text classification tasks and two sequence labeling tasks. In classification tasks, non-RNN-based models are observed to be more stable than RNN-based ones, while the encoder-decoder model is less stable in sequence labeling tasks. Moreover, input representations based on pre-trained fastText embeddings contribute to more stability than other choices. We also show that two learning strategies -- ensemble models and incremental training -- have a significant influence on stability. We recommend ML model designers account for trade-offs in accuracy and jitter when making modeling choices.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2201.05692 [cs.CL]
	(or arXiv:2201.05692v1 [cs.CL] for this version)

Submission history

From: Avinesh P.V.S. [view email]
[v1] Fri, 14 Jan 2022 22:11:16 GMT (130kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.05692v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Model Stability with Continuous Data Updates

Submission history