Chinese grammatical error correction based on knowledge distillation

Xia, Peng; Zhou, Yuechi; Zhang, Ziyan; Tang, Zecheng; Li, Juntao

(Submitted on 31 Jul 2022 (v1), last revised 31 Aug 2022 (this version, v4))

Abstract: Existing Chinese grammatical error correction models have large parameter counts and show poor robustness on attack test sets. This paper uses knowledge distillation to compress the model and improve its resistance to attacks. The attack test set is constructed by injecting perturbations into the standard evaluation dataset, and model robustness is evaluated on it. Experimental results show that the distilled small model maintains performance and improves training speed while reducing the number of parameters, achieves the best results on the attack test set, and is significantly more robust. Code is available at this https URL
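
The abstract does not state which distillation objective the authors use. As a minimal sketch of one common formulation, the temperature-scaled soft-target loss, the following shows how a small student model can be trained to match a larger teacher's output distribution for a single token. The names `softmax`, `distillation_loss`, `temperature`, and `alpha` are illustrative assumptions and are not taken from the paper or its code.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert raw scores to probabilities, softened by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, gold_index,
                      temperature=2.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term.

    Assumption: this is the generic soft-target objective, not
    necessarily the objective used in the paper.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL(teacher || student) over the softened distributions.
    soft = sum(pt * math.log(pt / ps)
               for pt, ps in zip(p_teacher, p_student) if pt > 0)
    # Cross-entropy of the student against the gold token at temperature 1.
    hard = -math.log(softmax(student_logits)[gold_index])
    # The temperature**2 factor keeps the soft term's gradient scale comparable.
    return alpha * temperature ** 2 * soft + (1 - alpha) * hard


teacher = [2.0, 0.5, -1.0]
student = [1.0, 1.0, 0.0]
print(distillation_loss(student, teacher, gold_index=0))
```

With `alpha=1.0` only the soft-target term remains, so the loss is zero when the student reproduces the teacher's logits exactly and positive otherwise.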

Comments:	9 pages, 4 figures, 5 tables
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2208.00351 [cs.CL]
	(or arXiv:2208.00351v4 [cs.CL] for this version)

Submission history

From: Peng Xia
[v1] Sun, 31 Jul 2022 03:16:29 GMT (501kb)
[v2] Fri, 5 Aug 2022 02:00:28 GMT (0kb,I)
[v3] Sun, 28 Aug 2022 17:26:02 GMT (1076kb,D)
[v4] Wed, 31 Aug 2022 07:48:31 GMT (1076kb,D)
