Title: Exploiting Language Model for Efficient Linguistic Steganalysis

Abstract: Recent work in linguistic steganalysis has successively applied CNNs, RNNs, GNNs, and other deep models to detect secret information in generated texts. These methods tend to seek stronger feature extractors to improve detection performance. However, our experiments show that automatically generated stego texts and carrier texts differ significantly in the conditional probability distribution of individual words. This difference can be naturally captured by the language model used to generate the stego texts. Through further experiments, we conclude that this ability can be transferred to a text classifier through pre-training and fine-tuning, improving detection performance. Motivated by this insight, we propose two methods for efficient linguistic steganalysis: one pre-trains an RNN-based language model, and the other pre-trains a sequence autoencoder. The results indicate that both methods yield performance gains of differing degrees over a randomly initialized RNN and significantly accelerate convergence. Moreover, our methods achieve state-of-the-art detection results.
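
As an illustration of the signal the abstract describes, the following minimal sketch scores a text by the average conditional log-probability of its words under a language model. It is not the authors' method: a smoothed bigram model stands in for the RNN language model, the pre-training and fine-tuning of a classifier are omitted, and the function names (`train_bigram`, `avg_log_prob`) and the toy sentences are hypothetical.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count contexts and bigrams over tokenized sentences.

    A bigram model is an assumption made for brevity; the abstract
    describes an RNN-based language model instead.
    """
    contexts, bigrams, vocab = Counter(), Counter(), set()
    for tokens in sentences:
        prev = "<s>"  # sentence-start marker
        for tok in tokens:
            contexts[prev] += 1
            bigrams[(prev, tok)] += 1
            vocab.add(tok)
            prev = tok
    return contexts, bigrams, vocab


def avg_log_prob(tokens, model):
    """Mean log P(w_t | w_{t-1}) with add-one smoothing."""
    contexts, bigrams, vocab = model
    v = len(vocab) + 1  # one extra slot reserves mass for unseen words
    total, prev = 0.0, "<s>"
    for tok in tokens:
        p = (bigrams[(prev, tok)] + 1) / (contexts[prev] + v)
        total += math.log(p)
        prev = tok
    return total / len(tokens)


# Toy "carrier" corpus (hypothetical data, for illustration only).
cover = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
]
model = train_bigram(cover)

natural = ["the", "cat", "sat", "on", "the", "rug"]
odd = ["rug", "the", "on", "sat", "cat", "the"]

# Text that follows the corpus statistics scores higher per word.
print(avg_log_prob(natural, model) > avg_log_prob(odd, model))
```

In a detector, a per-word score like this would be one input feature at most; the abstract's approach instead reuses the pre-trained language model's parameters to initialize a classifier, which this sketch does not attempt.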
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
Cite as: arXiv:2107.12168 [cs.CL]
  (or arXiv:2107.12168v2 [cs.CL] for this version)

Submission history

From: Hanzhou Wu
[v1] Mon, 26 Jul 2021 12:37:18 GMT (590kb,D)
[v2] Thu, 7 Oct 2021 13:27:01 GMT (2094kb,D)
