We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: A Pattern-mining Driven Study on Differences of Newspapers in Expressing Temporal Information

Abstract: This paper studies the differences between different types of newspapers in expressing temporal information, which is a topic that has not received much attention. Techniques from the fields of temporal processing and pattern mining are employed to investigate this topic. First, a corpus annotated with temporal information is created by the author. Then, sequences of temporal information tags mixed with part-of-speech tags are extracted from the corpus. The TKS algorithm is used to mine skip-gram patterns from the sequences. With these patterns, the signatures of the four newspapers are obtained. In order to make the signatures uniquely characterize the newspapers, we revise the signatures by removing reference patterns. Through examining the number of patterns in the signatures and revised signatures, the proportion of patterns containing temporal information tags and the specific patterns containing temporal information tags, it is found that newspapers differ in ways of expressing temporal information.
Comments: 19 pages
Subjects: Computation and Language (cs.CL)
Journal reference: David C. Wyld et al. (Eds): NLP, JSE, MLTEC, DMS, NeTIOT, ITCS, SIP, CST, ARIA - 2020 pp. 111-129, 2020. CS & IT - CSCP 2020
DOI: 10.5121/csit.2020.101409
Cite as: arXiv:2011.12265 [cs.CL]
  (or arXiv:2011.12265v1 [cs.CL] for this version)

Submission history

From: Yingxue Fu [view email]
[v1] Tue, 24 Nov 2020 18:20:24 GMT (1169kb)

Link back to: arXiv, form interface, contact.