Title: Concept Identification of Directly and Indirectly Related Mentions Referring to Groups of Persons

Abstract: Unsupervised concept identification through clustering, i.e., the identification of semantically related words and phrases, is a common approach to identifying contextual primitives used in various applications, e.g., text dimension reduction (replacing words with their concepts to reduce the vocabulary size), summarization, and named entity resolution. We present the first results of an unsupervised approach for identifying groups of persons as actors extracted from a set of related articles. Specifically, the approach clusters mentions of groups of persons that act as non-named-entity actors in the texts, e.g., "migrant families" = "asylum-seekers." Compared to our baseline, the approach keeps mentions of different geopolitical entities separate, e.g., "Iran leaders" != "European leaders," and clusters directly and indirectly related mentions with diverse wording, e.g., "American officials" = "Trump Administration."
Subjects: Computation and Language (cs.CL)
Journal reference: Diversity, Divergence, Dialogue (2021) 514-526
DOI: 10.1007/978-3-030-71292-1_40
Cite as: arXiv:2107.00955 [cs.CL]
  (or arXiv:2107.00955v1 [cs.CL] for this version)

Submission history

From: Anastasia Zhukova
[v1] Fri, 2 Jul 2021 10:38:43 GMT (664kb,D)
