We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Topical Phrase Extraction from Clinical Reports by Incorporating both Local and Global Context

Abstract: Making sense of words often requires to simultaneously examine the surrounding context of a term as well as the global themes characterizing the overall corpus. Several topic models have already exploited word embeddings to recognize local context, however, it has been weakly combined with the global context during the topic inference. This paper proposes to extract topical phrases corroborating the word embedding information with the global context detected by Latent Semantic Analysis, and then combine them by means of the P\'{o}lya urn model. To highlight the effectiveness of this combined approach the model was assessed analyzing clinical reports, a challenging scenario characterized by technical jargon and a limited word statistics available. Results show it outperforms the state-of-the-art approaches in terms of both topic coherence and computational cost.
Comments: The 2nd AAAI Workshop on Health Intelligence, AAAI18
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1911.10180 [cs.CL]
  (or arXiv:1911.10180v1 [cs.CL] for this version)

Submission history

From: Gabriele Pergola [view email]
[v1] Fri, 22 Nov 2019 18:29:19 GMT (542kb,D)

Link back to: arXiv, form interface, contact.