We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: n-stage Latent Dirichlet Allocation: A Novel Approach for LDA

Abstract: Nowadays, data analysis has become a problem as the amount of data is constantly increasing. In order to overcome this problem in textual data, many models and methods are used in natural language processing. The topic modeling field is one of these methods. Topic modeling allows determining the semantic structure of a text document. Latent Dirichlet Allocation (LDA) is the most common method among topic modeling methods. In this article, the proposed n-stage LDA method, which can enable the LDA method to be used more effectively, is explained in detail. The positive effect of the method has been demonstrated by the applied English and Turkish studies. Since the method focuses on reducing the word count in the dictionary, it can be used language-independently. You can access the open-source code of the method and the example: this https URL
Comments: Published in: 2019 4th International Conference on Computer Science and Engineering (UBMK). This study is extension version of "Comparison of Topic Modeling Methods for Type Detection of Turkish News" this http URL . Please citation this IEEE paper
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
ACM classes: H.3.3; I.2.7; I.7.0
DOI: 10.1109/UBMK.2019.8907050
Cite as: arXiv:2110.08591 [cs.CL]
  (or arXiv:2110.08591v2 [cs.CL] for this version)

Submission history

From: Zekeriya Anil Guven [view email]
[v1] Sat, 16 Oct 2021 15:26:53 GMT (465kb,D)
[v2] Wed, 20 Oct 2021 08:39:24 GMT (465kb,D)

Link back to: arXiv, form interface, contact.