Title: Hierarchical Dirichlet Scaling Process

Abstract: We present the hierarchical Dirichlet scaling process (HDSP), a Bayesian nonparametric mixed membership model. The HDSP generalizes the hierarchical Dirichlet process (HDP) to model the correlation structure between metadata in the corpus and mixture components. We construct the HDSP from the normalized gamma representation of the Dirichlet process; this construction allows us to incorporate a scaling function that controls the membership probabilities of the mixture components. We develop two scaling methods to demonstrate that different modeling assumptions can be expressed in the HDSP, and we derive the corresponding approximate posterior inference algorithms using variational Bayes. Through experiments on datasets of newswire, medical journal articles, conference proceedings, and product reviews, we show that the HDSP achieves better predictive performance than labeled LDA, partially labeled LDA, and the author-topic model, and better negative review classification performance than the supervised topic model and SVM.
Subjects: Machine Learning (cs.LG)
DOI: 10.1007/s10994-016-5621-5
Cite as: arXiv:1404.1282 [cs.LG]
  (or arXiv:1404.1282v3 [cs.LG] for this version)

Submission history

From: Dongwoo Kim
[v1] Sat, 22 Mar 2014 06:25:51 GMT (650kb,D)
[v2] Mon, 12 May 2014 02:59:57 GMT (4726kb,D)
[v3] Wed, 11 Feb 2015 05:17:27 GMT (5816kb,D)
