Title: The semi-hierarchical Dirichlet Process and its application to clustering homogeneous distributions

Abstract: Assessing homogeneity of distributions is an old problem that has received considerable attention, especially in the nonparametric Bayesian literature. To this end, we propose the semi-hierarchical Dirichlet process, a novel hierarchical prior that extends the hierarchical Dirichlet process of Teh et al. (2006) and avoids the degeneracy issues of nested processes recently described by Camerlenghi et al. (2019a). We go beyond a simple yes/no answer to the homogeneity question and embed the proposed prior in a random partition model. This allows us to give a more comprehensive answer: when I ≥ 2 populations are considered, we can identify groups of populations that are internally homogeneous. We study theoretical properties of the semi-hierarchical Dirichlet process and of the Bayes factor for the homogeneity test when I = 2. Extensive simulation studies and applications to educational data are also discussed.
Subjects: Methodology (stat.ME)
Cite as: arXiv:2005.10287 [stat.ME]
  (or arXiv:2005.10287v4 [stat.ME] for this version)

Submission history

From: Mario Beraha
[v1] Wed, 20 May 2020 18:10:13 GMT (2782kb,D)
[v2] Thu, 17 Dec 2020 17:20:07 GMT (3005kb,D)
[v3] Wed, 31 Mar 2021 08:15:54 GMT (2080kb,D)
[v4] Wed, 16 Jun 2021 15:42:52 GMT (2081kb,D)
