We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.CO

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Computation

Title: Estimation and prediction in sparse and unbalanced tables

Abstract: We consider the problem where we have a multi-way table of means, indexed by several factors, where each factor can have a large number of levels. The entry in each cell is the mean of some response, averaged over the observations falling into that cell. Some cells may be very sparsely populated, and in extreme cases, not populated at all. We might still like to estimate an expected response in such cells. We propose here a novel hierarchical ANOVA (HANOVA) representation for such data. Sparse cells will lean more on the lower-order interaction model for the data. These in turn could have components that are poorly represented in the data, in which case they rely on yet lower-order models. Our approach leads to a simple hierarchical algorithm, requiring repeated calculations of sub-table means of modified counts. The algorithm has shown superiority over the unshrinked methods in both simulations and real data sets.
Comments: 14 pages, 1 figure
Subjects: Computation (stat.CO)
Cite as: arXiv:1703.02081 [stat.CO]
  (or arXiv:1703.02081v1 [stat.CO] for this version)

Submission history

From: Qingyuan Zhao [view email]
[v1] Mon, 6 Mar 2017 19:44:45 GMT (256kb,D)

Link back to: arXiv, form interface, contact.