New submissions

New submissions for Thu, 13 Aug 20

[1]  arXiv:2008.05109 [pdf, other]
Title: A Bayesian Approach to Spherical Factor Analysis for Binary Data
Subjects: Methodology (stat.ME); Applications (stat.AP); Other Statistics (stat.OT)

Factor models are widely used across diverse areas of application for purposes that include dimensionality reduction, covariance estimation, and feature engineering. Traditional factor models can be seen as an instance of linear embedding methods that project multivariate observations onto a lower-dimensional Euclidean latent space. This paper discusses a new class of geometric embedding models for multivariate binary data in which the embedding space corresponds to a spherical manifold, with potentially unknown dimension. The resulting models include traditional factor models as a special case, but provide additional flexibility. Furthermore, unlike other techniques for geometric embedding, the models are easy to interpret, and the uncertainty associated with the latent features can be properly quantified. These advantages are illustrated using both simulation studies and real data on voting records from the U.S. Senate.

[2]  arXiv:2008.05191 [pdf, other]
Title: Log-concave Ridge Estimation
Authors: Christof Strähl
Comments: 29 pages; part of the author's PhD dissertation
Subjects: Methodology (stat.ME)

We develop a density ridge search algorithm based on a novel density ridge definition. This definition is based on a conditional variance matrix and the mode in the lower dimensional subspace. It is compared to the subspace constraint mean shift algorithm, based on the gradient and Hessian of the underlying probability density function. We show the advantages of the new algorithm in a simulation study and estimate galaxy filaments from a data set of the Baryon Oscillation Spectroscopic Survey.

[3]  arXiv:2008.05338 [pdf, other]
Title: A presmoothing approach for estimation in mixture cure models
Subjects: Methodology (stat.ME)

A challenge when dealing with survival analysis data is accounting for a cure fraction, meaning that some subjects will never experience the event of interest. Mixture cure models have been frequently used to estimate both the probability of being cured and the time to event for the susceptible subjects, usually by assuming a parametric (logistic) form of the incidence. We propose a new estimation procedure for a parametric cure rate that relies on a preliminary smooth estimator and is independent of the model assumed for the latency. We investigate the theoretical properties of the estimators and show through simulations that, in the logistic/Cox model, presmoothing leads to more accurate results than the maximum likelihood estimator. To illustrate the practical use, we apply the new estimation procedure to two studies of melanoma survival data.

Cross-lists for Thu, 13 Aug 20

[4]  arXiv:2008.05021 (cross-list from stat.CO) [pdf, other]
Title: A Fast, Scalable, and Calibrated Computer Model Emulator: An Empirical Bayes Approach
Subjects: Computation (stat.CO); Methodology (stat.ME)

Mathematical models implemented on a computer have become the driving force behind the acceleration of the cycle of scientific processes. This is because computer models are typically much faster and more economical to run than physical experiments. In this work, we develop an empirical Bayes approach to predictions of physical quantities using a computer model, where we assume that the computer model under consideration needs to be calibrated and is computationally expensive. We propose a Gaussian process emulator and a Gaussian process model for the systematic discrepancy between the computer model and the underlying physical process. This allows for closed-form and easy-to-compute predictions given by a conditional distribution induced by the Gaussian processes. We provide a rigorous theoretical justification of the proposed approach by establishing posterior consistency of the estimated physical process. The computational efficiency of the methods is demonstrated in an extensive simulation study and a real data example. The newly established approach makes enhanced use of computer models both from practical and theoretical standpoints.

[5]  arXiv:2008.05337 (cross-list from cs.SI) [pdf, other]
Title: Inference of a universal social scale and segregation measures using social connectivity kernels
Comments: Article: 23 pages, 3 figures. Supplementary material: 8 pages, 1 figure
Subjects: Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph); Methodology (stat.ME)

How people connect with one another is a fundamental question in the social sciences, and the resulting social networks can have a profound impact on our daily lives. Blau offered a powerful explanation: people connect with one another based on their positions in a social space. Yet a principled measure of social distance, allowing comparison within and between societies, remains elusive.
We use the connectivity kernel of conditionally-independent edge models to develop a family of segregation statistics with desirable properties: they offer an intuitive and universal characteristic scale on social space (facilitating comparison across datasets and societies), are applicable to multivariate and mixed node attributes, and capture segregation at the level of individuals, pairs of individuals, and society as a whole. We show that the segregation statistics can induce a metric on Blau space (a space spanned by the attributes of the members of society) and provide maps of two societies.
Under a Bayesian paradigm, we infer the parameters of the connectivity kernel from eleven ego-network datasets collected in four surveys in the United Kingdom and United States. The importance of different dimensions of Blau space is similar across time and location, suggesting a macroscopically stable social fabric. Physical separation and age differences have the most significant impact on segregation within friendship networks with implications for intergenerational mixing and isolation in later stages of life.

Replacements for Thu, 13 Aug 20

[6]  arXiv:1911.11709 (replaced) [pdf, other]
Title: Maximum likelihood estimation of regularisation parameters in high-dimensional inverse problems: an empirical Bayesian approach. Part I: Methodology and Experiments
Comments: 37 pages - SIIMS 2020
Subjects: Methodology (stat.ME); Computation (stat.CO)
[7]  arXiv:2002.01040 (replaced) [pdf, other]
Title: Scale mixture of skew-normal linear mixed models with within-subject serial dependence
Subjects: Methodology (stat.ME); Statistics Theory (math.ST)
[8]  arXiv:2005.03496 (replaced) [pdf, other]
Title: Modeling High-Dimensional Unit-Root Time Series
Comments: 45 pages, 11 figures. arXiv admin note: text overlap with arXiv:1808.07932
Subjects: Methodology (stat.ME); Econometrics (econ.EM)
[9]  arXiv:2008.04692 (replaced) [pdf, other]
Title: Test for mean matrix in GMANOVA model under heteroscedasticity and non-normality for high-dimensional data
Comments: Supplementary is available as ancillary files
Subjects: Methodology (stat.ME)
[10]  arXiv:1710.00915 (replaced) [pdf, other]
Title: Change Acceleration and Detection
Subjects: Statistics Theory (math.ST); Methodology (stat.ME)
[11]  arXiv:2007.02189 (replaced) [pdf, other]
Title: The joint survival signature of coherent systems with shared components
Comments: 14 pages, 4 figures
Subjects: Statistics Theory (math.ST); Methodology (stat.ME)