Statistics Theory

New submissions

New submissions for Tue, 4 Aug 20

[1]  arXiv:2008.00130 [pdf, ps, other]
Title: A New Class of Multivariate Elliptically Contoured Distributions with Inconsistency Property
Comments: 34 pages
Subjects: Statistics Theory (math.ST); Probability (math.PR)

We introduce a new class of multivariate elliptically symmetric distributions that includes elliptically symmetric logistic distributions and Kotz type distributions. We investigate various probabilistic properties, including marginal distributions, conditional distributions, linear transformations, characteristic functions and dependence measures, from the perspective of the inconsistency property. In addition, we provide a real data example to show that the new distributions have reasonable flexibility.

[2]  arXiv:2008.00242 [pdf, ps, other]
Title: Posterior Impropriety of some Sparse Bayesian Learning Models
Comments: 13 pages
Subjects: Statistics Theory (math.ST)

Sparse Bayesian learning models are typically used for prediction in datasets with a significantly greater number of covariates than observations. Among the class of sparse Bayesian learning models, the relevance vector machine (RVM) is very popular. Its popularity is demonstrated by the large number of citations of the original RVM paper of Tipping (2001) [JMLR, 1, 211-244]. In this article we show that RVM and some other sparse Bayesian learning models with hyperparameter values currently used in the literature are based on improper posteriors. Further, we provide necessary and sufficient conditions for posterior propriety of RVM.

[3]  arXiv:2008.00683 [pdf, ps, other]
Title: On Bayesian Estimation of Densities and Sampling Distributions: the Posterior Predictive Distribution as the Bayes Estimator
Authors: A.G. Nogales
Subjects: Statistics Theory (math.ST)

Optimality results for two outstanding Bayesian estimation problems are given in this paper: the estimation of the sampling distribution for the squared total variation function and the estimation of the density for the $L^1$-squared loss function. The posterior predictive distribution provides the solution to these problems. Some examples are presented to illustrate it. The Bayesian estimation problem of a distribution function is also addressed.
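
As a minimal illustration of the claim above (not taken from the paper), consider a Bernoulli model with a Beta prior. The total variation distance between Bernoulli(theta) and Bernoulli(q) is |theta - q|, so the squared total variation loss reduces to squared error, and the posterior predictive success probability should minimize the posterior risk. The model, the counts k = 7 and n = 10, the uniform prior and all function names are assumptions chosen for this sketch.

```python
import math

def beta_pdf(theta, a, b):
    """Density of Beta(a, b) at theta in (0, 1), computed via log-gamma."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return math.exp(log_norm + (a - 1) * math.log(theta) + (b - 1) * math.log(1 - theta))

def posterior_predictive(k, n, a=1.0, b=1.0):
    """P(next observation = 1 | k successes in n trials) under a Beta(a, b) prior."""
    return (a + k) / (a + b + n)

def posterior_risk(q, k, n, a=1.0, b=1.0, grid=2000):
    """Posterior expectation of the squared total variation distance between
    Bernoulli(theta) and Bernoulli(q), approximated with a midpoint rule."""
    a_post, b_post = a + k, b + n - k
    total = 0.0
    for i in range(grid):
        theta = (i + 0.5) / grid
        total += (theta - q) ** 2 * beta_pdf(theta, a_post, b_post) / grid
    return total

# Hypothetical data: 7 successes in 10 trials, uniform Beta(1, 1) prior.
k, n = 7, 10
q_star = posterior_predictive(k, n)

# Brute-force search over candidate estimates q on a grid of step 0.01.
candidates = [i / 100 for i in range(1, 100)]
best = min(candidates, key=lambda q: posterior_risk(q, k, n))

print(f"posterior predictive: {q_star:.4f}, grid minimizer of the risk: {best:.2f}")
```

In this special case the grid minimizer agrees with the posterior predictive probability up to the grid resolution; the paper's results concern far more general sampling distributions and densities.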

[4]  arXiv:2008.00847 [pdf, other]
Title: On Dantzig and Lasso estimators of the drift in a high dimensional Ornstein-Uhlenbeck model
Subjects: Statistics Theory (math.ST)

In this paper we present new theoretical results for the Dantzig and Lasso estimators of the drift in a high dimensional Ornstein-Uhlenbeck model under sparsity constraints. Our focus is on oracle inequalities for both estimators and error bounds with respect to several norms. In the context of the Lasso estimator our paper is strongly related to [11], who investigated the same problem under row sparsity. We improve their rates and also prove the restricted eigenvalue property solely under an ergodicity assumption on the model. Finally, we present a numerical analysis to assess the finite sample performance of the Dantzig and Lasso estimators.

[5]  arXiv:2008.01006 [pdf, other]
Title: Gibbs sampler and coordinate ascent variational inference: a set-theoretical review
Authors: Se Yoon Lee
Subjects: Statistics Theory (math.ST)

We clarify that the Gibbs sampler and coordinate ascent variational inference can be explained more generally from a set-theoretical point of view. This is an immediate consequence of a duality formula for variational inference.
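
For readers unfamiliar with the first of the two algorithms, the following is a minimal sketch of a textbook Gibbs sampler, not the set-theoretical formulation of the paper. The target (a standard bivariate normal with correlation rho), the seed, the burn-in length and the function names are all assumptions made for this example. Each sweep draws one coordinate from its full conditional given the other, which for this target is N(rho * other, 1 - rho^2).

```python
import random

def gibbs_bivariate_normal(rho, n_samples, burn_in=500, seed=0):
    """Gibbs sampler for a standard bivariate normal with correlation rho."""
    rng = random.Random(seed)
    cond_sd = (1.0 - rho * rho) ** 0.5  # standard deviation of each full conditional
    x, y = 0.0, 0.0
    draws = []
    for i in range(burn_in + n_samples):
        x = rng.gauss(rho * y, cond_sd)  # draw x | y
        y = rng.gauss(rho * x, cond_sd)  # draw y | x
        if i >= burn_in:
            draws.append((x, y))
    return draws

def sample_correlation(draws):
    """Pearson correlation of the sampled pairs."""
    n = len(draws)
    mx = sum(x for x, _ in draws) / n
    my = sum(y for _, y in draws) / n
    sxy = sum((x - mx) * (y - my) for x, y in draws)
    sxx = sum((x - mx) ** 2 for x, _ in draws)
    syy = sum((y - my) ** 2 for _, y in draws)
    return sxy / (sxx * syy) ** 0.5

draws = gibbs_bivariate_normal(rho=0.8, n_samples=20000)
est = sample_correlation(draws)
print(f"estimated correlation: {est:.3f}")
```

The estimated correlation should be close to the target value 0.8, with some Monte Carlo error because successive Gibbs draws are autocorrelated.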

Cross-lists for Tue, 4 Aug 20

[6]  arXiv:2008.00043 (cross-list from math.CO) [pdf, ps, other]
Title: Generalized Cut Polytopes for Binary Hierarchical Models
Comments: 28 pages, 4 figures
Subjects: Combinatorics (math.CO); Statistics Theory (math.ST)

Marginal polytopes are important geometric objects that arise in statistics as the polytopes underlying hierarchical log-linear models. These polytopes can be used to answer geometric questions about these models, such as determining the existence of maximum likelihood estimates or the normality of the associated semigroup. Cut polytopes of graphs have been useful in analyzing binary marginal polytopes in the case where the simplicial complex underlying the hierarchical model is a graph. We introduce a generalized cut polytope that is isomorphic to the binary marginal polytope of an arbitrary simplicial complex via a generalized covariance map. This polytope is full dimensional in its ambient space and has a natural switching operation among its facets that can be used to deduce symmetries between the facets of the correlation and binary marginal polytopes. We find complete H-representations of the generalized cut polytope for some important families of simplicial complexes. We also compute the volume of these polytopes in some instances.

[7]  arXiv:2008.00520 (cross-list from cs.AI) [pdf, other]
Title: Statistical Inference of Minimally Complex Models
Comments: 20 pages, 5 figures
Subjects: Artificial Intelligence (cs.AI); Statistics Theory (math.ST); Data Analysis, Statistics and Probability (physics.data-an); Quantitative Methods (q-bio.QM)

Finding the best model that describes a high dimensional dataset is a daunting task. For binary data, we show that this becomes feasible if the search is restricted to simple models. These models -- that we call Minimally Complex Models (MCMs) -- are simple because they are composed of independent components of minimal complexity, in terms of description length. Simple models are easy to infer and to sample from. In addition, model selection within the class of MCMs is invariant with respect to changes in the representation of the data. They portray the structure of dependencies among variables in a simple way. They provide robust predictions on dependencies and symmetries, as illustrated in several examples. MCMs may contain interactions between variables of any order. So, for example, our approach reveals whether a dataset is appropriately described by a pairwise interaction model.

[8]  arXiv:2008.00848 (cross-list from stat.ME) [pdf, other]
Title: A monotonicity property of weighted log-rank tests
Subjects: Methodology (stat.ME); Statistics Theory (math.ST)

The logrank test is a well-known nonparametric test which is often used to compare the survival distributions of two samples including right censored observations; it is also known as the Mantel-Haenszel test. The $G^{\rho}$ family of tests, introduced by Harrington and Fleming (1982), generalizes the logrank test by using weights assigned to observations. In this paper, we present a monotonicity property for the $G^{\rho}$ family of tests, which was motivated by the need to derive bounds for the test statistic in case of imprecise data observations.
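
The statistic discussed above can be sketched as follows, assuming the usual two-sample form of the $G^{\rho}$ family: at each distinct event time the difference between observed and expected events in group 1 is weighted by $\hat S(t-)^{\rho}$, where $\hat S$ is the pooled Kaplan-Meier estimate, and the sum is standardized by the hypergeometric variance. Setting rho = 0 gives the ordinary logrank test. The function name and the toy data are assumptions for this sketch, and the paper's monotonicity property itself is not reproduced here.

```python
def g_rho_statistic(times, events, groups, rho=0.0):
    """Standardized two-sample G^rho statistic.

    times  : observed times
    events : 1 for an observed event, 0 for a right-censored observation
    groups : group labels, 0 or 1
    """
    event_times = sorted({t for t, e in zip(times, events) if e == 1})
    surv = 1.0  # pooled Kaplan-Meier estimate just before the current event time
    num, var = 0.0, 0.0
    for t in event_times:
        at_risk = [(g, e, s) for s, e, g in zip(times, events, groups) if s >= t]
        y = len(at_risk)                                    # total number at risk
        y1 = sum(1 for g, _, _ in at_risk if g == 1)        # at risk in group 1
        d = sum(1 for _, e, s in at_risk if e == 1 and s == t)               # events at t
        d1 = sum(1 for g, e, s in at_risk if g == 1 and e == 1 and s == t)   # group 1 events at t
        w = surv ** rho
        num += w * (d1 - y1 * d / y)                        # weighted observed minus expected
        if y > 1:
            var += w * w * y1 * (y - y1) * d * (y - d) / (y * y * (y - 1))
        surv *= 1.0 - d / y                                 # update the Kaplan-Meier estimate
    return num / var ** 0.5 if var > 0 else 0.0

# Toy data (hypothetical): group 1 fails earlier than group 0, with no censoring.
times = [1, 2, 3, 4, 5, 6]
events = [1, 1, 1, 1, 1, 1]
groups = [1, 1, 1, 0, 0, 0]

z_logrank = g_rho_statistic(times, events, groups, rho=0.0)
z_weighted = g_rho_statistic(times, events, groups, rho=1.0)
print(f"logrank z: {z_logrank:.3f}, G^1 z: {z_weighted:.3f}")
```

A positive value indicates more events in group 1 than expected under equal survival; with rho > 0 the weights decrease over time, so early differences count more.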

[9]  arXiv:2008.01036 (cross-list from cs.LG) [pdf, other]
Title: Multiple Descent: Design Your Own Generalization Curve
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)

This paper explores the generalization loss of linear regression in variably parameterized families of models, both under-parameterized and over-parameterized. We show that the generalization curve can have an arbitrary number of peaks, and moreover, that the locations of those peaks can be explicitly controlled. Our results highlight the fact that both the classical U-shaped generalization curve and the recently observed double descent curve are not intrinsic properties of the model family. Instead, their emergence is due to the interaction between the properties of the data and the inductive biases of learning algorithms.

Replacements for Tue, 4 Aug 20

[10]  arXiv:1608.00033 (replaced) [pdf, ps, other]
Title: Locally Robust Semiparametric Estimation
Subjects: Statistics Theory (math.ST); Econometrics (econ.EM)
[11]  arXiv:1805.08342 (replaced) [pdf, other]
Title: Nearest neighbor density functional estimation from inverse Laplace transform
Comments: 53 pages, 4 figures. Submitted to the IEEE Transactions on Information Theory
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Methodology (stat.ME); Machine Learning (stat.ML)
[12]  arXiv:1807.03829 (replaced) [pdf, other]
Title: A theoretical framework of the scaled Gaussian stochastic process in prediction and calibration
Subjects: Statistics Theory (math.ST)
[13]  arXiv:1910.04355 (replaced) [pdf, other]
Title: Adaptive Variational Bayesian Inference for Sparse Deep Neural Network
Subjects: Statistics Theory (math.ST)
[14]  arXiv:2002.04898 (replaced) [pdf, other]
Title: Asymptotics for M-type smoothing splines with non-smooth objective functions
Subjects: Statistics Theory (math.ST)
[15]  arXiv:2002.06956 (replaced) [pdf, other]
Title: Density estimation using Dirichlet kernels
Comments: 39 pages, 1 figure. v2: intro added, some proofs were fixed
Subjects: Statistics Theory (math.ST); Probability (math.PR)
[16]  arXiv:2003.05619 (replaced) [pdf, ps, other]
Title: On uniform consistency of nonparametric tests I
Authors: Mikhail Ermakov
Comments: 42 pages. arXiv admin note: text overlap with arXiv:1807.09076
Subjects: Statistics Theory (math.ST)
[17]  arXiv:2004.07039 (replaced) [pdf, ps, other]
Title: On uniform consistency of nonparametric tests II
Authors: Mikhail Ermakov
Comments: 18 pages
Subjects: Statistics Theory (math.ST)
[18]  arXiv:2006.04499 (replaced) [pdf, other]
Title: Cointegration and unit root tests: A fully Bayesian approach
Subjects: Statistics Theory (math.ST)
[19]  arXiv:2007.09349 (replaced) [pdf, ps, other]
Title: Explicit expressions for joint moments of $n$-dimensional elliptical distributions
Comments: 20 pages
Subjects: Statistics Theory (math.ST); Risk Management (q-fin.RM)
[20]  arXiv:2007.13482 (replaced) [pdf, ps, other]
Title: Equilibrium in Wright-Fisher models of population genetics
Comments: 6 pages, a genetic Wright-Fisher model is considered as a multivariate statistical experiment which has a representation as a Discrete Markov Diffusion
Journal-ref: Kibernetika i Sistemnyi Analiz, No. 2, March-April, 2019, pp. 96-101
Subjects: Statistics Theory (math.ST)
[21]  arXiv:1906.09855 (replaced) [pdf, other]
Title: Universal Bayes consistency in metric spaces
Comments: arXiv admin note: text overlap with arXiv:1705.08184
Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
[22]  arXiv:1909.03488 (replaced) [pdf, other]
Title: Probabilistic Convergence and Stability of Random Mapper Graphs
Subjects: Algebraic Topology (math.AT); Computational Geometry (cs.CG); Probability (math.PR); Statistics Theory (math.ST)
[23]  arXiv:1910.14167 (replaced) [pdf, ps, other]
Title: Phase Transitions for Detecting Latent Geometry in Random Graphs
Comments: 62 pages
Subjects: Probability (math.PR); Information Theory (cs.IT); Social and Information Networks (cs.SI); Statistics Theory (math.ST)
[24]  arXiv:1911.05401 (replaced) [pdf, other]
Title: Tropical Optimal Transport and Wasserstein Distances
Comments: 34 pages, 8 figures, 3 tables
Subjects: Optimization and Control (math.OC); Metric Geometry (math.MG); Statistics Theory (math.ST)
[25]  arXiv:1912.11914 (replaced) [pdf, other]
Title: Inverses of Matern Covariances on Grids
Authors: Joseph Guinness
Subjects: Computation (stat.CO); Statistics Theory (math.ST); Machine Learning (stat.ML)
[26]  arXiv:2007.08936 (replaced) [pdf, ps, other]
Title: Asymptotic Behaviour of the Empirical Distance Covariance for Dependent Data
Authors: Marius Kroll
Comments: 21 pages; Added references
Subjects: Probability (math.PR); Statistics Theory (math.ST)
