Statistics Theory
New submissions
[ showing up to 2000 entries per page: fewer  more ]
New submissions for Tue, 7 Feb 23
 [1] arXiv:2302.02247 [pdf, ps, other]

Title: Spectral Density Estimation of FunctionValued Spatial ProcessesComments: 84 pages, 0 figuresSubjects: Statistics Theory (math.ST)
The spectral density function describes the secondorder properties of a stationary stochastic process on $\mathbb{R}^d$. This paper considers the nonparametric estimation of the spectral density of a continuoustime stochastic process taking values in a separable Hilbert space. Our estimator is based on kernel smoothing and can be applied to a wide variety of spatial sampling schemes including those in which data are observed at irregular spatial locations. Thus, it finds immediate applications in Spatial Statistics, where irregularly sampled data naturally arise. The rates for the bias and variance of the estimator are obtained under general conditions in a mixeddomain asymptotic setting. When the data are observed on a regular grid, the optimal rate of the estimator matches the minimax rate for the class of covariance functions that decay according to a power law. The asymptotic normality of the spectral density estimator is also established under general conditions for Gaussian Hilbertspace valued processes. Finally, with a view towards practical applications the asymptotic results are specialized to the case of discretelysampled functional data in a reproducing kernel Hilbert space.
 [2] arXiv:2302.02415 [pdf, ps, other]

Title: On Kronecker Separability of Multiway CovarianceComments: 15 pagesSubjects: Statistics Theory (math.ST); Methodology (stat.ME)
Multiway data analysis is aimed at inferring patterns from data represented as a multidimensional array. Estimating covariance from multiway data is a fundamental statistical task, however, the intrinsic high dimensionality poses significant statistical and computational challenges. Recently, several factorized covariance models, paired with estimation algorithms, have been proposed to circumvent these obstacles. Despite several promising results on the algorithmic front, it remains underexplored whether and when such a model is valid. To address this question, we define the notion of Kroneckerseparable multiway covariance, which can be written as a sum of $r$ tensor products of modewise covariances. The question of whether a given covariance can be represented as a separable multiway covariance is then reduced to an equivalent question about separability of quantum states. Using this equivalence, it follows directly that a generic multiway covariance tends to be nonseparable (even if $r \to \infty$), and moreover, finding its best separable approximation is NPhard. These observations imply that factorized covariance models are restrictive and should be used only when there is a compelling rationale for such a model.
 [3] arXiv:2302.02482 [pdf, other]

Title: Continuously Indexed Graphical ModelsSubjects: Statistics Theory (math.ST); Probability (math.PR); Methodology (stat.ME)
Let $X = \{X_{u}\}_{u \in U}$ be a realvalued Gaussian process indexed by a set $U$. It can be thought of as an undirected graphical model with every random variable $X_{u}$ serving as a vertex. We characterize this graph in terms of the covariance of $X$ through its reproducing kernel property. Unlike other characterizations in the literature, our characterization does not restrict the index set $U$ to be finite or countable, and hence can be used to model the intrinsic dependence structure of stochastic processes in continuous time/space. Consequently, the said characterization is not (and apparently cannot be) of the inversezero type. This poses novel challenges for the problem of recovery of the dependence structure from a sample of independent realizations of $X$, also known as structure estimation. We propose a methodology that circumvents these issues, by targeting the recovery of the underlying graph up to a finite resolution, which can be arbitrarily fine and is limited only by the available sample size. The recovery is shown to be consistent so long as the graph is sufficiently regular in an appropriate sense, and convergence rates are provided. Our methodology is illustrated by simulation and two data analyses.
 [4] arXiv:2302.02497 [pdf, other]

Title: Highdimensional Location Estimation via Norm Concentration for Subgamma VectorsSubjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Probability (math.PR); Machine Learning (stat.ML)
In location estimation, we are given $n$ samples from a known distribution $f$ shifted by an unknown translation $\lambda$, and want to estimate $\lambda$ as precisely as possible. Asymptotically, the maximum likelihood estimate achieves the Cram\'erRao bound of error $\mathcal N(0, \frac{1}{n\mathcal I})$, where $\mathcal I$ is the Fisher information of $f$. However, the $n$ required for convergence depends on $f$, and may be arbitrarily large. We build on the theory using \emph{smoothed} estimators to bound the error for finite $n$ in terms of $\mathcal I_r$, the Fisher information of the $r$smoothed distribution. As $n \to \infty$, $r \to 0$ at an explicit rate and this converges to the Cram\'erRao bound. We (1) improve the prior work for 1dimensional $f$ to converge for constant failure probability in addition to high probability, and (2) extend the theory to highdimensional distributions. In the process, we prove a new bound on the norm of a highdimensional random variable whose 1dimensional projections are subgamma, which may be of independent interest.
 [5] arXiv:2302.02544 [pdf, other]

Title: Sequential change detection via backward confidence sequencesComments: 24 pages, 10 figuresSubjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
We present a simple reduction from sequential estimation to sequential changepoint detection (SCD). In short, suppose we are interested in detecting changepoints in some parameter or functional $\theta$ of the underlying distribution. We demonstrate that if we can construct a confidence sequence (CS) for $\theta$, then we can also successfully perform SCD for $\theta$. This is accomplished by checking if two CSs  one forwards and the other backwards  ever fail to intersect. Since the literature on CSs has been rapidly evolving recently, the reduction provided in this paper immediately solves several old and new change detection problems. Further, our "backward CS", constructed by reversing time, is new and potentially of independent interest. We provide strong nonasymptotic guarantees on the frequency of false alarms and detection delay, and demonstrate numerical effectiveness on several problems.
 [6] arXiv:2302.02613 [pdf, ps, other]

Title: An asymptotic behavior of a finitesection of the optimal causal filterAuthors: Junho YangSubjects: Statistics Theory (math.ST)
We derive an $L_1$bound between the coefficients of the optimal causal filter applied to the datagenerating process and its approximation based on finite sample observations. Here, we assume that the datagenerating process is secondorder stationary with either short or long memory autocovariances. To obtain the $L_1$bound, we first provide an exact expression of the causal filter coefficients and their approximation in terms of the absolute convergent series of the multistep ahead infinite and finite predictor coefficients, respectively. Then, we prove a socalled uniformtype Baxter's inequality to obtain a bound for the difference between the two multistep ahead predictor coefficients (under both short and memory time series). The $L_1$approximation error bound of the causal filter coefficients can be used to evaluate the quality of the predictions of time series through the mean squared error criterion.
 [7] arXiv:2302.02954 [pdf, other]

Title: Maximum likelihood estimator for skew Brownian motion: the convergence rateSubjects: Statistics Theory (math.ST); Probability (math.PR)
We give a thorough description of the asymptotic property of the maximum likelihood estimator (MLE) of the skewness parameter of a Skew Brownian Motion (SBM). Thanks to recent results on the Central Limit Theorem of the rate of convergence of estimators for the SBM, we prove a conjecture left open that the MLE has asymptotically a mixed normal distribution involving the local time with a rate of convergence of order $1/4$. We also give a series expansion of the MLE and study the asymptotic behavior of the score and its derivatives, as well as their variation with the skewness parameter. In particular, we exhibit a specific behavior when the SBM is actually a Brownian motion, and quantify the explosion of the coefficients of the expansion when the skewness parameter is close to $1$ or $1$.
Crosslists for Tue, 7 Feb 23
 [8] arXiv:2302.02200 (crosslist from math.CO) [pdf, other]

Title: Rankbased linkage I: triplet comparisons and oriented simplicial complexesComments: 37 pages, 12 figuresSubjects: Combinatorics (math.CO); Statistics Theory (math.ST)
Rankbased linkage is a new tool for summarizing a collection $S$ of objects according to their relationships. These objects are not mapped to vectors, and ``similarity'' between objects need be neither numerical nor symmetrical. All an object needs to do is rank nearby objects by similarity to itself, using a Comparator which is transitive, but need not be consistent with any metric on the whole set. Call this a ranking system on $S$. Rankbased linkage is applied to the $K$nearest neighbor digraph derived from a ranking system. Computations occur on a 2dimensional abstract oriented simplicial complex whose faces are among the points, edges, and triangles of the line graph of the undirected $K$nearest neighbor graph on $S$. In $S K^2$ steps it builds an edgeweighted linkage graph $(S, \mathcal{L}, \sigma)$ where $\sigma(\{x, y\})$ is called the insway between objects $x$ and $y$. Take $\mathcal{L}_t$ to be the links whose insway is at least $t$, and partition $S$ into components of the graph $(S, \mathcal{L}_t)$, for varying $t$. Rankbased linkage is a functor from a category of outordered digraphs to a category of partitioned sets, with the practical consequence that augmenting the set of objects in a rankrespectful way gives a fresh clustering which does not ``rip apart`` the previous one. The same holds for single linkage clustering in the metric space context, but not for typical optimizationbased methods. Open combinatorial problems are presented in the last section.
 [9] arXiv:2302.02254 (crosslist from stat.CO) [pdf, other]

Title: Getting to "rateoptimal'' in ranking & selectionJournalref: Proceedings of the 2021 Winter Simulation ConferenceSubjects: Computation (stat.CO); Statistics Theory (math.ST)
In their 2004 seminal paper, Glynn and Juneja formally and precisely established the rateoptimal, probabilityofincorrectselection, replication allocation scheme for selecting the best of k simulated systems. In the case of independent, normally distributed outputs this allocation has a simple form that depends in an intuitively appealing way on the true means and variances. Of course the means and (typically) variances are unknown, but the rateoptimal allocation provides a target for implementable, dynamic, datadriven policies to achieve. In this paper we compare the empirical behavior of four related replicationallocation policies: mCEI from Chen and Rzyhov and our new gCEI policy that both converge to the Glynn and Juneja allocation; AOMAP from Peng and Fu that converges to the OCBA optimal allocation; and TTTS from Russo that targets the rate of convergence of the posterior probability of incorrect selection. We find that these policies have distinctly different behavior in some settings.
 [10] arXiv:2302.02486 (crosslist from stat.ME) [pdf, other]

Title: The DifferenceofLogNormals Distribution: Properties, Estimation, and GrowthAuthors: Robert ParhamSubjects: Methodology (stat.ME); Statistics Theory (math.ST); General Finance (qfin.GN)
This paper describes the DifferenceofLogNormals (DLN) distribution. A companion paper makes the case that the DLN is a fundamental distribution in nature, and shows how a simple application of the CLT gives rise to the DLN in many disparate phenomena. Here, I characterize its PDF, CDF, moments, and parameter estimators; generalize it to Ndimensions using spherical distribution theory; describe methods to deal with its signature ``doubleexponential'' nature; and use it to generalize growth measurement to possiblynegative variates distributing DLN. I also conduct MonteCarlo experiments to establish some properties of the estimators and measures described.
 [11] arXiv:2302.02774 (crosslist from stat.ML) [pdf, other]

Title: The SSL Interplay: Augmentations, Inductive Bias, and GeneralizationSubjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Statistics Theory (math.ST)
Selfsupervised learning (SSL) has emerged as a powerful framework to learn representations from raw data without supervision. Yet in practice, engineers face issues such as instability in tuning optimizers and collapse of representations during training. Such challenges motivate the need for a theory to shed light on the complex interplay between the choice of data augmentation, network architecture, and training algorithm. We study such an interplay with a precise analysis of generalization performance on both pretraining and downstream tasks in a theory friendly setup, and highlight several insights for SSL practitioners that arise from our theory.
 [12] arXiv:2302.02988 (crosslist from cs.LG) [pdf, other]

Title: Asymptotically Minimax Optimal FixedBudget Best Arm Identification for Expected Simple Regret MinimizationSubjects: Machine Learning (cs.LG); Econometrics (econ.EM); Statistics Theory (math.ST); Methodology (stat.ME); Machine Learning (stat.ML)
We investigate fixedbudget best arm identification (BAI) for expected simple regret minimization. In each round of an adaptive experiment, a decision maker draws one of multiple treatment arms based on past observations and subsequently observes the outcomes of the chosen arm. After the experiment, the decision maker recommends a treatment arm with the highest projected outcome. We evaluate this decision in terms of the expected simple regret, a difference between the expected outcomes of the best and recommended treatment arms. Due to the inherent uncertainty, we evaluate the regret using the minimax criterion. For distributions with fixed variances (locationshift models), such as Gaussian distributions, we derive asymptotic lower bounds for the worstcase expected simple regret. Then, we show that the Random Sampling (RS)Augmented Inverse Probability Weighting (AIPW) strategy proposed by Kato et al. (2022) is asymptotically minimax optimal in the sense that the leading factor of its worstcase expected simple regret asymptotically matches our derived worstcase lower bound. Our result indicates that, for locationshift models, the optimal RSAIPW strategy draws treatment arms with varying probabilities based on their variances. This result contrasts with the results of Bubeck et al. (2011), which shows that drawing each treatment arm with an equal ratio is minimax optimal in a bounded outcome setting.
Replacements for Tue, 7 Feb 23
 [13] arXiv:2109.02959 (replaced) [pdf, other]

Title: Fast approximations of pseudoobservations in the context of rightcensoring and intervalcensoringAuthors: Olivier Bouaziz (MAP5  UMR 8145)Subjects: Statistics Theory (math.ST)
 [14] arXiv:2207.00357 (replaced) [pdf, other]

Title: Efficient parameter estimation for parabolic SPDEs based on a loglinear model for realized volatilitiesSubjects: Statistics Theory (math.ST)
 [15] arXiv:2207.08038 (replaced) [pdf, other]

Title: A Singular Woodbury and PseudoDeterminant Matrix Identities and Application to Gaussian Process RegressionSubjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Numerical Analysis (math.NA); Computation (stat.CO)
 [16] arXiv:2209.05153 (replaced) [pdf, other]

Title: The test of exponentiality based on the mean residual life function revisitedAuthors: Bruno EbnerComments: 16 pages, 1 figure, 5 tablesSubjects: Statistics Theory (math.ST)
 [17] arXiv:2209.07791 (replaced) [pdf, ps, other]

Title: Maximum likelihood estimation and prediction error for a Mat{é}rn model on the circleAuthors: Sébastien Petit (L2S, LNE )Subjects: Statistics Theory (math.ST)
 [18] arXiv:2101.02094 (replaced) [pdf, ps, other]

Title: BernsteinType Bounds for Beta DistributionAuthors: Maciej SkorskiComments: major revision  fixed a mistake in the proofSubjects: Probability (math.PR); Statistics Theory (math.ST); Applications (stat.AP)
 [19] arXiv:2109.09367 (replaced) [pdf, other]

Title: Extending Bootstrap AMG for Clustering of Attributed GraphsComments: 32 pages, 12 figures, preprintSubjects: Machine Learning (cs.LG); Numerical Analysis (math.NA); Statistics Theory (math.ST)
 [20] arXiv:2111.03289 (replaced) [pdf, ps, other]

Title: Improved Regret Analysis for VarianceAdaptive Linear Bandits and HorizonFree Linear Mixture MDPsComments: accepted to neurips'22Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
 [21] arXiv:2202.03835 (replaced) [pdf, other]

Title: A covariant, discrete timefrequency representation tailored for zerobased signal detectionComments: Accepted for publication in IEEE Transactions on Signal Processing on May, 26, 2022Subjects: Signal Processing (eess.SP); Statistics Theory (math.ST); Methodology (stat.ME)
 [22] arXiv:2204.08964 (replaced) [pdf, other]

Title: Adaptive measurement filter: efficient strategy for optimal estimation of quantum Markov chainsComments: 25 pages 7 figuresSubjects: Quantum Physics (quantph); Mathematical Physics (mathph); Statistics Theory (math.ST)
 [23] arXiv:2206.02659 (replaced) [pdf, other]

Title: Robust FineTuning of Deep Neural Networks with Hessianbased Generalization GuaranteesComments: 36 pages, 5 figures, 8 tables (Fixed typos). ICML 2022Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Machine Learning (stat.ML)
 [24] arXiv:2206.14275 (replaced) [pdf, other]

Title: Dynamic CoVaR ModelingSubjects: Econometrics (econ.EM); Statistics Theory (math.ST); Risk Management (qfin.RM); Methodology (stat.ME)
 [25] arXiv:2207.02287 (replaced) [pdf, other]

Title: Branching Processes in Random Environments with ThresholdsComments: 47 pages, 3 figures, 5 tablesSubjects: Probability (math.PR); Statistics Theory (math.ST)
 [26] arXiv:2207.14088 (replaced) [pdf, other]

Title: On the Sequential Probability Ratio Test in Hidden Markov ModelsComments: 28 pages, 10 figures, submitted to CONCUR 2022Subjects: Probability (math.PR); Logic in Computer Science (cs.LO); Statistics Theory (math.ST)
 [27] arXiv:2208.00959 (replaced) [pdf, other]

Title: HUG model: an interaction point process for Bayesian detection of multiple sources in groundwaters from hydrochemical dataAuthors: Christophe Reype (IECL, PASTA), Radu S. Stoica (IECL, PASTA), Antonin Richard, Madalina Deaconu (IECL, PASTA)Subjects: Applications (stat.AP); Statistics Theory (math.ST); Methodology (stat.ME)
 [28] arXiv:2209.12651 (replaced) [pdf, other]

Title: Learning Variational Models with Unrolling and Bilevel OptimizationSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
 [29] arXiv:2210.00895 (replaced) [pdf, other]

Title: On BestArm Identification with a Fixed Budget in NonParametric MultiArmed BanditsAuthors: Antoine Barrier (UMPAENSL, LMO, CELESTE), Aurélien Garivier (UMPAENSL, LIP), Gilles Stoltz (LMO, CELESTE)Journalref: ALT 2023  The 34th International Conference on Algorithmic Learning Theory, Feb 2023, Singapour, SingaporeSubjects: Machine Learning (cs.LG); Information Theory (cs.IT); Statistics Theory (math.ST); Machine Learning (stat.ML)
 [30] arXiv:2211.14908 (replaced) [pdf, other]

Title: A Permutationfree Kernel TwoSample TestComments: Published at the Thirtysixth Conference on Neural Information Processing Systems (NeurIPS), with an oral presentationSubjects: Methodology (stat.ME); Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
 [31] arXiv:2212.09178 (replaced) [pdf, ps, other]

Title: Support Vector Regression: Risk Quadrangle FrameworkSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
[ showing up to 2000 entries per page: fewer  more ]
Disable MathJax (What is MathJax?)
Links to: arXiv, form interface, find, math, recent, 2302, contact, help (Access key information)