References & Citations
Mathematics > Statistics Theory
Title: PanIC: consistent information criteria for general model selection problems
(Submitted on 7 Mar 2023 (v1), last revised 22 Apr 2024 (this version, v3))
Abstract: Model selection is a ubiquitous problem that arises in the application of many statistical and machine learning methods. In the likelihood and related settings, it is typical to use the method of information criteria (IC) to choose the most parsimonious among competing models by penalizing the likelihood-based objective function. Theorems guaranteeing the consistency of IC can often be difficult to verify and are often specific and bespoke. We present a set of results that guarantee consistency for a class of IC, which we call PanIC (from the Greek root 'pan', meaning 'of everything'), with easily verifiable regularity conditions. The PanIC are applicable in any loss-based learning problem and are not exclusive to likelihood problems. We illustrate the verification of regularity conditions for model selection problems regarding finite mixture models, least absolute deviation and support vector regression, and principal component analysis, and we demonstrate the effectiveness of the PanIC for such problems via numerical simulations. Furthermore, we present new sufficient conditions for the consistency of BIC-like estimators and provide comparisons of the BIC to PanIC.
Submission history
From: Hien Nguyen [view email][v1] Tue, 7 Mar 2023 04:37:45 GMT (26kb)
[v2] Thu, 9 Mar 2023 22:48:47 GMT (26kb)
[v3] Mon, 22 Apr 2024 02:06:30 GMT (27kb)
Link back to: arXiv, form interface, contact.