We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: On the Limit Imbalanced Logistic Regression by Binary Predictors

Authors: Vincent Runge
Abstract: In this work, we introduce a modified (rescaled) likelihood for imbalanced logistic regression. This new approach makes easier the use of exponential priors and the computation of lasso regularization path. Precisely, we study a limiting behavior for which class imbalance is artificially increased by replication of the majority class observations. If some strong overlap conditions are satisfied, the maximum likelihood estimate converges towards a finite value close to the initial one (intercept excluded) as shown by simulations with binary predictors. This solution corresponds to the extremum of a concave function that we refer to as "rescaled" likelihood. In this context, the use of exponential priors has a clear interpretation as a shift on the predictor means for the minority class. Thanks to the simple binary structure, some random designs give analytic path estimators for the lasso regularization problem. An effective approximate path algorithm by piecewise logarithmic functions based on matrix inversions is also presented. This work was motivated by its potential application to spontaneous reports databases in a pharmacovigilance context.
Comments: 8 figures
Subjects: Methodology (stat.ME)
MSC classes: Primary 62J12, 62F12, 62F15, secondary 34E05, 49M29, 62P10
Cite as: arXiv:1703.08995 [stat.ME]
  (or arXiv:1703.08995v2 [stat.ME] for this version)

Submission history

From: Vincent Runge [view email]
[v1] Mon, 27 Mar 2017 10:17:00 GMT (149kb,D)
[v2] Wed, 18 Apr 2018 10:46:23 GMT (175kb,D)

Link back to: arXiv, form interface, contact.