References & Citations
Statistics > Applications
Title: Confronting Quasi-Separation in Logistic Mixed Effects for Linguistic Data: A Bayesian Approach
(Submitted on 31 Oct 2016 (v1), last revised 7 Sep 2018 (this version, v3))
Abstract: Mixed effects regression models are widely used by language researchers. However, these regressions are implemented with an algorithm which may not converge on a solution. While convergence issues in linear mixed effects models can often be addressed with careful experiment design and model building, logistic mixed effects models introduce the possibility of separation or quasi-separation, which can cause problems for model estimation that result in convergence errors or in unreasonable model estimates. These problems cannot be solved by experiment or model design. In this paper, we discuss (quasi-)separation with the language researcher in mind, explaining what it is, how it causes problems for model estimation, and why it can be expected in linguistic datasets. Using real linguistic datasets, we then show how Bayesian models can be used to overcome convergence issues introduced by quasi-separation, whereas frequentist approaches fail. On the basis of these demonstrations, we advocate for the adoption of Bayesian models as a practical solution to dealing with convergence issues when modeling binary linguistic data.
Submission history
From: Joseph Roy [view email][v1] Mon, 31 Oct 2016 23:48:17 GMT (425kb,D)
[v2] Fri, 23 Dec 2016 13:21:28 GMT (394kb,D)
[v3] Fri, 7 Sep 2018 16:46:50 GMT (74kb,D)
Link back to: arXiv, form interface, contact.