We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Applications

Title: Confronting Quasi-Separation in Logistic Mixed Effects for Linguistic Data: A Bayesian Approach

Abstract: Mixed effects regression models are widely used by language researchers. However, these regressions are implemented with an algorithm which may not converge on a solution. While convergence issues in linear mixed effects models can often be addressed with careful experiment design and model building, logistic mixed effects models introduce the possibility of separation or quasi-separation, which can cause problems for model estimation that result in convergence errors or in unreasonable model estimates. These problems cannot be solved by experiment or model design. In this paper, we discuss (quasi-)separation with the language researcher in mind, explaining what it is, how it causes problems for model estimation, and why it can be expected in linguistic datasets. Using real linguistic datasets, we then show how Bayesian models can be used to overcome convergence issues introduced by quasi-separation, whereas frequentist approaches fail. On the basis of these demonstrations, we advocate for the adoption of Bayesian models as a practical solution to dealing with convergence issues when modeling binary linguistic data.
Comments: Draft version of JQL accepted paper
Subjects: Applications (stat.AP)
DOI: 10.1080/09296174.2018.1499457
Cite as: arXiv:1611.00083 [stat.AP]
  (or arXiv:1611.00083v3 [stat.AP] for this version)

Submission history

From: Joseph Roy [view email]
[v1] Mon, 31 Oct 2016 23:48:17 GMT (425kb,D)
[v2] Fri, 23 Dec 2016 13:21:28 GMT (394kb,D)
[v3] Fri, 7 Sep 2018 16:46:50 GMT (74kb,D)

Link back to: arXiv, form interface, contact.