Title: Top-down Transformation Choice
(Submitted on 26 Jun 2017 (this version), latest version 14 Dec 2017 (v2))
Abstract: Simple models are preferred over complex models, but oversimplistic models may lead to erroneous interpretations. The classical approach is to start with a simple model whose shortcomings are assessed in residual-based model diagnostics. Eventually, one increases the complexity of this initial, overly simple model and obtains a better-fitting model. We illustrate how transformation analysis can be used to implement an alternative approach to model choice. Instead of adding complexity to simple models, we demonstrate how step-wise complexity reduction can help to identify simpler and more interpretable models. We aim at modelling body mass index distributions in Switzerland by means of transformation models and try to understand the impact of sex, age, smoking and other lifestyle factors on a person's body mass index. In this process, we try to find a compromise between model fit and model interpretability. Special emphasis is given to the understanding of the connections between transformation models of increasing complexity. The models used in our analysis range from evergreens, such as the normal linear regression model with constant variance, to novel models with extremely flexible conditional distribution functions, such as transformation trees and transformation forests.
Submission history
From: Torsten Hothorn
[v1] Mon, 26 Jun 2017 08:08:01 GMT (510kb,D)
[v2] Thu, 14 Dec 2017 13:09:24 GMT (510kb,D)