We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.AP

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Applications

Title: Improving the precision of classification trees

Authors: Wei-Yin Loh
Abstract: Besides serving as prediction models, classification trees are useful for finding important predictor variables and identifying interesting subgroups in the data. These functions can be compromised by weak split selection algorithms that have variable selection biases or that fail to search beyond local main effects at each node of the tree. The resulting models may include many irrelevant variables or select too few of the important ones. Either eventuality can lead to erroneous conclusions. Four techniques to improve the precision of the models are proposed and their effectiveness compared with that of other algorithms, including tree ensembles, on real and simulated data sets.
Comments: Published in at this http URL the Annals of Applied Statistics (this http URL) by the Institute of Mathematical Statistics (this http URL)
Subjects: Applications (stat.AP)
Journal reference: Annals of Applied Statistics 2009, Vol. 3, No. 4, 1710-1737
DOI: 10.1214/09-AOAS260
Report number: IMS-AOAS-AOAS260
Cite as: arXiv:1011.0608 [stat.AP]
  (or arXiv:1011.0608v1 [stat.AP] for this version)

Submission history

From: Wei-Yin Loh [view email]
[v1] Tue, 2 Nov 2010 13:08:38 GMT (518kb)

Link back to: arXiv, form interface, contact.