We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Interactive Graphics for Visually Diagnosing Forest Classifiers in R

Abstract: This paper describes structuring data and constructing plots to explore forest classification models interactively. A forest classifier is an example of an ensemble, produced by bagging multiple trees. The process of bagging and combining results from multiple trees, produces numerous diagnostics which, with interactive graphics, can provide a lot of insight into class structure in high dimensions. Various aspects are explored in this paper, to assess model complexity, individual model contributions, variable importance and dimension reduction, and uncertainty in prediction associated with individual observations. The ideas are applied to the random forest algorithm, and to the projection pursuit forest, but could be more broadly applied to other bagged ensembles. Interactive graphics are built in R, using the ggplot2, plotly, and shiny packages.
Subjects: Machine Learning (stat.ML)
Cite as: arXiv:1704.02502 [stat.ML]
  (or arXiv:1704.02502v1 [stat.ML] for this version)

Submission history

From: Natalia Da Silva [view email]
[v1] Sat, 8 Apr 2017 14:41:38 GMT (2164kb,D)

Link back to: arXiv, form interface, contact.