Computing and using the deviance with classification trees

Published in: Rizzi, Alfredo and Vichi, Maurizio (Ed.). COMPSTAT 2006 - Proceedings in Computational Statistics. Berlin, 2006, p. 55-66
Abstract: The reliability of induced classification trees is most often evaluated by means of the error rate. Whether computed on test data or through cross-validation, this error rate is suited to classification purposes. We claim, however, that it is only a partial indicator of the quality of the knowledge provided by trees and that additional indicators are needed. For example, the error rate is not representative of the quality of the description provided. In this paper we focus on this descriptive aspect. We consider the deviance as a goodness-of-fit statistic that measures how well the tree reproduces the conditional distribution of the response variable for each possible profile (rather than the individual response value for each case), and we discuss various statistical tests that can be derived from it. Special attention is devoted to computational aspects.
Keywords: Classification tree; Deviance; Goodness-of-fit; Chi-square statistics; BIC
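To illustrate the idea in the abstract, the following is a minimal sketch of a likelihood-ratio deviance that compares the observed class counts of each profile with the counts implied by a tree that groups profiles into leaves. It assumes the usual form D = 2 Σ n ln(n / n̂), where n̂ is the profile total multiplied by the class share in the profile's leaf. The function name `tree_deviance`, its arguments, and the toy counts are hypothetical and are not taken from the paper.

```python
import math

def tree_deviance(observed, leaf_of_profile):
    """Likelihood-ratio deviance between observed and tree-implied counts.

    observed: one row per profile, each row a list of class counts.
    leaf_of_profile: leaf label assigned to each profile by the tree.
    (Both argument names are hypothetical, chosen for this sketch.)
    """
    # Pool the class counts of all profiles that fall in the same leaf.
    leaf_counts = {}
    for counts, leaf in zip(observed, leaf_of_profile):
        pooled = leaf_counts.setdefault(leaf, [0] * len(counts))
        for k, n in enumerate(counts):
            pooled[k] += n

    total = 0.0
    for counts, leaf in zip(observed, leaf_of_profile):
        n_profile = sum(counts)
        leaf_total = sum(leaf_counts[leaf])
        for k, n in enumerate(counts):
            if n > 0:  # empty cells contribute nothing to the sum
                # Count implied by the tree: profile size times leaf class share.
                expected = n_profile * leaf_counts[leaf][k] / leaf_total
                total += n * math.log(n / expected)
    return 2.0 * total

# Toy data (made up): two profiles, two response classes.
counts = [[10, 0], [5, 5]]
d_saturated = tree_deviance(counts, [0, 1])  # each profile in its own leaf
d_root = tree_deviance(counts, [0, 0])       # both profiles in a single leaf
```

When every profile has its own leaf, the tree reproduces the observed distributions exactly and the deviance is zero. Merging profiles with different class distributions into one leaf gives a positive value.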
Full text: proceedings chapter, public document, free access
Citation (ISO format):
RITSCHARD, Gilbert. Computing and using the deviance with classification trees. In: Rizzi, Alfredo and Vichi, Maurizio (Ed.). COMPSTAT 2006 - Proceedings in Computational Statistics. Berlin. [s.l.] : [s.n.], 2006. p. 55-66. doi: 10.1007/978-3-7908-1709-6_5 https://archive-ouverte.unige.ch/unige:4540

Deposited on: 2009-12-01