Sensitivity analysis of the topology of classification trees
Loading...
Authors
Kobayashi, Izumi.
Subjects
Advisors
Buttrey, Samuel E.
Date of Issue
1999-12
Date
1999
Publisher
Monterey, California. Naval Postgraduate School
Language
en_US
Abstract
The use of classification trees is one of the most widely used techniques in classification. It is well known that classification trees are not stable in their topology, in contrast to their robustness with respect to misclassification rate. This thesis defines a measure that compares the topology of two trees and studies how a tree's topology changes when the dependent (y) variable or the independent (x) variables are perturbed. This allows us to examine the "robustness" of tree topology under perturbation and to compare it to the robustness with respect to the misclassification rate under the same perturbations. We show that the tree topology can change significantly even for small perturbations in many sets of data. This suggests that even small measurement errors in the variables can affect the tree topology greatly. Because data are often measured with error, it follows that splitting rules in trees may not be suitable for use in making policy decisions. We propose a measure for tree topology, and show that tree topology changes faster than the misclassification rate does under mild perturbations. This finding formalizes the concept that tree models are more stable in terms of misclassification rate than in terms of topology.
Type
Thesis
Description
Series/Report No
Department
Organization
Identifiers
NPS Report Number
Sponsors
Funder
Format
xii, 65 p.;28 cm.
Citation
Distribution Statement
Approved for public release; distribution is unlimited.