機機機機機機機機機 機機機 2016/04/13 Xavier Yin
2016/04/13 Xavier Yin
ID3, C4.5, C5.0, CHAID, CART
(Information Gain)(Impurity) (Homogeneous)Gini Index: CART
Entropy:ID3, C4.5, C5.0:C4.5
(Information Gain)
GINI Index() - GINI()
GINI
GINI
GINI Index() - GINI()
GINI
GINI
GINI Index() - GINI()
GINI
GINI
Entropy()Entropy()
Entropy
Entropy
Misclassification error()Error()
Entropy
Entropy
Gain Ratio()
SplitINFO
(Overfitting)(Underfitting)
(overfitting),
1) A->2) A->3) A->
,B,: 1) B->2) B->3) B->
AA
->A->->A
(Ockhams Razor) ( The simplest explanation is the best )
: (Prepruning) (Postpruning)
(Prepruning),,,: ...;
(Postpruning): (Subtree Replacement): (Subtree Raising):
(Minimum description length principle, MDL)
()