Top Banner
機機機機機機機機機 機機機 2016/04/13 Xavier Yin
35

機器學習與資料探勘:決策樹

Apr 15, 2017

Download

Data & Analytics

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript

2016/04/13 Xavier Yin

ID3, C4.5, C5.0, CHAID, CART

(Information Gain)(Impurity) (Homogeneous)Gini Index: CART

Entropy:ID3, C4.5, C5.0:C4.5

(Information Gain)

GINI Index() - GINI()

GINI

GINI

GINI Index() - GINI()

GINI

GINI

GINI Index() - GINI()

GINI

GINI

Entropy()Entropy()

Entropy

Entropy

Misclassification error()Error()

Entropy

Entropy

Gain Ratio()

SplitINFO

(Overfitting)(Underfitting)

(overfitting),

1) A->2) A->3) A->

,B,: 1) B->2) B->3) B->

AA

->A->->A

(Ockhams Razor) ( The simplest explanation is the best )

: (Prepruning) (Postpruning)

(Prepruning),,,: ...;

(Postpruning): (Subtree Replacement): (Subtree Raising):

(Minimum description length principle, MDL)

()