Top Banner
Intelligent Database Systems Lab 國國國國國國國國 National Yunlin University of Science and Technology 1 Learning multiple nonredundant clusterings Presenter : Wei-Hao Huang Authors : Ying Gui, Xiaoli Z. Fern, Jennifer G. DY TKDD, 2010
23

Learning multiple nonredundant clusterings

Feb 22, 2016

Download

Documents

Gwen_

Learning multiple nonredundant clusterings. Presenter : Wei- Hao Huang Authors : Ying Gui , Xiaoli Z. Fern, Jennifer G. DY TKDD, 2010. Outlines. Motivation Objectives Methodology Experiments Conclusions Comments. Motivation. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

國立雲林科技大學National Yunlin University of Science and Technology

1

Learning multiple nonredundant clusterings

Presenter : Wei-Hao Huang  Authors : Ying Gui, Xiaoli Z. Fern, Jennifer G. DY

TKDD, 2010

Page 2: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

2

Outlines Motivation Objectives Methodology Experiments Conclusions Comments

Page 3: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

3

Motivation· Data exist multiple groupings that are reasonable

and interesting from different perspectives.· Traditional clustering is restricted to finding only

one single clustering.

Page 4: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Objectives

4

• To propose a new clustering paradigm for finding all non-redundant clustering solutions of the data.

Page 5: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

5

Methodology· Orthogonal clustering

─ Cluster space· Clustering in orthogonal subspaces

─ Feature space· Automatically Finding the number of clusters· Stopping criteria

Page 6: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Orthogonal Clustering Framework

6

X (Face dataset)

Page 7: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Orthogonal clustering

· Residue space

7

)

Page 8: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Clustering in orthogonal subspaces· Feature space

─ linear discriminant analysis (LDA)

─ singular value decomposition (SVD)

─ LDA v.s. SVD where

8

Projection Y=ATX

Page 9: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Clustering in orthogonal subspaces

· Residue space

9

A(t)= eigenvectors of

Page 10: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Compare moethod1 and mothod2· Residue space· Moethod1

─ · Moethod2

─ ─

· Moethod1 is a special case of Moethod2.─

10

A(t)= eigenvectors of

M’=M then P1=P2

Page 11: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· To use PCA to reduce dimensional· Clustering

─ K-means clustering Smallest SSE

─ Gaussian mixture model clustering (GMM) Largest maximum likelihood

· Dataset─ Synthetic─ Real-world

Face, WebKB text, Vowel phoneme, Digit

11

Page 12: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Evaluation

12

Page 13: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Synthetic

13

Page 14: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Face dataset

14

Page 15: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· WebKB dataset

· Vowe phoneme dataset

15

Page 16: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Digit dataset

16

Page 17: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Finding the number of clusters

─ K-means Gap statistics

17

Page 18: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Finding the number of clusters

─ GMMBIC

· Stopping Criteria─ SSE is less than 10% at first iteration─ Kopt=1─ Kopt > Kmax Select Kmax ─ Gap statistics─ BIC Maximize value of BIC

18

Page 19: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Synthetic dataset

19

Page 20: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· Face dataset

20

Page 21: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.Experiments· WebKB dataset

21

Page 22: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

22

Conclusions

• To discover varied interesting and meaningful

clustering solutions.

• Method2 is able to apply any clustering and

dimensionality reduction algorithm.

Page 23: Learning multiple  nonredundant clusterings

Intelligent Database Systems Lab

N.Y.U.S.T.

I. M.

23

Comments· Advantages

─ Find Multiple non-redundant clustering solutions

· Applications─ Data Clustering