Top Banner
Label Embedding Trees for Large Multi-class Tasks Samy Bengio Jason Weston David Grangier Presented by Zhengming Xing
14

Label Embedding Trees for Large Multi-class Tasks

Dec 31, 2015

Download

Documents

Label Embedding Trees for Large Multi-class Tasks. Samy Bengio Jason Weston David Grangier. Presented by Zhengming Xing. Outline. Introduction Label Trees Label Embeddings Experiment result. Introduction. Large scale problem: the number of example Feature dimension Number of class. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Label Embedding Trees for Large Multi-class Tasks

Label Embedding Trees for Large Multi-class Tasks

Samy Bengio Jason WestonDavid Grangier

Presented by Zhengming Xing

Page 2: Label Embedding Trees for Large Multi-class Tasks

Outline

• Introduction• Label Trees• Label Embeddings• Experiment result

Page 3: Label Embedding Trees for Large Multi-class Tasks

Introduction

Main idea: propose a fast and memory saving multi-class classifier for large dataset based on trees structure method

Large scale problem:

the number of example Feature dimensionNumber of class

Page 4: Label Embedding Trees for Large Multi-class Tasks

Introduction Label Tree:

Label Predictors:

Indexed nodes:

Edges:

Label sets:

The root contain all classes, and each child label set is a subset of its parent

K is the number of classes

Disjoint tree: any two nodes at the same depth cannot share any labels.

Page 5: Label Embedding Trees for Large Multi-class Tasks

IntroductionClassifying an example:

Page 6: Label Embedding Trees for Large Multi-class Tasks

Label TreesTree loss

I is the indicator function

is the depth in the tree of the final prediction for x

Page 7: Label Embedding Trees for Large Multi-class Tasks

Label treeLearning with fixed label tree: N,E,L chosen in advanceGoal: minimize the tree loss over the variables F

Given training data

Relaxation 1

Relaxation 2

Replace indicator function with hinge loss and

Page 8: Label Embedding Trees for Large Multi-class Tasks

Label treeLearning label tree structure for disjoint tree

Treat A as the affinity matrix and apply the steps similar to spectral clustering

Basic idea: group together labels into the same label set that are likely to be confused at test time.

Page 9: Label Embedding Trees for Large Multi-class Tasks

Label embeddingsis a k-dimensional vector with a 1 in the yth position and 0 otherwise

Problem : how to learn W, V

define

solve

Page 10: Label Embedding Trees for Large Multi-class Tasks

Label embeddingsMethod 1:

The same two steps of algorithm 2Learn V

Learn W

minimize

minimize

Page 11: Label Embedding Trees for Large Multi-class Tasks

Label embedding

Combine all the methods discussed above

Method 2: join learn W and V

minimize

minimize

Page 12: Label Embedding Trees for Large Multi-class Tasks

ExperimentDataset

Page 13: Label Embedding Trees for Large Multi-class Tasks

Experiment

Page 14: Label Embedding Trees for Large Multi-class Tasks

Experiment