Top Banner
Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia Lab, Columbia University http://www.ee.columbai.edu/dvmm Columbia University TRECVID-2006 High-Level Feature Extraction
46

Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

Dec 10, 2018

Download

Documents

voxuyen
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu,

Akira Yanagawa, and Eric Zavesky

Digital Video and Multimedia Lab, Columbia Universityhttp://www.ee.columbai.edu/dvmm

Columbia University TRECVID-2006 High-Level Feature Extraction

Page 2: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

2

6 runsVisual-based

Overview – 5 methods & 6 submitted runs

5 methodsbaseline context-based concept fusion

baseline

lexicon-spatial pyramid matching

visual_concept adaptive multi-model_concept adaptive

context

LSPMtext

text featureevent detection

1 2

3 45

Page 3: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

3

Overview – performanceMAP

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

A_CL1_1 A_CL2_2 A_CL3_3 A_CL4_4 A_CL5_5 A_CL6_6

context > baselinecontext-based concept fusion (CBCF) improves baseline

LSPM > contextlexicon-spatial pyramid matching (LSPM) further improves detection

text > LSPM: text features improve visual

visual-basedvisual-textbest visual

best all

multi-model_concept adaptive

visual_concept adaptive

text LSPM context baselineEvery method contributes incrementally to the final detection

Page 4: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

4

Overview – performance

visual_concept adaptive > LSPM (also > context > baseline): best of visual selection works

visual-basedvisual-textbest visual

best all

text > multi-model_concept adaptive:best of all selection does not work well probably due to over fitting of text tool

MAP

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

A_CL1_1 A_CL2_2 A_CL3_3 A_CL4_4 A_CL5_5 A_CL6_6

visual-basedvisual-textbest visual

best all

multi-model_concept adaptive

visual_concept adaptive

text spatial pyramid context baseline

Page 5: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

5

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 6: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

6

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 7: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

7

Color

Texture

Edge

Fixed/Global

Support Vector Machines (SVM)

Individual Methods: (1) BaselineAverage fusion of two SVM baseline classification results

Based on 3 visual featurescolor moments over 5x5 fixed grid partitionsGabor textureedge direction histogram from the whole image

1

coarse local features, layout, and global appearance

Page 8: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

8

2

ensemble classifier

Average fusion of two SVM baseline classification results

Based on 3 visual featurescolor moments over 5x5 fixed grid partitionsGabor textureedge direction histogram from the whole image

Color

Texture

Edge

Fixed/Global

Yanagawa et al., Tec. Rep., Columbia Univ., 2006 , http://www.ee.columbia.edu/dvmm/newPublication.htm

Individual Methods: (1) Baseline

Features and models available for download

soon!

Page 9: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

9

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 10: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

10

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 11: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

11

Individual Methods: (2) CBCF

“Government-Leader” Detector

Hard/specific concept

“Face” Detector

Generic concept

“outdoor” Detector

Generic concept - +Outdoor Face

Government-Leader

Context-based Model

different person

different view

large variance in appearance

government-leader

Context Information

Background on Context Fusion

Page 12: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

12

outdoor detector government-leader detector face detector

context-based model

(government-leader|image)P (face|image)P(outdoor|image)P

(government-leader|image)P (face|image)P(outdoor|image)P

Formulation

Individual Methods: (2) CBCF

(Naphade et al 2002)

Page 13: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

13

outdoor detector government-leader detector face detector

(government-leader|image)P (face|image)P(outdoor|image)P

(government-leader|image)P (face|image)P(outdoor|image)P

Our approach: Discriminative + Generative

outdoor airplane office

Conditional Random Field (Jiang, Chang, et al ICIP 2006)

observation

updated posteriors

1x 2x 3x

1( 1 | )p y = X 2( 1 | )p y = X 3( 1 | )p y = X

I

1C2C 3C

Individual Methods: (2) CBCF

Page 14: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

14

outdoor detector government-leader detector face detector

(government-leader|image)P (face|image)P(outdoor|image)P

(government-leader|image)P (face|image)P(outdoor|image)P

Conditional Random Field

observation

updated posteriors

(1 ) / 2 (1 ) / 2( 1 | ) ( 1 | )i i

i

y yi i

I C

J p y p y+ −= − = = −∏∏ X X

1x 2x 3x

1( 1 | )p y = X 2( 1 | )p y = X 3( 1 | )p y = X

I

min

Our approach: Discriminative + Generative

1C2C 3C

iteratively minimized by boosting

Individual Methods: (2) CBCF

Page 15: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

15

(1 ) / 2 (1 ) / 2( 1 | ) ( 1 | )i i

i

y yi i

I C

J p y p y+ −= − = = −∏∏ X Xmin

iteratively minimized by boosting

During each iteration t:

two SVM classifiers are trained for each concept:

1. Using input independent detection results

2. Using updated posteriors from iteration t-1

Classifier 2 keeps updating through iterationAnd captures inter-conceptual influences

Without classifier 2, Traditional AdaBoost

Individual Methods: (2) CBCF

Page 16: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

16

Database & lexicon for context

• Predefined lexicon to provide context-- 374 concepts from LSCOM ontology (observation)

airplane, building, car, boat, person, outdoor, sports, etc

• Independent detector-- our baseline

• Test concepts-- the 39 concepts defined by NIST (update posteriors)

Individual Methods: (2) CBCF

Page 17: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

17

0

0.2

0.4

0.6

0.8

1

1.2

1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39

independent detector Boosted CRF

experimental results over TRECVID 2005 development set24 improve15 degrade

AP

context-based fusionindependent detector

Individual Methods: (2) CBCF

Page 18: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

18

Selective Application of Context• Not every concept classification benefits

from context-based fusion

• Is there a way to predict when it works?

Consistent with previous context-based fusion:IBM: no more than 8 out of 17 concepts gained performance [Amir et al., TRECVID Workshop, 2003]

Mediamill: 80 out of 101 concepts[Snoek et al., TRECVID Workshop, 2005]

Page 19: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

19

Predict When Context Helps

Strong classifiers may suffer from fusion with weak context

Complex inter-conceptual relationships vs. limited training samples

Why CBCF may not help every concept ?

Strong context

,

,

( ; ) ( )

( ; )j

j

j i jC j i

j iC j i

I C C E C

I C Cβ≠

<∑

∑or

Avoid using CBCF for if is strong and with weak contextiC iC

Use CBCF for concept if is weak or with strong contextiC iC

-- mutual information between and( ; )i jI C C iC jC( )iE C -- error rate of independent detector for iC

( )iE C λ>

weak concept

Page 20: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

20

Predict When Context HelpsChange parameters to predict different number of concepts

# predicted # concept improved MAP gainprecision of prediction

9 9 7.2%100%

39 24 3.0%62%

20 15 9.5%75%

16 14 14%88%

Page 21: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

21

Example

. . .

Fighter_Combat

Individual House

Military

Page 22: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

22

Independent Detector

Example

Page 23: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

23

Context-based concept fusion

Example

Page 24: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

24

Context-based concept fusion

Example

House

Page 25: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

25

Context-based concept fusion

Example

Positive frames are moved forward with the help of Fighter_Combat

Page 26: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

26

Context-Based Fusion + Baseline

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

R6 R5 All get improved !baseline context

MAP Gain:14%

TRECVID 2005 development set

Page 27: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

27

Context-Based Fusion + Baseline

4 concepts

TRECVID 2006 evaluation

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4

AP

baseline context

Similar to results over TRECVID 2005 set !

Page 28: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

28

Discussion

Concepts with performance improved: 3.23

Concepts with performance degraded: 4.17

Adding context – strong relationship and robust

Quality of context: ,

,

( ; ) ( )

( ; )j

j

j i jC j i

j iC j i

I C C E C

I C C≠

The smaller the better

Page 29: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

29

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 30: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

30

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 31: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

31

Individual Methods: (3) LSPMLocal features (SIFT)

Spatial layout sky

watertree

Spatial Pyramid Matching (SPM) [Lazebnik et al. CVPR, 2006]multi-resolution histogram matching in spatial domain, bags-of-features

Lexicon-Spatial Pyramid Matching (LSPM) SPM matching guided by multi-resolution lexiconsAppropriate size for visual lexicon ?

Page 32: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

32

t 1 t 2 t nt 3 t 4 t 5

t 1_1

t n_1

t 1_2

. . .

t n_2

t 2_1

t 2_2

t 3_1 t 3_2

t 4_1

t 4_2

t 5_1

t 5_2

SIFT features

Lexiconlevel 0

Lexiconlevel 1

Individual Methods: (3) LSPM

Page 33: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

33

Image 1...

Image 2...

Local features & Spatial layout of local features| |

SPM kernel

+

+

. . .

t 1 t 2 t n. . .Lexicon level 0

spatial level 0

. . .spatial level 1

. . .spatial level 2

Individual Methods: (3) LSPM

Page 34: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

34

t 1 t 2 t n. . .Lexicon level 0

Lexicon level 1 t 1_1 t n_1t 1_2 . . . t n_2

. . .

SPM kernel 0

SPM kernel 1

. . .

+

+

| |

LSPM kernelSVM classifier

Individual Methods: (3) LSPM

Page 35: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

35

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6

AP

with LSPM without LSPM

We apply LSPM to 13 concepts: flag-us, building, maps, waterscape-waterfront, car, charts, urban,road, boat-ship, vegetation, court, government-leaderComplements baseline by considering local features

almost all get improved !

6 are evaluated by NIST

Individual Methods: (3) LSPM

Page 36: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

36

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 37: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

37

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 38: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

38

Individual Methods: (4) Text

asynchrony between the words being spoken and the visual concepts appearing in the shot

Problems:

Solution:incorporate associated text from the entire story

story bag-of-words(term-frequency-inverse document frequency)

training data: bag-of-words features of stories

ground-truth label: positive – one shot is positiveSVM

dimension reduction

by frequency-- top k most

frequent wordsautomatically detected story boundaries [Hsu et al., ADVENT Technical Report , Columbia Univ., 2005 ]

Page 39: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

39

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

AP

visual only text + visual

0.2 text + 0.8 visual

MAP Gain4.5%

Individual Methods: (4) Text

Page 40: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

40

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 41: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

41

• Baseline

• Context-based concept fusion (CBCF)

• Lexicon-spatial pyramid matching (LSPM)

• Text features

• Event detection

Outline – New Algorithms

Page 42: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

42

Individual Methods: (5) EventEvent detection: Key frame v.s. Multiple frames

Page 43: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

43

Individual Methods: (5) EventEvent detection: Key frame v.s. Multiple frames

P

.

.

.

p1

pm

P

Supply

.

.

.

Q

q1

qn

q2

Q

demand

dij

Earth Mover’s Distance: minimum weighted distance by linear programming

11/21/2

fij: correspondence flow

SVM

handle temporal shift: a frame at the beginning of P can map to a frame at the end of Q

Handle scale variations: a frame from P can map to multiple frames in Q

Page 44: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

44

Individual Methods: (5) Eventexperimental results

0

0.2

0.4

0.6

0.8

1

AP

Key Frame EMD

Performance over TRECVID 2005 development set11 events: airplane_flying, people_marching, car_crash,

exiting_car, demonstration_or_protest, election_campaign_greeting,

parade, riot, running, shooting, walking

Page 45: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

45

Conclusion

• TRECVID 2006 offers a mature opportunity for evaluating concept interaction— We have built 374 concept detectors— Models and feature will be released soon

• Context-Based Fusion— Propose a systematic framework for predicting the effect of context fusion— (TRECVID 2005) 14 out of 16 predicted concepts show performance gain— (TRECVID 2006) 3 out of 4 predicted concepts show performance gain— Promising methodology for scaling up to large-scale systems (374 models)

• Results from Parts-based model (LSPM) are mixed— But show consistent improvement when fused with SVM baseline— 3 out of 6 concepts improve by more than 10%

• Temporal event modeling— We propose a novel matching and detection method based on EMD+SVM — Show consistent gains in 2005 data set— Results in 2006 are incomplete and lower than expected

Page 46: Columbia University TRECVID-2006 High-Level Feature … · Shih-Fu Chang, Winston Hsu, Wei Jiang, Lyndon Kennedy, Dong Xu, Akira Yanagawa, and Eric Zavesky Digital Video and Multimedia

46

• More information at– http://www.ee.columbia.edu

• Features and models for baseline detectors for 374 LSCOM concepts coming soon