Top Banner
Analysis, Indexing and Visualization of Presentation Videos Analysis, Indexing and Visualization of Presentation Videos Analysis, Indexing and Visualization of Presentation Videos Michele Merler email: [email protected] Computer Science Department, Columbia University Motivation & Domain Description Michele Merler email: [email protected] Computer Science Department, Columbia University Motivation & Domain Description Domain challenges: “WILD” ! Many videos are already archived Low quality Lack of Structure A quickly increasing quantity of presentation videos is GOAL : Help users efficiently Domain challenges: “WILD” ! Lack of additional Not recorded by Unconstrained camera already archived Low quality Lack of Structure Videos of presentations are tools nowadays A quickly increasing quantity of presentation videos is publicly available and retrievable on the web GOAL : Help users efficiently and effectively access Lack of additional sources of Not recorded by professional Unconstrained camera movements employed in a large variety of systems and effectively access (educational) information information (e.g. electronic cameramen Slides Truncation Distance or E-learning Conference proceedings (educational) information (e.g. electronic copies of slides) Light cannot be used as clue Compression Conference proceedings Student presentations results 1-20 of 1,160 used as clue Not edited Standard processing Student presentations Corporate talks 659 events 9K authors 12K lectures 14K videos results 1-20 of 1,160 Not edited does not apply 12K lectures 14K videos 3. Graphics Index Generation GOAL : Ensure end users satisfaction with how the 1. User Preferred Face Indexes 3. Graphics Index Generation GOAL : Ensure end users satisfaction with how the information extracted from the videos is presented Results 1. User Preferred Face Indexes Experimental Setup Results 1575 Amazon Mechanical Turk HITs (15 speakers x 3 ordering x 35 unique workers) Most people prefer Head & Shoulder FRONTALview 35% of votes went to Left and Right ¾ Head & Shoulder! Proposed Solution (15 speakers x 3 ordering x 35 unique workers) 35% of votes went to Left and Right ¾ Head & Shoulder! Confirms results of psychological studies on inference of Proposed Solution LBP Histogram + Color Histogram head 3D information from 3/4 view of face [Burke VR07] Index presentation videos based on four major cues: LBP Histogram + Color Histogram Online Clustering (visual + temporal) with avg. Linkage Text (+audio transcripts) Graphics Online Clustering (visual + temporal) with avg. Linkage Norm. Cross Correlation for Template Matching Speaker faces Graphics Mosaics 5 . 0 ) , ( 1 1 ) , ( 1 2 C k j i c x C C x S j + = = χ i x region It has better illumination It has better resolution I can see/tell more about the whole appearance of the person I can see better the eyes and expression of the person ( ) ( ) ) ( 4 . 0 ) ( 4 . 0 ) , ( 5 . 0 ) , ( 1 j j j i k jk i j C T t C S C x T c x C + + = + = β α χ j C cluster I can see better the eyes and expression of the person I prefer this pose of a person in general I picked the best out of a bunch of bad pictures None of the above(please explain your reason with a few words in the box below) BACK-END ) , ( ) , ( j i j i C x T C x S < > BACK-END j i j i < 2. Automatic Generation of Speakers Face Indexes User Preferred Textual Index Graphics Index 4. Textual Index Generation 2. Automatic Generation of Speakers Face Indexes User Preferred Face Indexes Textual Index Generation Graphics Index Generation 4. Textual Index Generation 1 3 4 Edges Connected Geometric + Edge Local Adaptive Otsu Face Indexes Generation Generation Selection based on 3 quality measures Viola Jones detector Face LoGedges Edges Connected Components Geometric + Edge Density Constraints Local Adaptive Otsu (LAO) Binarization Tesseract OCR Speaker Face Semantic Shot 2 5 1. Resolution Selection based on 3 quality measures Viola Jones detector Color skin filter Face Detection Completed Tasks Speaker Face Index Generation Semantic Shot Representation 2 5 h w × 1. Resolution Size of the face region Color skin filter Detection Rcscarch Interview with Client { i sited Project Space h w × Size of the face region Face Seeds Qiant House Resident Association Meeting 1 hour and 45 minutes of video, 8 student presentations Face Seeds Completed Tasks 1 hour and 45 minutes of video, 8 student presentations 13 slides per presentation (average) O t f MILTrack(prediction): Face Research Interview Client After vocabulary correction LAO + Tesseract 13 slides per presentation (average) VASTMM Browser [1] 6 t f P t f MILTrack(prediction): Viola Jones detector (observation): Face Interview Client Project Space House Resident Association Meeting correction Tesseract 8000 zed LAO + Tesseract VASTMM Browser [1] 6 t f 2. Pose Viola Jones detector (observation): Simplified Kalman filter: Tracking House Resident Association Meeting Tesseract 6000 cogniz ers FRONT-END O t P t t f f f ) 1 ( α α + Left and right ¾ pose classifiers Edge histogram descriptor 2. Pose Simplified Kalman filter: 4000 Rec aracte t t t f f f ) 1 ( α α + 15383 15385 15387 15371 15409 15355 Edge histogram descriptor SVM RBF kernel Face Tracks 2000 4000 mber Cha 6. Final Browser interface 15383 15385 15387 15371 15409 15355 SVM RBF kernel FaceTracerdataset Face Tracks 2000 Num 6. Final Browser interface Training Set (left ¾, front, right ¾) ~10K images Home Search Explore Collections Visual Search Login or sign up 0 Recognition Method Training Set (left ¾, front, right ¾) ~10K images Test Set ~12K images Average Test Accuracy 81.5% Selection of faces to match Search Home Search Explore Collections Visual Search Login or sign up Number Number Precision Recall Recognition Method Average Test Accuracy 81.5% Selection of faces to match LBP descriptor + Sq. L2 distance Tracks People Index Graphics Index Search Tips Phone + P05 + G08 Tag Number GT Words Number Rec. Words Precision Recall 3 Skin Ratio Matching Phone + P05 + G08 Tag People Index Graphics Index 2276 1126 0.495 0.665 Unique Speakers area skinPixels skinRatio # = > 185 . 1 R Unique Speakers Face Tracks 5. Semantic Shot Representation Enhanced Feature Based Mosaic area > > = 107 . 0 185 . 1 skin Pixel RB G 5. Semantic Shot Representation Enhanced Feature Based Mosaic > > + + = 112 . 0 107 . 0 ) ( skin Pixel 2 RG B G R Face Index Resolution 10 secs > + + 112 . 0 ) ( 2 B G R RG Select “best faces” to present to end user Face Index Generation PTZ Estimation SIFT + RANSAC on min max Resolution 10 secs Click on an icon to find the graphic in the video present to end user Generation SIFT + RANSAC on keyframes skinRatio w resolution w pose w Q + + = 3 2 1 350 Overlay Recognized Text min max line Click on an icon to find the graphic in the video skinRatio w resolution w pose w Q + + = 3 2 1 Average Track Matching Time (secs) 335 300 350 Overlay Recognized Text Tagl s Test on 3 335 Track Matching Face Selection Left/right34 Extraction 200 250 Problem Statement •Ecological Impact Text1 Text3 Text4 Text5 Text7 Text8 Text10 Frames 51 out of 58 with Head & shoulder, ¾ profile view Test on 3 student Left/right34 Extraction Skin-Res Extraction K-Means Computation 150 200 •Waste goes to Landfills •Energy Source • Cost Efficiency Waste Disposal Bill Text1 Text3 Text4 Text5 Text7 Text8 Text10 Text Problem Phone 51 out of 58 with Head & shoulder, ¾ profile view student presenta- K-Means Computation 50 100 • Electrical Bill •Ms Wilson is looking for an eco-friendly, cost efficient, and easy to use product that will convert her solid waste into usable energy Segment video into semantically distinct shots based on slides People presenta- tion videos, 0 50 energy Enhance Graphics on slides Changes in text used to assess slide changes Graphics 45 minutes each 20 19 K-Means(100) select (100) min-min 1 2 3 Enhance Graphics Changes in text used to assess slide changes G each K-Means(100) select (100) min-min
1

Analysis, Indexing and Visualization of Presentation Videosmmerler/poster_dsp176-Merler.pdf · Slides Truncation Distance or E-learning Conference proceedings ( copies of slides)

Jul 03, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Analysis, Indexing and Visualization of Presentation Videosmmerler/poster_dsp176-Merler.pdf · Slides Truncation Distance or E-learning Conference proceedings ( copies of slides)

Analysis, Indexing and Visualization of Presentation Videos

Analysis, Indexing and Visualization of Presentation Videos

Analysis, Indexing and Visualization of Presentation Videos

Analysis, Indexing and Visualization of Presentation Videos

Mic

he

le M

erl

er

em

ail

: m

me

rle

r@cs

.co

lum

bia

.ed

u

C

om

pu

ter

Sci

en

ce D

ep

art

me

nt,

Co

lum

bia

Un

ive

rsit

y

Mo

tiv

ati

on

& D

om

ain

De

scri

pti

on

Mic

he

le M

erl

er

em

ail

: m

me

rle

r@cs

.co

lum

bia

.ed

u

C

om

pu

ter

Sci

en

ce D

ep

art

me

nt,

Co

lum

bia

Un

ive

rsit

y

Mo

tiv

ati

on

& D

om

ain

De

scri

pti

on

Do

ma

in c

ha

lle

ng

es:

“W

ILD

” !

Ma

ny

v

ide

os

are

alr

ea

dy

arc

hiv

ed

Low

qu

ali

tyLa

ck o

f S

tru

ctu

reA

qu

ick

ly i

ncr

ea

sin

g q

ua

nti

ty o

f p

rese

nta

tio

n v

ide

os

is

GO

AL

: H

elp

use

rs e

ffic

ien

tly

M

oti

va

tio

n &

Do

ma

in D

esc

rip

tio

nD

om

ain

ch

all

en

ge

s: “

WIL

D”

! •

Lack

of

ad

dit

ion

al

•N

ot

reco

rde

d b

y

•U

nco

nst

rain

ed

ca

me

ra

alr

ea

dy

arc

hiv

ed

Low

qu

ali

tyLa

ck o

f S

tru

ctu

re

Vid

eo

s o

f p

rese

nta

tio

ns

are

to

ols

no

wa

da

ys

A q

uic

kly

in

cre

asi

ng

qu

an

tity

of

pre

sen

tati

on

vid

eo

s is

pu

bli

cly a

vail

ab

le a

nd

re

trie

vab

le o

n t

he

we

bG

OA

L :

He

lp u

sers

eff

icie

ntl

y

an

d e

ffe

ctiv

ely

acc

ess

Lack

of

ad

dit

ion

al

sou

rce

s o

f

•N

ot

reco

rde

d b

y

pro

fess

ion

al

•U

nco

nst

rain

ed

ca

me

ra

mo

vem

en

ts

Vid

eo

s o

f p

rese

nta

tio

ns

are

to

ols

no

wa

da

ys

em

plo

yed

in a

la

rge

va

rie

ty o

f sy

ste

ms

an

d e

ffe

ctiv

ely

acc

ess

(ed

uca

tio

na

l) in

form

ati

on

sou

rce

s o

f

info

rma

tio

n

(e.g

. e

lect

ron

ic

pro

fess

ion

al

cam

era

me

n

mo

vem

en

ts

•S

lid

es

Tru

nca

tio

n�

Dis

tan

ce o

r E

-le

arn

ing

�C

on

fere

nce

pro

cee

din

gs

(ed

uca

tio

na

l) in

form

ati

on

(e.g

. e

lect

ron

ic

cop

ies

of

slid

es)

•Li

gh

t ca

nn

ot

be

use

d a

s cl

ue

•S

lid

es

Tru

nca

tio

n

•C

om

pre

ssio

n�

Co

nfe

ren

ce p

roce

ed

ing

s

�S

tud

en

t p

rese

nta

tio

ns

resu

lts

1-2

0 o

f 1

,16

0

use

d a

s cl

ue

•N

ot

ed

ite

d

•C

om

pre

ssio

n

•S

tan

da

rd p

roce

ssin

g

�S

tud

en

t p

rese

nta

tio

ns

�C

orp

ora

te t

alk

s6

59

eve

nts

9

K a

uth

ors

12

K le

ctu

res

1

4K

vid

eo

sre

sult

s 1

-20

of

1,1

60

•N

ot

ed

ite

d•

Sta

nd

ard

pro

cess

ing

do

es

no

t a

pp

ly

�C

orp

ora

te t

alk

s1

2K

lect

ure

s 1

4K

vid

eo

s

3.

Gra

ph

ics

Ind

ex

Ge

ne

rati

on

GO

AL

: E

nsu

re e

nd

use

rs s

ati

sfa

ctio

nw

ith

ho

w t

he

1

. U

ser

Pre

ferr

ed

Fa

ce I

nd

exe

s3

. G

rap

hic

s In

de

x G

en

era

tio

nG

OA

L:

En

sure

en

du

sers

sa

tisf

act

ion

wit

h h

ow

th

e

info

rma

tio

n e

xtra

cte

d f

rom

th

e v

ide

os

is p

rese

nte

d

Re

sult

s

1.

Use

r P

refe

rre

d F

ace

In

de

xes

3.

Gra

ph

ics

Ind

ex

Ge

ne

rati

on

Ex

pe

rim

en

tal

Se

tup

Re

sult

s

�1

57

5 A

ma

zon

Me

cha

nic

al

Turk

HIT

s

(15

sp

ea

kers

x 3

ord

eri

ng

x 3

5 u

niq

ue

wo

rke

rs)

�M

ost

pe

op

le p

refe

r H

ea

d &

Sh

ou

lde

r F

RO

NTA

Lv

iew

�3

5%

of

vote

s w

en

t to

Le

ft a

nd

Rig

ht

¾ H

ea

d &

Sh

ou

lde

r!P

rop

ose

d S

olu

tio

n(1

5 s

pe

ake

rs x

3 o

rde

rin

g x

35

un

iqu

e w

ork

ers

)�

35

% o

f vo

tes

we

nt

to L

eft

an

d R

igh

t ¾

He

ad

& S

ho

uld

er!

Co

nfi

rms

resu

lts

of

psy

cho

log

ica

l st

ud

ies

on

in

fere

nce

of

Pro

po

sed

So

luti

on

�LB

P H

isto

gra

m +

Co

lor

His

tog

ram

Co

nfi

rms

resu

lts

of

psy

cho

log

ica

l st

ud

ies

on

in

fere

nce

of

he

ad

3D

in

form

ati

on

fro

m 3

/4 v

iew

of

face

[B

urk

e V

R0

7]

Ind

ex

pre

sen

tati

on

vid

eo

s b

ase

d o

n f

ou

r m

ajo

r cu

es:

�LB

P H

isto

gra

m +

Co

lor

His

tog

ram

�O

nli

ne

Clu

ste

rin

g (

vis

ua

l + t

em

po

ral)

wit

h a

vg.

Lin

kag

e

Ind

ex

pre

sen

tati

on

vid

eo

s b

ase

d o

n f

ou

r m

ajo

r cu

es:

�Te

xt (

+a

ud

io t

ran

scri

pts

)�

Gra

ph

ics

�O

nli

ne

Clu

ste

rin

g (

vis

ua

l + t

em

po

ral)

wit

h a

vg.

Lin

kag

e

�N

orm

. C

ross

Co

rre

lati

on

fo

r Te

mp

late

Ma

tch

ing

Text

(+

au

dio

tra

nsc

rip

ts)

�S

pe

ake

r fa

ces

�G

rap

hic

s

�M

osa

ics

5.0

),

(

11

),

(1

2

C k

ji

cx

CC

xS

j

+=

∑ =χ

ixregion

It h

as

be

tte

r il

lum

ina

tio

n

It h

as

be

tte

r re

solu

tio

n

I ca

n s

ee

/te

ll m

ore

ab

ou

t th

e w

ho

le a

pp

ea

ran

ce o

f th

e p

ers

on

I ca

n s

ee

be

tte

r th

e e

ye

s a

nd

exp

ress

ion

of

the

pe

rso

n

�M

osa

ics

()

() )

(4.0

)(

4.0

),

(

5.0

),

(1

2

jj

ji

kjk

ij

CT

tC

SC

xT

cx

C

−+

−+

=

+∑ =

βα

χj

Ccluster

I ca

n s

ee

be

tte

r th

e e

ye

s a

nd

exp

ress

ion

of

the

pe

rso

n

I p

refe

r th

is p

ose

of

a p

ers

on

in

ge

ne

ral

I p

ick

ed

th

e b

est

ou

t o

f a

bu

nch

of

ba

d p

ictu

res

No

ne

of

the

ab

ov

e(p

lea

se e

xpla

in y

ou

r re

aso

n w

ith

a f

ew

wo

rds

in t

he

bo

x b

elo

w)

BA

CK

-EN

D

()

()

jj

ji

),

(

),

(j

ij

iC

xT

Cx

S<>

BA

CK

-EN

D)

,(

),

(j

ij

iC

xT

Cx

S<>

2.

Au

tom

ati

c G

en

era

tio

n o

f S

pe

ak

ers

Fa

ce I

nd

exe

sU

ser

Pre

ferr

ed

Te

xtu

al

Ind

ex

Gra

ph

ics

Ind

ex

4.

Tex

tua

l In

de

x G

en

era

tio

n2

. A

uto

ma

tic

Ge

ne

rati

on

of

Sp

ea

ke

rs F

ace

In

de

xes

Use

r P

refe

rre

d

Face

In

de

xes

Text

ua

l In

de

x

Ge

ne

rati

on

Gra

ph

ics

Ind

ex

Ge

ne

rati

on

4.

Tex

tua

l In

de

x G

en

era

tio

n1

34

Ed

ge

s C

on

ne

cte

d

Ge

om

etr

ic +

Ed

ge

Lo

cal A

da

pti

ve O

tsu

Fa

ce I

nd

exe

sG

en

era

tio

nG

en

era

tio

n

Se

lect

ion

ba

sed

on

3 q

ua

lity

me

asu

res

�V

iola

Jo

ne

s d

ete

cto

rFa

ce

LoG

ed

ge

sE

dg

es

Co

nn

ect

ed

Co

mp

on

en

ts

Ge

om

etr

ic +

Ed

ge

De

nsi

ty C

on

stra

ints

Loca

l Ad

ap

tive

Ots

u

(LA

O)

Bin

ari

zati

on

Tess

era

ctO

CR

Sp

ea

ker

Face

S

em

an

tic

Sh

ot

25

1.

Re

solu

tio

n

Se

lect

ion

ba

sed

on

3 q

ua

lity

me

asu

res

�V

iola

Jo

ne

s d

ete

cto

r

�C

olo

r sk

in f

ilte

rFa

ce

De

tect

ion

Co

mp

on

en

tsD

en

sity

Co

nst

rain

ts(L

AO

) B

ina

riza

tio

nTe

sse

ract

OC

R

Co

mp

lete

d T

ask

sS

pe

ake

r Fa

ce

Ind

ex

Ge

ne

rati

on

Se

ma

nti

c S

ho

t

Re

pre

sen

tati

on

25

hw

×1

. R

eso

luti

on

Siz

e o

f th

e f

ace

re

gio

n

�C

olo

r sk

in f

ilte

rD

ete

ctio

nR

csca

rch

Inte

rvie

w w

ith

Cli

en

t

{ i

site

d P

roje

ct S

pa

ce

Co

mp

lete

d T

ask

s

Ind

ex

Ge

ne

rati

on

Re

pre

sen

tati

on

hw

×S

ize

of

the

fa

ce r

eg

ion

Face

Se

ed

s

Qia

nt

Ho

use

Re

sid

en

t A

sso

cia

tio

n M

ee

tin

g

�1

ho

ur

an

d 4

5 m

inu

tes

of

vid

eo

, 8

stu

de

nt

pre

sen

tati

on

s

Face

Se

ed

s

Co

mp

lete

d T

ask

s

�1

ho

ur

an

d 4

5 m

inu

tes

of

vid

eo

, 8

stu

de

nt

pre

sen

tati

on

s

�1

3 s

lid

es

pe

r p

rese

nta

tio

n (

ave

rag

e)

O tf

�M

ILTr

ack

(pre

dic

tio

n):

Face

Re

sea

rch

Inte

rvie

w

Cli

en

tA

fte

r vo

cab

ula

ry

corr

ect

ion

LAO + Tesseract

�1

3 s

lid

es

pe

r p

rese

nta

tio

n (

ave

rag

e)

VA

ST

MM

Bro

wse

r [1

]6

tf

P tf

�M

ILTr

ack

(pre

dic

tio

n):

�V

iola

Jo

ne

s d

ete

cto

r(o

bse

rva

tio

n):

Face

In

terv

iew

C

lie

nt

Pro

ject

Sp

ace

Ho

use

Re

sid

en

t A

sso

cia

tio

n M

ee

tin

g

corr

ect

ion

Tesseract

8000

Recognized

LAO + Tesseract

VA

ST

MM

Bro

wse

r [1

]6

tf

2.

Po

se

�V

iola

Jo

ne

s d

ete

cto

r(o

bse

rva

tio

n):

�S

imp

lifi

ed

Ka

lma

nfi

lte

r:Tra

ckin

gH

ou

se R

esi

de

nt

Ass

oci

ati

on

Me

eti

ng

Tesseract

6000

Recognized Characters

FR

ON

T-E

ND

O t

P tt

ff

f)

1(α

α−

+←

�Le

ft a

nd

rig

ht

¾ p

ose

cla

ssif

iers

Ed

ge

his

tog

ram

de

scri

pto

r

2.

Po

se�

Sim

pli

fie

d K

alm

an

filt

er:

4000

6000

Number Recognized Characters

FR

ON

T-E

ND

tt

tf

ff

)1(

αα

−+

15

38

3

15

38

5

15

38

7

15

37

1

15

40

9

15

35

5

�E

dg

e h

isto

gra

m d

esc

rip

tor

�S

VM

R

BF

ke

rne

lFa

ce T

rack

s

2000

4000

Number Characters

6.

Fin

al

Bro

wse

r in

terf

ace

15

38

3

15

38

5

15

38

7

15

37

1

15

40

9

15

35

5

�S

VM

R

BF

ke

rne

l

�Fa

ceTr

ace

rd

ata

set

Face

Tra

cks

2000

Number

6.

Fin

al

Bro

wse

r in

terf

ace

Tra

inin

g S

et

(le

ft ¾

, fr

on

t,

rig

ht

¾)

~1

0K

im

ag

es

Home

Search

Explore Collections

Visual Search

Login

or sign up

0

Recognition Method

Tra

inin

g S

et

(le

ft ¾

, fr

on

t,

rig

ht

¾)

~1

0K

im

ag

es

Test

Se

t ~

12

K i

ma

ge

s

Av

era

ge

Te

st A

ccu

racy

81

.5%

�S

ele

ctio

n o

f fa

ces

to m

atc

h

Se

arc

h

Home

Search

Explore Collections

Visual Search

Login

or sign up

Nu

mb

er

N

um

be

r P

reci

sio

nR

eca

ll

Recognition Method

Av

era

ge

Te

st A

ccu

racy

81

.5%

�S

ele

ctio

n o

f fa

ces

to m

atc

h

�LB

P d

esc

rip

tor

+ S

q.

L2

dis

tan

ceTra

cks

Se

arc

h

People Index

Graphics Index

Search Tips

Phone + P05 + G08

Tag

Nu

mb

er

GT

Wo

rds

Nu

mb

er

Re

c. W

ord

sP

reci

sio

nR

eca

ll3

Sk

in R

ati

oM

atc

hin

gPhone + P05 + G08

Tag

People Index

Graphics Index

22

76

11

26

0.4

95

0.6

65

Un

iqu

e S

pe

ake

rs

area

skinPixels

skinRatio

#=

>

185

.1

RU

niq

ue

Sp

ea

kers

Face

Tra

cks

5.

Se

ma

nti

c S

ho

t R

ep

rese

nta

tio

nE

nh

an

ced

Fe

atu

re B

ase

d M

osa

ic

area

skinRatio

=

>

>

⇔=

107

.0

185

.1

skin

Pixel

RBGR

Face

Tra

cks

5.

Se

ma

nti

c S

ho

t R

ep

rese

nta

tio

nE

nh

an

ced

Fe

atu

re B

ase

d M

osa

ic

>>+

+⇔

=

112

.0

107

.0

)(

skin

Pixel

2

RG

BG

R

RB

Face

In

de

x

Resolution 10 secs

>

++

112

.0

)(

2B

GR

RG

�S

ele

ct“b

est

fa

ces”

to

pre

sen

t t

o e

nd

use

r

Face

In

de

x

Ge

ne

rati

on

�P

TZ

Est

ima

tio

n

�S

IFT

+ R

AN

SA

C o

n

min max

Resolution 10 secs

Click on an icon to find the graphic in the video

pre

sen

t t

o e

nd

use

rG

en

era

tio

n�

SIF

T +

R

AN

SA

C o

n

key

fra

me

sskinRatio

wresolution

wpose

wQ

⋅+

⋅+

⋅=

32

1

35

0O

verl

ay R

eco

gn

ize

d T

ext

min max

Tagline

Click on an icon to find the graphic in the video

skinRatio

wresolution

wpose

wQ

⋅+

⋅+

⋅=

32

1

Av

era

ge

Tra

ck M

atc

hin

g T

ime

(se

cs)

33

5

30

0

35

0O

verl

ay R

eco

gn

ize

d T

ext

Tagline Frames

Test

on

3

33

5T

rack

Ma

tch

ing

Fa

ce S

ele

ctio

n

Left

/rig

ht3

4 E

xtra

ctio

n

20

0

25

0P

rob

lem

Sta

tem

en

t

•E

colo

gic

al

Imp

act

Te

xt1

Te

xt3

Te

xt4

Te

xt5

Te

xt7

Te

xt8

Te

xt1

0

Frames

51

ou

t o

f 5

8 w

ith

He

ad

& s

ho

uld

er,

¾ p

rofi

le v

iew

Test

on

3

stu

de

nt

Left

/rig

ht3

4 E

xtra

ctio

n

Sk

in-R

es

Ext

ract

ion

K-M

ea

ns

Co

mp

uta

tio

n

15

0

20

0•

Wa

ste

go

es

to L

an

dfi

lls

•E

ne

rgy

So

urc

e

•C

ost

Eff

icie

ncy

•W

ast

e D

isp

osa

l B

ill

Te

xt1

Te

xt3

Te

xt4

Te

xt5

Te

xt7

Te

xt8

Te

xt1

0

Text

Problem

Phone

51

ou

t o

f 5

8 w

ith

He

ad

& s

ho

uld

er,

¾ p

rofi

le v

iew

stu

de

nt

pre

sen

ta-

K-M

ea

ns

Co

mp

uta

tio

n

50

10

0

15

0•

Ele

ctri

cal

Bil

l

•M

s W

ilso

n i

s lo

ok

ing

fo

r a

n e

co-f

rie

nd

ly,

cost

eff

icie

nt,

an

d e

asy

to

use

pro

du

ct t

ha

t

wil

l co

nve

rt h

er

soli

d w

ast

e i

nto

usa

ble

en

erg

y

�S

eg

me

nt

vid

eo

in

to s

em

an

tica

lly

dis

tin

ct s

ho

ts b

ase

d

on

sli

de

s

People

pre

sen

ta-

tio

nv

ide

os,

0

50

en

erg

y

En

ha

nce

Gra

ph

ics

on

sli

de

s

�C

ha

ng

es

in t

ext

use

d t

o a

sse

ss s

lid

e c

ha

ng

es

Graphics

45

min

ute

s

ea

ch

20

1

9

K-M

ea

ns(

10

0)

sele

ct (

10

0)

min

-min

0

12

3E

nh

an

ce G

rap

hic

s�

Ch

an

ge

s in

te

xt u

sed

to

ass

ess

sli

de

ch

an

ge

s

Graphics

ea

chK

-Me

an

s(1

00

)se

lect

(1

00

)m

in-m

in