Top Banner
Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie
44

Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Dec 18, 2015

Download

Documents

Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Contextual Wisdom

Social Relations and Correlations for Multimedia Event Annotation

Amit Zunjarwad, Hari Sundaram and Lexing Xie

Page 2: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

I don’t want to spend time annotating :( help!

April 18, 2023@NUS 2

Page 3: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Talk Outline

Observations

Events

Generalization:Sum of Partial Observations

Similarity, Co-Occurrence and Trust

@I2R April 18, 2023 3

Experiments:compare against SVM

Conclusions

Page 4: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

An Annotation Puzzle

@NUS April 18, 2023 4

Page 5: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

April 18, 2023 5@NUS

Observing Flickr Data

Page 6: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The pool statistics reveal a power law distribution• Less than 11% of the tags have more than 10 photos• There are not enough instances to learn most of the

concepts! The global flickr pool is interesting:

April 18, 2023 6@NUS

Learnability

Page 7: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

April 18, 2023 7@NUS

Learnability

Page 8: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The pool statistics reveal a power law distribution• Less than 11% of the photos have more than 10

instances• There are not enough instances to learn most of the

concepts! The global flickr pool is interesting: • Most of the tags have over 100 instances• Photos reveal very high visual diversity

The Power law is a fundamental property of online networks – cannot be wished away.

April 18, 2023 8@NUS

Learnability

Page 9: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Singapore People Walking Orchard rd. After MRT Experimenting Walking Day Outdoor..

April 18, 2023 9@NUS

Scalability

Page 10: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The assumption of consensual semantics Search for “yamagata”

April 18, 2023 10@NUS

The Role of context

Page 11: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

April 18, 2023@NUS 11

What if the answer didn’t completely lie in the pixels?

Page 12: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

EventsWhat are they?

April 18, 2023@NUS 12

Page 13: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

An event refers to a real-world occurrence, spread over space and time.

Observations form event meta data [Westermann / Jain 2007]• Images / text / sounds describe events

April 18, 2023 13@NUS

Defining Events

when

where

who

what

author

image

Page 14: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Event context refers to the set of attributes that help in understanding the semantics • Images / Who / Where / When / What / Why / How

Context is always application dependent • Ubiquitous computing community – location, identity

and time are main considerations

April 18, 2023 14@NUS

Context

[Mani and Sundaram 2007]

Page 15: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Event archival – events involve people, places and artifacts

Exploit different forms of knowledge: • (Global) Similarity – media, events, people. • (Personal) Co-occurrence – what are the joint statistics

of occurrence?• (Social) Trust – determining whom to trust for effective

annotation?

April 18, 2023 15@NUS

Four Problems

Page 16: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

SimilarityGlobal, Systemic knowledge

April 18, 2023@NUS 16

Page 17: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

A bottom up approach • Edge, color and texture histograms for images. • Rely on ConceptNet for text tags

Why ConceptNet and not WordNet?• Expands on pure lexical terms, to compound terms –

“buy food”• Expands on number of relations – from three to twenty• Contains practical knowledge – we can infer that a

student is near a library.

April 18, 2023 17@NUS

Event similarity

Page 18: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

ConceptNet provides three functions:• GetContext(node): the neighborhood of the concept “book” includes “knowledge”, “library”

• GetAnalogousConcepts(node): concepts that share incoming relations; analogous concepts for the concept “people” are “human”, “person”, “man”

• FindPathsBetweenNodes(node1,node2) – returns a set of paths.

Our similarity measure is built using these functions.

April 18, 2023 18@NUS

A base similarity measure

Page 19: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The similarity between two concepts (e,f) is defined as follows:

We current use a uniform weighting on all three as the composite measure

April 18, 2023 19@NUS

Concept similarity

fe

fCeC

eA fA

context

analogous

path based1

1 1( , )

N

pi i

s e fN h

Page 20: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The distance between two concept sets is a modified Haussdorf similarity.

April 18, 2023 20@NUS

Computing similarity between sets

A

B

| |

1

1( , | ) max{ ( , )}

| |

A

H k ii

k

S A B m m a bA

Page 21: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Similarity between facets are computed using a weighted sum of frequency and the concept similarity measure:

Time distance is based on text tags, not actual time data – allows for temporal descriptions as “summer”, “holidays” etc.

Only frequency is used for “who” facet.April 18, 2023 21@NUS

Facet similarity (4w)

1 21 2 1 2

2

1 | |( , ) , | ,

2 | | H

L Ls L L S L L CS

L

Page 22: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Color, texture and edges are computed• 166 bin HSV color histogram• 71 bin edge histogram• 3 texture features

Euclidean distance on the composite feature vector.

The distance between two events is then a weighted sum of distances across all event facets.

April 18, 2023 22@NUS

Image facet similarity

Page 23: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

April 18, 2023 23@NUS

The global similarity matrix Ms

Page 24: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Co-occurrencePersonal, statistical knowledge

April 18, 2023@NUS 24

Page 25: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The concept co-occurrences are just frequency counts.

(i= fun , j = new york) then the index (i,j) contains the number of occurrences of this tuple.

Notes:• Each concept is given a globally unique index• Co-occurrence matrixes are locally compact

Each user k, has a co-occurrence matrix Mck

associated with the user.

April 18, 2023 25@NUS

Statistics are computed per person

Page 26: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

TrustPeople we like

April 18, 2023@NUS 26

Page 27: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Narrow understanding of “trust” a priori value is important Computing trust:• Compute event-event similarity

Trust propagation• Biased PageRank algorithm

• Trust vectors are row normalized

April 18, 2023 27@NUS

Activity based trust

activity matrix apriori

(1 ) k t = A t + p

Page 28: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The recommendation algorithm

April 18, 2023@NUS 28

Page 29: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The framework is event centric We know:

How to combine the three?

April 18, 2023 29@NUS

A review of what we know

similarity co-occurrence trust vectors

global personal social

Page 30: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

1. Compute the social network trust vector (t) for the current user.

2. Compute the trusted, global co-occurrence matrix, for all tuples.

3. Iterate:

April 18, 2023 30@NUS

details

1

( , ) ( ) ( , ),N

k kc c

i

a b t i a b

M M

who where whatwhen image event

query

,

,c

s

y M x q

x M y q

Page 31: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Experiments

April 18, 2023@NUS 31

Page 32: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Developed and event based archival system

8 graduate students 58 events, 250 images,

over two weeks SVM – baseline

comparison Two cases• Uniform trust (global)• Personal trust

April 18, 2023 32@NUS

Details

Page 33: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Training is difficult – very small pool.• Modified bagging strategy • Train five symmetric classifiers• Pick one which maximizes the F-score

April 18, 2023 33@NUS

SVM training

Page 34: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Global Case:• 31 classifiers (who:8, when: 6, where: 10, what: 7)• Minimum number of images: 10 • Tested on 50 images (why?)

April 18, 2023 34@NUS

Uniform trust

Facets SVM CM (uniform)

H M X U H M

Who 13 23 5 9 22 28

When 11 20 6 13 24 26

Where 12 19 3 16 23 27

What 13 21 8 8 31 19

Event 10 12 22 6 22 28

H Hits

M Misses

X Unknown

U Undecidable

Page 35: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Trained classifiers per person• Very small pool• Min images – 5• 28 classifiers (who:9, when: 4, where: 6, what: 9)

April 18, 2023 35@NUS

Personal Network

Facets SVM CM (network)

H M X U H M

Who 45 81 62 62 183 67

When 51 96 73 30 167 83

Where 62 76 59 53 179 71

What 72 89 23 66 204 46

Events 0 0 250 0 153 97

H Hits

M Misses

X Unknown

U Undecidable

Page 36: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

April 18, 2023 36@NUS

Positive examplesSVM ‘sky diving’

Social Network based method ‘fun’

Page 37: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

The Sum of Partial ObservationsBeyond web 2.0 hype

April 18, 2023@NUS 37

Page 38: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Which media object summarizes “my trip to Singapore?”

April 18, 2023 38@NUS

Experiential fragments

Page 39: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

April 18, 2023 39@NUS

A reconsideration of a traditional idea

Page 40: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

@NUS

The Creation of participatory knowledge

April 18, 2023 40

Page 41: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Conclusions

April 18, 2023@NUS 41

Page 42: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

An event based annotation system• Media are event meta-data• Issues: learnability, scalability, context

Employ three kinds of knowledge• Global – conceptnet, image similarity• Personal – statistical co-occurrence • Social – trust

Recommendations• Employ iterative schemes (HITS / PageRank)

Results:• Outperform SVM in small pools

April 18, 2023 42@NUS

summary

Page 43: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Power law tag distribution• Data pool will remain small for most tags• Fundamental issue

Participatory knowledge is powerful – trust within context is important issue.

Future work: • Careful math analysis of coupling equations• Event structure / relationships need to be incorporated • Multi-source (email / Calendar / IM / blogs)

integration.

April 18, 2023 43@NUS

Conclusions

Page 44: Contextual Wisdom Social Relations and Correlations for Multimedia Event Annotation Amit Zunjarwad, Hari Sundaram and Lexing Xie.

Thanks!Esp. Dick Bulterman, Mohan

April 18, 2023@NUS 44