Top Banner
Semantics at the multimedia fragment level or how enabling the remixing of online media Raphaël Troncy <[email protected] >
63

Semantics at the multimedia fragment level SSSW 2013

May 10, 2015

Download

Technology

Raphael Troncy

"Semantics at the multimedia fragment level or how enabling the remixing of online media" - Invited Talk given at the Semantic Web Summer School (SSSW), 12 July 2013
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Semantics at the multimedia fragment level SSSW 2013

Semantics at the multimedia fragment level or how enabling

the remixing of online media Raphaël Troncy <[email protected]>

Page 2: Semantics at the multimedia fragment level SSSW 2013

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 2

Page 3: Semantics at the multimedia fragment level SSSW 2013

Once upon a time …

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 3

Page 4: Semantics at the multimedia fragment level SSSW 2013

… leading to sharing Media Fragments

Publishing status message containing a Media Fragment URI Use a ‘#’ ! Highlight a

video sequence

Highlight a region to pay attention to

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 4

Page 5: Semantics at the multimedia fragment level SSSW 2013

What are Media Fragments?

t 0 20 35 temporal media fragment

spatial media fragment

track media fragment

11/07/2013 - - 5 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 6: Semantics at the multimedia fragment level SSSW 2013

Media Fragments (temporal)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 6

Fragment beginning Fragment end Playback progress

Original resource length

Page 7: Semantics at the multimedia fragment level SSSW 2013

Media Fragments (spatial) + Demo

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 7

semi-opaque overlay

highlighted fragment

Page 8: Semantics at the multimedia fragment level SSSW 2013

Media Fragments URIs

Bookmark / Share parts (fragments) of audio/video content

Annotate media fragments

Search for media fragments

Mash-ups

Conserve bandwidth

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 8

http://www.w3.org/TR/media-frags-reqs/ http://www.w3.org/TR/media-frags/

Page 9: Semantics at the multimedia fragment level SSSW 2013

Video annotation

11/07/2013 - - 9 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 10: Semantics at the multimedia fragment level SSSW 2013

Video interactivity

Cubism Expressionism

Fauvism

FACETS / PROPERTIES OF CONCEPT

CONCEPT IN PLAYER

CONTENT ENRICHMENT

11/07/2013 - - 10 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 11: Semantics at the multimedia fragment level SSSW 2013

Video Accessibility

What is required to make video accessible on the Web?

Technologies: Annotating: automatic (speech transcription) and manual (social

collaborative annotation tool) Addressing: pointing to, retrieving, transmitting only parts of media Rendering: video visualization for the impaired, Braille output

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 11

Benchmarking: Sphinx, HTK, Julius

Page 12: Semantics at the multimedia fragment level SSSW 2013

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 12

Page 13: Semantics at the multimedia fragment level SSSW 2013

Semantic indexing at the fragment level

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 13

Benchmarking: Sphinx, HTK, Julius

NER on subtitle blocks

Interlinking with the Linked Data Cloud to enable semantic search

Page 14: Semantics at the multimedia fragment level SSSW 2013

What is a Named Entity recognition task?

A task that aims to locate and classify the name of a person or an organization, a location, a brand, a product, a numeric expression including time, date, money and percent in a textual document

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 14

Page 15: Semantics at the multimedia fragment level SSSW 2013

NER Tools and Web APIs

Standalone software GATE Stanford CoreNLP Temis

Web APIs

http://nerd.eurecom.fr/

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 15

Page 16: Semantics at the multimedia fragment level SSSW 2013

Compare performances of NER and NEL tools Understand strengths and weaknesses of different Web APIs Adapt NER processing to different context

(Learn how to) Combine NER (/ NEL) tools

Participate in various benchmarks

NERD: Named Entity Recognition and Disambiguation

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 16

Page 17: Semantics at the multimedia fragment level SSSW 2013

What is NERD? REST API2 ontology1

UI3

1 http://nerd.eurecom.fr/ontology 2 http://nerd.eurecom.fr/api/application.wadl

3 http://nerd.eurecom.fr

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 17

Page 18: Semantics at the multimedia fragment level SSSW 2013

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 18/15

Alchemy API

DBpedia Spotlight

Evri Extractiv Lupedia Open Calais

Saplo Wikimeta Yahoo! Zemanta

Language EN,FR, GR,IT, PT,RU, SP,SW

EN GR* PT* SP*

EN,IT

EN EN,FR, IT

EN,FR SP

EN, SW

EN,FR SP

EN EN

Granularity OEN OEN OED OEN OEN OEN OED OEN OEN OED

Entity position

N/A char offset

N/A word offset

range of chars

char offset

N/A POS offset

range of

chars

N/A

Classification schema

Alchemy DBpedia FreeBase Scema.or

g

Evri DBpedia DBpedia LinkedM

DB

Open Calais

N/A ESTER

Yahoo FreeBase

Number of classes

324 320 5 34 319 95 5 7 13 81

Response Format

JSON MicroF XML RDF

HTML JSON RDF XML

HTML

JSON

RDF

HTML JSON RDF XML

HTML JSON RDFa XML

JSON MicroFormat

JSON JSON XML

JSON XML

XML JSON RDF

Quota (calls/day)

30000 unl 3000

3000 unl 50000 1333 unl 5000 10000

Factual comparison of 10 Web NER tools

Page 19: Semantics at the multimedia fragment level SSSW 2013

Aligned the taxonomies used by the extractors

NERD Ontology

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 19

Page 20: Semantics at the multimedia fragment level SSSW 2013

NERD type Occurrence

Person 10

Organization 10

Country 6

Company 6

Location 6

Continent 5

City 5

RadioStation 5

Album 5

Product 5

... ...

Building the NERD Ontology

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 20

Page 21: Semantics at the multimedia fragment level SSSW 2013

NERD REST API

GET, POST, PUT,

DELETE

/document /user /annotation/{extractor} /extraction /evaluation ...

JSON

“entities” : [{ “entity”: “Tim Berners-Lee” , “type”: “Person” , “uri”: "http://dbpedia.org/resource/Tim_berners_lee", “nerdType”: "http://nerd.eurecom.fr/ontology#Person", “startChar”: 30, “endChar”: 45, “confidence”: 1, “relevance”: 0.5 }]

Rizzo G., Troncy R. (2012), NERD: A Framework for Unifying Named Entity Recognition and Disambiguation Web Extraction Tools. In: European chapter of the Association for Computational Linguistics (EACL'12), Avignon, France.

RDF

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 21

Page 22: Semantics at the multimedia fragment level SSSW 2013

NERD meets NIF

Model documents through a set of strings deferencable on the Web

: offset_23107_ 23110 a str:String ; str:referenceContext :offset_0_26546 .

: offset_23107_ 23110 sso:oen dbpedia:W3C.

dbpedia:W3C rdf:type nerd:Organization .

Map string to entity

Classification

Rizzo G, Troncy R., Hellmann S. and Bruemmer M. (2012), NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud. In: (LDOW'12) Linked Data on the Web (WWW'12), Lyon, France.

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 22

Page 23: Semantics at the multimedia fragment level SSSW 2013

NERD User Dashboard

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 23

Page 24: Semantics at the multimedia fragment level SSSW 2013

NERD User Interface

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 24

Page 25: Semantics at the multimedia fragment level SSSW 2013

History of NER benchmarks CoNLL 2003 and CoNLL 2005

schema (4 types): person, organization, location and miscellaneous

ACE 2004, ACE 2005 and ACE 2007 schema (7 types): person, organization, location, facility, weapon,

vehicle and geo-political entity entity recognition, co-ref, find relationships among entities extracted

TAC 2009 (Knowledge Base Track) schema (3 types): person, organization and location create a knowledge base from the named entities extracted

ETAPE 2012 (Named Entity Task) schema: Quaero (7 main types, 32 sub-types)

MSM 2013: tweet corpus ! schema (4 types): person, organization, location, miscellaneous

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 25

Page 26: Semantics at the multimedia fragment level SSSW 2013

ETAPE 2012 challenge

genre train dev test sources

TV news 7h 40m 1h 40m 1h 40m BFM Story, Top QUestions (LCP)

TV debates 10h 30m 5h 10m 5h 10m Pile et Face, Ca vous regarde, Entre les lignes (LCP)

TV amusements - 1h 05m 1h 05m La place du village (TV8)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 26

Train Dev Eval Item length 26h 10h 55m 10h 55m Nb files 44 15 15 Nb words 290517 91656 115511 Nb Named Entities 46763 14398 13055 Nb unique categories 33 33 33

Page 27: Semantics at the multimedia fragment level SSSW 2013

NERD @ ETAPE (naïve combined strategy)

(eA1,tA1,URIA1,siA1,eiA1) ... ... ...

`

(eA2,tA2,URIA2,siA2,eiA2) (eA3,tA3,URIA3,siA3,eiA3)

(eN2,tN2,URIN2,siN2,eiN2) (eN1,tN1,URIN1,siN1,eiN1)

extraction

cleaning

fusion When at least 2 extractors classify the same entity with a different type then we apply a preferred selection order

(empirically defined): Wikimeta, AlchemyAPI, OpenCalais, Lupedia

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 27

Page 28: Semantics at the multimedia fragment level SSSW 2013

Participation at ETAPE (combined+ strategy)

(eA1,tA1,URIA1,siA1,eA1)

`

(eA2,tA2,URIA2,siA2,eiA2)

(eN2,tN2,URIN2,sN2,eN2) (eN1,tN1,URIN1,sN1,eN1)

...

ETAPE Train & Dev

Learned model

Created static rules

fusion Conflicts handled by

priority selection: own, Wikimeta,AlchemyAPI,OpenCalais,Lupedia

POS tagger

Apply rules

(e1,t1,URI1,si1,ei1)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 28

Page 29: Semantics at the multimedia fragment level SSSW 2013

NERD Global results

SLR Precision Recall F-measure %correct

combined 86.85% 35.31% 17.69% 23.44% 17.69%

combined+ 188.81% 15.13% 28.40% 19.45% 28.40%

Combined+ : Eval corpus differs substantially from the Train & Dev corpora. The static rules do not fit well the Eval corpora and they introduce classification noise.

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 29

Page 30: Semantics at the multimedia fragment level SSSW 2013

Per-extractor results SLR Precision Recall F-measure %correct

alchemyapi 37.71% 47.95% 5.45% 9.68% 5.45%

lupedia 39.49% 22.87% 1.56% 2.91% 1.56%

opencalais 37.47% 41.69% 3.53% 6.49% 3.53%

wikimeta 36.67% 19.40% 4.25% 6.95% 4.25%

combined (nerd)

86.85% 35.31% 17.69% 23.44% 17.69%

combined+ (nerd+)

188.81% 15.13% 28.40% 19.45% 28.40%

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 30

Page 31: Semantics at the multimedia fragment level SSSW 2013

- 31 11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 32: Semantics at the multimedia fragment level SSSW 2013

Learning How to Combine NER Extractors

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 32

Page 33: Semantics at the multimedia fragment level SSSW 2013

NERD on CoNLL 2003 (NER task)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 33

Page 34: Semantics at the multimedia fragment level SSSW 2013

NERD on MSM 2013 (NER task)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 34

Page 35: Semantics at the multimedia fragment level SSSW 2013

NERD on MSM 2013 (NEL task)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 35

Page 36: Semantics at the multimedia fragment level SSSW 2013

Media Fragment Enricher: http://mfe.synote.org/mfe/

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 36

Page 37: Semantics at the multimedia fragment level SSSW 2013

Linking pieces of knowledge

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 37

Page 38: Semantics at the multimedia fragment level SSSW 2013

Linking pieces of knowledge

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 38

Page 39: Semantics at the multimedia fragment level SSSW 2013

Named Entities for Video Classification

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 39

Page 40: Semantics at the multimedia fragment level SSSW 2013

Workflow

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 40

Media Fragment Enricher Services

Media Fragment Enricher UI

Metadata & timed-text

NERD Client RDFizator Triple Store

Categori-zation

Video and metadata preview

Video replay with subtitles and aligned NEs

1: Video URL

2: Metadata

3: meta-data 4:NERDify

5:Timed Text 6: NEs with time

alignment (json)

7: RDFize (ttl)

8: Generate Category

9: SPARQL query

Page 41: Semantics at the multimedia fragment level SSSW 2013

Channel signature based on NE distribution

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 41

Page 42: Semantics at the multimedia fragment level SSSW 2013

Media Collector

Composition of media item extractors (12 SNs) Rely on search APIs + a fix 30s timeout window to provide results Fallback on screen scraping when necessary (Twitter ecosystem)

Implemented as a NodeJS server

Serialize results in a common schema (JSON)

Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 11/07/2013 - - 42

Page 43: Semantics at the multimedia fragment level SSSW 2013

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 43

Deep link Permalink

Clean text for NLP processing

Aggregate view of ALL social interactions

12 Social Networks

Page 44: Semantics at the multimedia fragment level SSSW 2013

Media Finder (www2013)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 44

Page 45: Semantics at the multimedia fragment level SSSW 2013

Media Finder (zooming on media items)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 45

Page 46: Semantics at the multimedia fragment level SSSW 2013

Media Finder (timeline view)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 46

Page 47: Semantics at the multimedia fragment level SSSW 2013

Media Finder Architecture

Media items harvesting using the Media Server http://eventmedia.eurecom.fr/media-

server/search/{combined}/{term} https://github.com/vuknje/media-server (@tomayac fork)

Image near de-duplication DCT signature on image and video frame,

Hamming distance between image pairs

Clustering and disambiguation Named Entity Extraction using NERD Topic Generation using LDA

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 47

Page 48: Semantics at the multimedia fragment level SSSW 2013

Media Finder (named entities clustering)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 48

Page 49: Semantics at the multimedia fragment level SSSW 2013

Media Finder (zooming in a cluster)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 49

Page 50: Semantics at the multimedia fragment level SSSW 2013

Media Finder: http://mediafinder.eurecom.fr/

Live Topic Generation from Event Streams WWW 2013 Demo Session http://www.youtube.com/watch?v=8iRiwz7cDYY

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 50

Page 51: Semantics at the multimedia fragment level SSSW 2013

Tracking an event: Italian Election

Repeated queries over a period of time We have tracked and analyzed media posts tagged as

elezioni2013 from 2013-02-26 to 2013-03-03 Cron job: every 30 minutes over the 6 days Slice the data in 24 hours slots

Research questions: Can we re-create the news headlines?

Storyboarding: http://mediafinder.eurecom.fr/story/elezioni2013

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 51

Page 52: Semantics at the multimedia fragment level SSSW 2013

Tracking an event: Italian Election

Dataset: ~16501 microposts containing (duplicate) media items ~21087 Named Entities extracted

Clustering NER and LDA Generate Bag of Entities (BOE) disambiguated with a

DBpedia URI

Examples: Monti, Bersani, Italia, Berlusconi, Grillo, Stelle

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 52

Page 53: Semantics at the multimedia fragment level SSSW 2013

Tracking an event: Italian Election

Tracking and Analyzing The 2013 Italian Election ESWC 2013 Demo Session http://www.youtube.com/watch?v=jIMdnwMoWnk

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 53

Page 54: Semantics at the multimedia fragment level SSSW 2013

Multimedia and Semantic Web

Different Ecosystems: Local identifiers Specific metadata formats

Huge amount of Multimedia Content

Low number of links between content

11/07/2013 - - 54 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 55: Semantics at the multimedia fragment level SSSW 2013

Multimedia and Semantic Web

Universal Identifiers: URI’s

Common description formats

Easy interlinking between content

11/07/2013 - - 55 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 56: Semantics at the multimedia fragment level SSSW 2013

Media Fragments

nerd:Location Cafe Rick

Nerd:Person H. Bogart

Nerd:Person I. Bergman

nerd:Location Casablanca

Media Fragment URI 1.0 Chapters Scenes Shots etc…

http://data.linkedtv.eu/media/e2899e7f#t=14,15

LinkedTV Ontology

11/07/2013 - - 56 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 57: Semantics at the multimedia fragment level SSSW 2013

Hypervideo

nerd:Location Cafe Rick

Nerd:Person H. Bogart

Nerd:Person I. Bergman

nerd:Location Casablanca

Nerd:Person E. Tierney

nerd:Location China

11/07/2013 - - 57 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 58: Semantics at the multimedia fragment level SSSW 2013

Web + TV experience

http://www.youtube.com/watch?v=4mSC685AG7k

11/07/2013 - - 58 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013

Page 59: Semantics at the multimedia fragment level SSSW 2013

Research Vision (context)

Knowledge Graphs everywhere Google Knowledge Graph, Microsoft Entity Graph,

Yahoo! Web of Things, Wikidata Open Data, Structured Data, Linked Data

The rise of social media Events happen all the time and are the topic of social network

conversations, also in form of event-related multimedia data Videos and photos are (re-)shared on multiple social networks Events can be

planned or unplanned

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 59

(Read the background story http://www.washingtonpost.com/about-those-2005-and-2013-photos-of-the-crowds-in-st-peters-square)

Page 60: Semantics at the multimedia fragment level SSSW 2013

Research Vision (opportunity)

Video is a first class citizen on the Web Annotations: Ontology and API for Media Resources Access: Media Fragments URI NERD platform for extracting key information from

learning resources including videos

The Linked Media vision Extracting semantic knowledge from social media Collect, enrich and visualize media memes shared by

the crowd Generate visual stories about what is happening in the

world (summarization)

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 60

Page 61: Semantics at the multimedia fragment level SSSW 2013

Winter School: http://winterschool.mediamixer.eu/

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 61

Page 62: Semantics at the multimedia fragment level SSSW 2013

Credits

Giuseppe Rizzo, Vuk Milicic, José Luis Redondo Garcia (EURECOM)

Thomas Steiner (Google Inc.)

Marieke van Erp (Free University of Amsterdam)

Yunjia Li (University of Southampton)

… and many other students

11/07/2013 - Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013 - 62

Page 63: Semantics at the multimedia fragment level SSSW 2013

http://www.slideshare.net/troncy

11/07/2013 - - 63 Semantics at the multimedia fragment level - SSSW, Cercedilla, July 2013