Overview of the MultiLing Pilot in TAC 2011
George Giannakopoulos
NCSR Demokritos, Greece
[email protected]
November 2011

Outline

1 Introduction

2 MultiLing Pilot

3 The Results

4 Conclusion


Multilinguality

News

Blogs

Search results

Automatic translation


Brief history of DUC/TAC domains

Single document summarization

Multi-document summarization (Update, Guided, Opinion, ...)

Cross-lingual summarization

Something appears to be missing...


The missing piece: MultiLing

Create summaries regardless of the underlying language, on document sets that each use a single (possibly unknown) language.


MultiLing aim

Detect multilingual multi-document summarization (MMS) research

Learn about MMS algorithms

Learn about multilingual reusable resources

Quantify performance

Check existing automatic measures


2 MultiLing Pilot

Task Details, Corpus creation, Evaluating summaries


Task definition

Generate a single, fluent, representative summary

from a set of documents describing an event sequence

the language of each document set is one of a given set of languages

output summary should be (240-)250 words

An event sequence

...is a set of atomic (self-sufficient) event descriptions, sequenced in time, that share main actors, location of occurrence or some other important factor. Event sequences may refer to topics such as a natural disaster, a crime investigation, a set of negotiations focused on a single political issue, or a sports event.


Dataset

Human created

Multi-lingual

News

Freely available

Containing event sequences

Plain text

Solution

WikiNews (http://www.wikinews.org)

Translation

Preprocessing


Mini-pilot for effort estimation

Small scale corpus (2 topics)

Everything was timed

Questions would be noted

Lesson

Always do a mini-pilot, note everything, do follow-up meetings.


Overview of full corpus creation

Determine topics (10 topics / language)

Translate documents (10 docs / topic)

Produce model summaries (3 models / topic)


Determine topics

Use metadata (WikiNews categories)

Verify existence of event sequence

Cover several different news types (e.g., politics, environment, sports)

Find at least 10 documents per topic


Translate documents

Sentence alignment

Keep original meaning

Produce readable, fluent text

Translation verified

Lesson

Difficult, error-prone, subjective, high cost process.


Summarizing

3 summarizers per topic and language

Keep human subjectivity related to important aspects

Use the minimum possible guidelines

Self-sufficient, clearly written text...providing no external information...fluent, easily readable language

Lesson

Few guidelines are better than a lot.


Types of evaluation

Automatic (ROUGE, AutoSummENG)

Manual (Overall Responsiveness)


Automatic Methods

ROUGE (ROUGE-1, 2, SU-4): word n-gram matching, allows gaps

AutoSummENG, Merged Model Graph (MeMoG): character n-gram co-occurrence, merged representation

Not (too) strongly correlated. Possibly describing slightly different aspects.
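The word n-gram matching behind ROUGE-N can be sketched roughly as follows. This is a simplified illustration and not the official ROUGE toolkit: the whitespace tokenization, the lowercasing and the function names `word_ngrams` and `rouge_n_recall` are assumptions made here, and the gap-allowing skip-bigram variant (SU-4) is not covered.

```python
from collections import Counter


def word_ngrams(text, n):
    """Count word n-grams using naive whitespace tokenization (an assumption)."""
    tokens = text.lower().split()
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, references, n=2):
    """Clipped n-gram recall of a candidate summary against model summaries."""
    cand_counts = word_ngrams(candidate, n)
    matched = 0
    total = 0
    for reference in references:
        ref_counts = word_ngrams(reference, n)
        # Each reference n-gram can be matched at most as often as it
        # occurs in the candidate (clipping).
        matched += sum(min(count, cand_counts[gram])
                       for gram, count in ref_counts.items())
        total += sum(ref_counts.values())
    return matched / total if total else 0.0


score = rouge_n_recall("the cat sat on the mat", ["the cat lay on the mat"], n=2)
```

In this toy case three of the five reference bigrams also occur in the candidate, so the recall is 0.6.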


Manual Evaluation Guidelines

Read source documents at least once

Give a grade between 1 and 5 (Overall Responsiveness: OR)

Content and fluency equally important


Guidelines continued

We consider a text to be worth a 5, if it appears to cover all the important aspects of the corresponding document set using fluent, readable language. A text should be assigned a 1, if it is either unreadable, nonsensical, or contains only trivial information from the document set.


3 The Results

Participation, System Evaluation, Performance, Automatic Evaluation


Overview

Original aim: 3 groups per language

Achieved: 8+1 groups

Original aim: 5 languages

Achieved: 7 languages


Baseline — Topline

global baseline system (ID9): vector space, bag-of-words, highest cosine similarity to the centroid of documents.

global topline system (ID10): uses the model summaries, produces random summaries by combining sentences, finds the one closest to the Merged Model Graph of the models.
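A centroid baseline of the kind described for ID9 might look roughly like the sketch below. The details are assumptions made for illustration, not the pilot's actual implementation: raw term counts, whitespace tokenization, sentence-level selection, a greedy fill up to the 250-word limit, and the names `bag_of_words`, `cosine` and `centroid_summary` are all hypothetical.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Raw term-frequency vector with naive whitespace tokens (an assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def centroid_summary(documents, max_words=250):
    """Greedily pick the sentences closest to the centroid of the documents.

    `documents` is a list of documents, each given as a list of sentences.
    """
    centroid = Counter()
    for sentences in documents:
        # Summing instead of averaging only rescales the centroid,
        # which leaves cosine similarity unchanged.
        centroid.update(bag_of_words(" ".join(sentences)))
    candidates = [s for sentences in documents for s in sentences]
    ranked = sorted(candidates,
                    key=lambda s: cosine(bag_of_words(s), centroid),
                    reverse=True)
    summary, used = [], 0
    for sentence in ranked:
        length = len(sentence.split())
        if used + length <= max_words:
            summary.append(sentence)
            used += length
    return summary


docs = [["the flood hit the city", "cats sleep"], ["the flood damaged the city"]]
picked = centroid_summary(docs, max_words=5)
```

With the tiny word limit used here only one on-topic sentence fits, and the off-topic sentence is never selected.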


Our champions

Languages: Arabic, Czech, English, French, Greek, Hebrew, Hindi

Participant     System ID   Languages covered              Notes
CIST            ID1         7 of 7                         Peer
CLASSY          ID2         7 of 7                         Peer
JRC             ID3         7 of 7                         Co-organizer (Czech)
LIF             ID4         7 of 7                         Co-organizer (French)
SIEL IIITH      ID5         3 of 7                         Co-organizer (Hindi)
TALN UPF        ID6         4 of 7                         Peer
UBSummarizer    ID7         7 of 7                         Peer
UoEssex         ID8         2 of 7                         Co-organizer (Arabic)
Baseline        ID9         all (centroid baseline)        Co-organizer (all)
Topline         ID10        all (using model summaries)    Co-organizer (all)

Lesson

The community will respond if you take the first step.


Evaluation aims

Allow, but penalize, out-of-limit text sizes

Measure per language performance

Reward multi-lingual systems


Length-Aware Grading (LAG)

Given a summary $S$ of length $|S|$ (in words) assigned a grade $g$, a lower word limit count $l_{\min}$ and an upper word limit count $l_{\max}$:

\[
\mathrm{LAG}(g, S) = g \cdot \left(1 - \frac{\max\left(\max\left(l_{\min} - |S|,\; |S| - l_{\max}\right),\, 0\right)}{l_{\min}}\right)
\]

Example

An excellent summary (graded with OR 5) with 120 words would be assigned a LAG-OR grade of 2.5 (less than mediocre).
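As a quick check of the definition, the LAG computation can be written as a small function. This is only a sketch: the function name `lag` and its parameter names are chosen here, and the defaults of 240 and 250 words are taken from the task definition's summary length, which is the assumption under which the example's 2.5 is reproduced.

```python
def lag(grade, length, l_min=240, l_max=250):
    """Length-Aware Grading: scale a grade by how far the length is out of limits.

    `length` is the summary length in words; the default limits are an
    assumption based on the (240-)250 word task definition.
    """
    # Number of words below the lower limit or above the upper limit, else 0.
    excess = max(max(l_min - length, length - l_max), 0)
    return grade * (1 - excess / l_min)


example = lag(5, 120)  # the 120-word summary graded with OR 5
```

A summary within the limits keeps its grade unchanged, since the excess term is then zero.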


Combined Multi-lingual Performance (CMP)

$g_s(l)$ is the LAG grade of system $s$ in a given language $l$ from the full set of languages $L$:

\[
\mathrm{CMP}_s = \frac{\sum_{l \in L} g_s(l)}{|L|}
\]

Non-participation implies a LAG value of 1.

Instability

If system $s$ participated in the set $L_s$ of languages, $L_s \subseteq L$, and the standard deviation of its LAG grades in these languages is $\sigma_s$, then:

\[
\mathrm{Instability}_s = \frac{\sigma_s}{\sqrt{|L_s|}}
\]

Higher instability indicates more uncertainty about future performance.
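The two measures can be sketched in a few lines. Several choices below are assumptions rather than details given on the slide: the sample standard deviation (`statistics.stdev`) is used for $\sigma_s$, a system with a single language gets an instability of 0, and the names `cmp_score`, `instability` and `LANGUAGES` are hypothetical.

```python
import math
import statistics

LANGUAGES = ("Arabic", "Czech", "English", "French", "Greek", "Hebrew", "Hindi")


def cmp_score(lag_by_language, languages=LANGUAGES):
    """Combined Multi-lingual Performance: mean LAG grade over all languages.

    Languages the system did not participate in count as a LAG value of 1.
    """
    return sum(lag_by_language.get(lang, 1.0) for lang in languages) / len(languages)


def instability(lag_by_language):
    """Standard deviation of the LAG grades divided by sqrt(|L_s|)."""
    grades = list(lag_by_language.values())
    if len(grades) < 2:
        return 0.0  # assumption: no spread can be measured from one language
    # Assumption: sample standard deviation; the slide does not say which.
    return statistics.stdev(grades) / math.sqrt(len(grades))


# Hypothetical system that took part in two languages only.
grades = {"English": 3.0, "Hindi": 4.0}
combined = cmp_score(grades)     # (3 + 4 + 5 * 1) / 7
spread = instability(grades)     # stdev([3, 4]) / sqrt(2)
```

The example grades are invented to show the effect of the non-participation rule: five missing languages pull the combined score well below the system's per-language grades.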


Overview

System                CMP     Instability
ID1 (CIST)            2.99    0.19
ID2 (CLASSY)          2.95    0.18
ID3 (JRC)             3.13    0.18
ID4 (LIF)             1.86    0.21
ID5 (SIEL IIITH)      1.6     0.48
ID6 (TALN UPF)        1.6     0.34
ID7 (UBSummarizer)    2.41    0.19
ID8 (UoEssex)         1.63    0.78
ID9 (Baseline)        2.81    0.27
ID10 (Topline)        2.71    0.22

Table: Combined Multi-lingual Performance and Instability per System


Per Language Overview — Arabic

[Figure: two panels of per-system grades for Arabic, each on a 1 to 5 scale with SysID on the x-axis. Left panel, Overall Responsiveness: A, B, C, ID1, ID2, ID3, ID4, ID6, ID7, ID8, ID9. Right panel, LAG (systems only): ID1, ID10, ID2, ID3, ID4, ID6, ID7, ID8, ID9.]

Lesson

Model summaries may be bad summaries. How does this influence evaluation?


Overall Responsiveness — Czech, English

[Figure: two panels of Overall Responsiveness (1 to 5) per SysID. Czech panel: A, B, C, D, ID1, ID10, ID2, ID3, ID4, ID7, ID9. English panel: A, B, C, ID1, ID2, ID3, ID4, ID5, ID6, ID7, ID8, ID9.]


Overall Responsiveness — French, Greek

[Figure: two panels of Overall Responsiveness (1 to 5) per SysID. French panel: A, B, C, D, E, F, ID1, ID2, ID4, ID6, ID9. Greek panel: A, B, C, ID1, ID10, ID2, ID3, ID4, ID7, ID9.]


Overall Responsiveness — Hebrew, Hindi

[Figure: two panels of Overall Responsiveness (1 to 5) per SysID. Hebrew panel: A, B, C, ID1, ID10, ID2, ID3, ID4, ID7, ID9. Hindi panel: A, B, C, ID1, ID2, ID3, ID4, ID5, ID6, ID7, ID9.]


Summary of system performances

Systems good enough for many languages

Big variance across languages

Human grades not always stable

Human grades not always high


Correlations

Language         ROUGE2 to OR    MeMoG to OR    ROUGE2 to MeMoG
Arabic           0.25            -0.36          0.11
Czech            0.33            -0.04          0.24
English          0.56            0.47           0.47
French           0.42            0.37           0.50
Greek            0.14            0.33           0.24
Hebrew           0.52            0.05           -0.24
Hindi            0.18            0.33           0.13
All languages    0.12            0.12           0.42

Table: Correlation (Kendall's Tau) Between Gradings. Note: statistically significant results in bold.
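For readers unfamiliar with the statistic, a rank correlation of this kind can be computed as sketched below. The slide does not say which variant of Kendall's Tau was used, so the tie-corrected tau-b is an assumption here, as are the function name `kendall_tau_b` and the toy grade lists, which are not pilot data.

```python
import math
from itertools import combinations


def kendall_tau_b(x, y):
    """Kendall's tau-b between two equally long lists of grades."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    concordant = discordant = tied_x_only = tied_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue  # tied in both rankings: excluded from every count
        if dx == 0:
            tied_x_only += 1
        elif dy == 0:
            tied_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    # Pairs untied in y times pairs untied in x, as in the tau-b denominator.
    denom = math.sqrt((concordant + discordant + tied_x_only)
                      * (concordant + discordant + tied_y_only))
    return (concordant - discordant) / denom if denom else float("nan")


# Toy example: automatic scores against manual grades for four summaries.
automatic = [0.10, 0.25, 0.30, 0.40]
manual = [1, 3, 2, 5]
tau = kendall_tau_b(automatic, manual)
```

Five of the six pairs in the toy example are ordered the same way by both gradings and one is reversed, which gives a tau of (5 - 1) / 6.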

Lesson

Much space for improvement. Negative examples can be good examples...


4 Conclusion

Achievements, The Future


Community

MMS Researchers are present

MMS Researchers are active and collaborating

Researchers need data and evaluation


Dataset

Useful

Publicly available

A basis for future work

Measured effort


From pilot to track

Dataset

Evaluation

Support


Dataset

Change of scale

More languages

More texts

Dataset creation support software

(Funded) Community work


Evaluation

Larger dataset

Use negative examples of summaries

Optimize existing metrics

Devise better metrics


Support

TAC support

Community support

AIJ funding


Thank you!

Last lesson

United we stand, divided we fall... (attributed to Aesop, Greek fabulist)

We stand. (TAC MultiLing Pilot Community)

Co-organizers:

Ilias Zavitsanos (NCSR Demokritos, Greece)

Vasudeva Varma (IIIT Hyderabad, India)

Josef Steinberger (JRC, Italy in collaboration with the Univ. of West Bohemia, Czech Republic)

Benoît Favre (LIF, France)

Marina Litvak (Sami Shamoon College of Engineering, Israel)

Mahmoud El-Haj (Univ. of Essex, UK)

William Darling (Univ. of Guelph, Canada)
