Page 1: CLiN 25: NED with two-stage coherence optimization

NED with two-stage coherence optimization

Filip Ilievski, Marieke van Erp, Piek Vossen, Wouter Beek & Stefan Schlobach

or

How I am teaching my bottle of Jack Daniel’s not to turn into a 168-year-old person with a net income of $120.000.000

Page 2: CLiN 25: NED with two-stage coherence optimization

Context

... is persistently avoided when machines process language. No wonder: context is hard to quantify.

But context lies at the basis of human communication!

Page 3: CLiN 25: NED with two-stage coherence optimization

The burden of context in language

● Language is context-dependent
● Verbal context
  ○ Ford fell from a tree.
    ■ What is “Ford”?
● Social context
  ○ What is “2+2”?
    ■ In mathematics it is 4
    ■ In the car domain it is a car configuration: 2 front + 2 back seats
    ■ In psychology it is a family with 2 parents and 2 children

Page 4: CLiN 25: NED with two-stage coherence optimization

Lincoln increased the annual vehicle sales to 300.000.
...y was born in Lincoln.
Lincoln fell from a tree.
Lincoln was standing on the shelf. It was covered in leather.

Shallow processing

Page 5: CLiN 25: NED with two-stage coherence optimization

Motivation

The shallow approaches can do only this much.
Claim #1: we need to deepen the processing.
Claim #2: context is a limitless inspiration

- verbal
- social
- domain
- spatial
- temporal
- discourse
- (you-name-it)

Page 6: CLiN 25: NED with two-stage coherence optimization

Shall we go a step further?

Page 7: CLiN 25: NED with two-stage coherence optimization

How to go about it

Combine many pieces (algorithms) in a puzzle (solution)
Use as extensive and global knowledge as possible:

- Semantic Web
- Natural Language Processing
- Lexical resources

Page 8: CLiN 25: NED with two-stage coherence optimization

Approach

Optimize the semantic coherence of the disambiguated entities, while still excluding the verbally incorrect options and skewing towards the domain and the popularity of the entities.

Page 9: CLiN 25: NED with two-stage coherence optimization

Components

- Verb-based knowledge from NLP, VerbNet, FrameNet and a domain ontology
- Domain skew (based on corpus analysis)
- Popularity of the candidates (from DBpedia)
- Semantic connectivity and similarity (based on DBpedia information)

No module or knowledge source is perfect, but more than one of each will be helpful!
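As an illustration only, the sketch below shows one way such components could be combined: candidates that fail a verb-based check are dropped, and the remaining ones are scored jointly on popularity, domain skew and pairwise relatedness. It is not the authors' system. The `Candidate` fields, the weights, the fallback rule, the exhaustive search and all scores are assumptions made for this example.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Candidate:
    uri: str             # hypothetical identifier, e.g. a DBpedia-style resource name
    popularity: float    # assumed to be normalised to [0, 1]
    domain_score: float  # assumed to be normalised to [0, 1]
    verb_ok: bool        # outcome of the verb-based check


def disambiguate(mentions, related, w_pop=0.3, w_dom=0.3, w_coh=0.4):
    """Pick one candidate per mention; the weights are illustrative assumptions."""
    # Exclude candidates that fail the verb-based check. If nothing survives,
    # fall back to the full list (an assumption of this sketch).
    pools = {m: [c for c in cands if c.verb_ok] or list(cands)
             for m, cands in mentions.items()}
    names = list(pools)
    best, best_score = None, float("-inf")
    # Exhaustive search over all joint assignments; only viable for toy inputs.
    for combo in product(*(pools[m] for m in names)):
        local = sum(w_pop * c.popularity + w_dom * c.domain_score for c in combo)
        linked = sum(1 for i, a in enumerate(combo) for b in combo[i + 1:]
                     if frozenset((a.uri, b.uri)) in related)
        score = local + w_coh * linked
        if score > best_score:
            best, best_score = combo, score
    return {m: c.uri for m, c in zip(names, best)}


# Toy input with made-up scores.
mentions = {
    "Ford": [Candidate("Ford_Motor_Company", 0.6, 0.9, True),
             Candidate("Gerald_Ford", 0.8, 0.1, True)],
    "General Motors": [Candidate("General_Motors", 0.7, 0.9, True)],
}
related = {frozenset(("Ford_Motor_Company", "General_Motors"))}
result = disambiguate(mentions, related)
print(result)
```

With these made-up scores, "Ford" resolves to the car maker: its domain score and its link to General Motors outweigh the higher popularity of the person.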

Page 10: CLiN 25: NED with two-stage coherence optimization

System design

Page 11: CLiN 25: NED with two-stage coherence optimization

The background knowledge

Page 12: CLiN 25: NED with two-stage coherence optimization

Data

Annotated WikiNews articles
3 subcorpora:

- Airbus Boeing (30)
- General Motors (30)
- Stock Market (30)

Page 13: CLiN 25: NED with two-stage coherence optimization

Results

Page 14: CLiN 25: NED with two-stage coherence optimization

FrameNet + Domain ontology filter

                            Airbus  GM  Stock market
# links filtered                 3  21            22
# incorrect links filtered       3  13            19
# correct links filtered         0   0             3
# not in GS filtered             0   8             0

“Trading on Russia’s stock markets ...”
predicate: markets, Commerce_sell@Seller: Russia
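The counts for the FrameNet + domain ontology filter can be turned into a simple per-corpus figure. The sketch below assumes that "GS" stands for the gold standard and defines the filter's precision as the share of removed links that were indeed incorrect, counting only removals that can be judged against GS. That definition is an assumption of this example, not a measure reported on the slides.

```python
# Counts copied from the FrameNet + domain ontology filter slide.
filter_counts = {
    "Airbus":       {"filtered": 3,  "incorrect": 3,  "correct": 0, "not_in_gs": 0},
    "GM":           {"filtered": 21, "incorrect": 13, "correct": 0, "not_in_gs": 8},
    "Stock market": {"filtered": 22, "incorrect": 19, "correct": 3, "not_in_gs": 0},
}


def filter_precision(row):
    """Share of judged removals that were incorrect links (assumed definition)."""
    judged = row["incorrect"] + row["correct"]
    return row["incorrect"] / judged if judged else None


for corpus, row in filter_counts.items():
    # The three breakdown rows should add up to the number of filtered links.
    assert row["incorrect"] + row["correct"] + row["not_in_gs"] == row["filtered"]
    print(f"{corpus}: {filter_precision(row):.2f}")
```

Under this definition the filter removes only incorrect links for Airbus and GM (1.00 each), and 19 of 22 (0.86) for the stock market corpus.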

Page 15: CLiN 25: NED with two-stage coherence optimization

Combinations

Page 16: CLiN 25: NED with two-stage coherence optimization

Conclusions

Context is useful
Semantic Web can help to model background knowledge

We are still finding new puzzle pieces

Page 17: CLiN 25: NED with two-stage coherence optimization
Page 18: CLiN 25: NED with two-stage coherence optimization

Thank You !

Page 19: CLiN 25: NED with two-stage coherence optimization

Appendices

Page 20: CLiN 25: NED with two-stage coherence optimization

Future

Get rid of the boring pipeline approach.
Use a full-blown optimization system!

Page 21: CLiN 25: NED with two-stage coherence optimization

Resources

- Semantic Web: background knowledge
- Natural Language Processing: grammatical structure and meaning of words
- Lexical resources: structured linguistic information

Page 22: CLiN 25: NED with two-stage coherence optimization

Example

“The United States transferred six detainees from the Guantánamo Bay prison to Uruguay this weekend, the Defense Department announced early Sunday.”

Page 23: CLiN 25: NED with two-stage coherence optimization

State-of-the-art:

United States              Guantanamo Bay     Uruguay              Defence Department
Geographical region        GB detention camp  Geographical region  US Dept. of Defence
Fed. Government            Place              Football team        Ministry of Defence of Rep. of Korea
Men’s soccer team          The naval base     River
Women’s soccer team        Battle of GB       Rugby union team
Rugby union team                              U20 football team
Men’s ice hockey team                         U17 football team
Men’s basketball team
Secondary education in US

Page 24: CLiN 25: NED with two-stage coherence optimization
Page 25: CLiN 25: NED with two-stage coherence optimization

VN: send-11.1

transferred

A0: United States
A0 is Animate or Organization
United States is Animate or Organization

A1: from Guantanamo Bay
A1 is Location
Guantanamo Bay is a location

A2: to Uruguay
A2 is Location
Uruguay is a location

Page 26: CLiN 25: NED with two-stage coherence optimization

VN: say-37.7

announced

A0: the Defence Department
A0 is Animate or Organization
The Defence Department is Animate or Organization
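The role checks on these two slides can be sketched as a small type filter. The restrictions per verb class are the ones listed above; the coarse type given to each candidate reading of "Uruguay" is an assumption made for this example, and the transcript does not show which candidates the actual system removed.

```python
# Role restrictions as listed on the slides for the two verb classes.
ROLE_RESTRICTIONS = {
    "send-11.1": {
        "A0": {"Animate", "Organization"},
        "A1": {"Location"},
        "A2": {"Location"},
    },
    "say-37.7": {
        "A0": {"Animate", "Organization"},
    },
}

# Coarse type per candidate reading of "Uruguay": assumptions for this sketch.
URUGUAY_CANDIDATES = {
    "Geographical region": "Location",
    "Football team": "Organization",
    "River": "Location",
    "Rugby union team": "Organization",
}


def filter_candidates(verb_class, role, candidates):
    """Keep the candidates whose assumed type satisfies the role restriction."""
    allowed = ROLE_RESTRICTIONS[verb_class][role]
    return [name for name, ctype in candidates.items() if ctype in allowed]


# "transferred ... to Uruguay": Uruguay fills A2 of send-11.1, so it must be a Location.
kept = filter_candidates("send-11.1", "A2", URUGUAY_CANDIDATES)
print(kept)
```

Under these assumed types, only the geographical region and the river survive as readings of "Uruguay"; the sports teams are excluded because A2 must be a Location.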

Page 27: CLiN 25: NED with two-stage coherence optimization

After VerbNet

United States              Guantanamo Bay     Uruguay              Defence Department
Geographical region        GB detention camp  Geographical region  US Dept. of Defence
Fed. Government            Place              Football team        Ministry of Defence of Rep. of Korea
Men’s soccer team          The naval base     River
Women’s soccer team        Battle of GB       Rugby union team
Rugby union team                              U20 football team
Men’s ice hockey team                         U17 football team
Men’s basketball team
Secondary education in US

Page 28: CLiN 25: NED with two-stage coherence optimization
Page 29: CLiN 25: NED with two-stage coherence optimization

Results

Page 30: CLiN 25: NED with two-stage coherence optimization