A Robust Approach to Aligning Heterogeneous Lexical Resources

Post on 07-Feb-2016

47 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

A Robust Approach to Aligning Heterogeneous Lexical Resources. Mohammad Taher Pilehvar Roberto Navigli. MultiJEDI ERC 259234. Lexical Resource. WordNet. BabelNet. UBY. Lexical Resource. WordNet. BabelNet. UBY. Lexical Resource. WordNet. BabelNet. UBY. - PowerPoint PPT Presentation

Transcript

A Robust Approach to Aligning Heterogeneous Lexical Resources

Mohammad Taher PilehvarRoberto Navigli

MultiJEDI ERC 259234

Lexical Resource

WordNet

BabelNet

UBY

Lexical Resource

WordNet

BabelNet

UBY

Lexical Resource

WordNet

BabelNet

UBY

Improved word and concept coveragee.g., named entities, new senses

Improved domain coverage

Improved multilingualitydozens of new languages

Expert-made relations preservede.g., Hypernymy, meronymy, etc.

Why combine resources?

Provides complementary knowledge

Applications:

Semantic Parsing Shi and Mihalcea, 2005

Semantic Role LabelingPalmer et al., 2010

WSD and entity linkingMoro et al., TACL 2014

Why combine resources?

Difficulty of resource alignment

Fine granularity of lexical resources

plant

WordNet

4 senses 15 senses

How resource alignment works?

WordNetplant#n#1plant#n#1

• Usually measures the similarity of two concepts

WKT: WN:

How resource alignment works?

• Usually measures the similarity of two concepts

• And aligns two concepts if their similarity exceeds a certain threshold

?WordNetplant#n#1plant#n#1WKT: WN:

How resource alignment works?

Alignment approaches differ in the way they calculate this similarity

?WordNetplant#n#1plant#n#1WKT: WN:

Gloss similarity WordNet

Denfinitional similarity

• Strong baseline

• Fall short when• Different wordings are used for same concepts• When two words lack quality glosses

plant -- Buildings for carrying on industrial labor.

plant -- The necessary infrastructure used in support and maintenance of a given facility.

Denfinitional similarity

Contributions

A novel concept similarity measure

Denfinitional similarity

A robust technique foralignment of resourcesA robust technique for

alignment of heterogeneous resources

An effective ontologization approach

Our approach: SemAlign

Our approach: SemAlign

Definition similarity

Our approach: SemAlign

Structural similarity

SemAlign: structural similarity

WordNet

SemAlign: core

Modeling concepts through Semantic Signatures

conceptsome

Personalized PageRankSemantic Signature of a concept concept

Distributional representationover all concepts in the semantic network

Personalized PageRankSemantic Signature of a concept concept

SemAlign

Definition similarity

WordNet

SemAlign

WordNet

SemAlign: signature unification

conc

ept_4

conc

ept_5

conc

ept_2

conc

ept_6

conc

ept_1

conc

ept_3

conc

ept_4

conc

ept_5

conc

ept_2

conc

ept_6

conc

ept_1

conc

ept_3

WordNet

SemAlign: signature unificationFind concepts associated with monosemous words

conc

ept_4

conc

ept_5

conc

ept_2

conc

ept_6

conc

ept_1

conc

ept_3

conc

ept_4

conc

ept_5

conc

ept_2

conc

ept_6

conc

ept_1

conc

ept_3

WordNet

SemAlign: signature unificationTruncate vectors to the overlapping concepts

SemAlign: signature unification

WordNet Synsets containing at least one monosemous word

~ 60% (72,000)

The reliability of leveraging monosemous words

Semantic Signature Comparison

Weighted Overlap(Pilehvar et al., ACL 2013)

Semantic Signature Comparison

SemAlign: score combination

Ontologizationof lexical resources

WordNet

Ontologizationof lexical resources

WordNet

Ontologizationof lexical resources

Ontologizationof lexical resources

Ontologizationof lexical resources

WordNet

Ontologizationof lexical resources

WordNet

Ontologizationof lexical resources

Ontologizationof lexical resources

Ontologizationof lexical resources

Definition page for sail

WordNet

Ontologizationof lexical resources

WordNet

Ontologizationof lexical resources

Ontologizationof lexical resources

WordNet

Ontologizationof lexical resources

Definition page for windmill

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

Ontologizationof lexical resources

Definition page for windmill

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

windmill.n.1linear motion.n.1

rotational motion.n.1translate.v.3

wind.n.1

machine.n.1sail.n.3

adjustable vanes.v.4

Ontologizationof lexical resources

Ontologization: similarity-based disambiguation

Definition page for windmill

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

Ontologization: similarity-based disambiguation

1. A trip in a boat, especially a sailboat.

2. A sailing vessel; a vessel of any kind; a craft.

3. The blade of a windmill.

4. A tower-like structure found on the dorsal (topside) surface of submarines.

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

Definition page for sail

Definition page for windmill

Ontologization: similarity-based disambiguation

1. A trip in a boat, especially a sailboat.

2. A sailing vessel; a vessel of any kind; a craft.

3. The blade of a windmill.

4. A tower-like structure found on the dorsal (topside) surface of submarines.

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

Definition page for sail

Definition page for windmill

Ontologization: similarity-based disambiguation

1. A trip in a boat, especially a sailboat.

2. A sailing vessel; a vessel of any kind; a craft.

3. The blade of a windmill.

4. A tower-like structure found on the dorsal (topside) surface of submarines.

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

Definition page for sail

Definition page for windmill

Ontologization: similarity-based disambiguation

1. A trip in a boat, especially a sailboat.

2. A sailing vessel; a vessel of any kind; a craft.

3. The blade of a windmill.

4. A tower-like structure found on the dorsal (topside) surface of submarines.

1. A machine which translates linear motion of wind to rotational motion by means of adjustable vanes called sails.

Definition page for sail

Definition page for windmill

Ontologization:evaluation

Our methodWKTWSD

0.7

0.8

0.9

F1

Acc

(Meyer & Gurevych, 2012)

Alignment Experiments:Datasets

(Matuschek & Gurevych, TACL 2013)

WordNetWN synsetsmanually mapped to their corresponding concepts

320

484315

Alignment Experiments:System configurations

Parameter 1:

score combination parameter

𝜷𝜷

𝜷

Alignment Experiments:System configurations

Parameter 2:

similarity threshold 𝜽

Alignment Experiments:System configurations

𝜽 𝜷

0 1

0.5

Setting and

a. Unsupervised system

Alignment Experiments:System configurations

𝜽 𝜷

0 1

a. Unsupervised system

Setting and

b. Tuning on subset

0.42 0.53

Alignment Experiments:System configurations

𝜽 𝜷

a. Unsupervised system

Setting and

b. Tuning on subsetc. Cross validation

0 1

0.44 0.51

Comparison systemDijkstra-WSA

(Matuschek & Gurevych, TACL 2013)

Alignment Experiments

WN-WP WN-WT WN-OW0.6

0.65

0.7

0.75

0.8

0.85

0.9

0.805

0.712 0.709

0.833

0.73 0.733

0.84

0.722

0.749

Unsupervised Tuning on subset Cross-validation

SB

SB+DWSAF1

Alignment Experiments

WN-WP WN-WT WN-OW0.6

0.65

0.7

0.75

0.8

0.850.824

0.736

0.824

0.723

0.684 0.684

Tuning on WN-WP Tuning on WN-WT Tuning on WN-OW

F1SB

SB+DWSA

SemAlign: structural similarity

Alignment Experiments

WN-WP WN-WT WN-OW0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.71

0.39

0.473

0.83

0.623 0.627

DWSA SemAlign

F1

Conclusions

• A novel approach for aligning lexical resources– Accurate even in the absence of training data

– Robust across different resources

• An effective ontologization approach

• Experiments on aligning – WN to WP, WT, and OW

Future directions

• Integrating the approach into BabelNet for boosting the alignment accuracy

• Alignment across different languages

• Updating lexicons with novel terms

1. thanks -- an acknowledgment of appreciation

2. thanks -- with the help of or owing to;

2. thanks -- an expression of gratitude.

2. thanks -- because of; normally used with a positive connotation, though it can be used sarcastically.

WordNet

Thanks!

top related