Acquiring a Verbnet like Classification for French. Making Use of Existing Lexical Resources to Build a Verbnet like Classification of French Verbs Ingrid Falk ´ Ecole Doctorale IAEM Sp´ ecialit´ e Informatique Soutenance de th` ese 13/06/2012 Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 1 / 55
67
Embed
Making Use of Existing Lexical Resources to Build a ...Syntactic classification Semantic classification Syntactic classification
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Acquiring a Verbnet like Classification for French.
Making Use of Existing Lexical Resources to Build aVerbnet like Classification of French Verbs
Ingrid Falk
Ecole Doctorale IAEMSpecialite Informatique
Soutenance de these 13/06/2012
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 1 / 55
Acquiring a Verbnet like Classification for French.
Overview
Topic of the thesis
Explore ways of building a syntactic semantic classification of French verbs
where groups of verbs are associated with:
I syntactic information (subcategorisation frames)
I semantic information (thematic role sets)
Using existing lexical resources for French and English.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 2 / 55
Acquiring a Verbnet like Classification for French.
Overview
More specifically
I we explore ways of building a syntactic classificationI using the classification methods
I Formal Concept Analysis (FCA) – symbolicI Incremental Growing Neural Gas with Feature maximisation (IGNGF) –
neural clustering
I two-fold evaluation
1. on verb groups2. on associations of verbs with syntactic frames and thematic role sets
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 3 / 55
Acquiring a Verbnet like Classification for French.
Overview
Contributions
I automatic acquisition of a syntactic-semantic classification
I two classification techniques not yet used for verb classification
I novel translation approach to build a semantic classification
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 4 / 55
Syntactic classification
<verbs , SCFs>Semantic classification
<verbs , themat ic ro le se t s>
Syntactic classification with semantic labels
<verbs , SCFs, themat ic ro le se ts>
French syntact ic lexicon
Syntactic classification
English syntact ic-semantic verb classes (Verbnet)
Translation
Align
Acquiring a Verbnet like Classification for French.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 17 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Outline
4 Clustering MethodsFormal Concept Analysis (FCA)Incremental Growing Neural Gas with Feature Maximisation (IGNGF)
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 18 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA) [Ganter and Wille, 1999]
I symbolic method for deriving conceptual structures – concepts –out of data
I FCA organises concepts into a hierarchy – concept latticeI Concepts determined by:
I extent: set of objects shared by attributes in intentI intent: set of attributes shared by objects in extent
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 18 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA)
The data
Objects: 2091 verbs
Attributes: I 238 frames from merged syntactic lexiconI additional syntactic and semantic features from
Dicovalence and Ladl
Example
framesSUJ:NP,OBJ:NP,AOBJ:PP SUJ:NP,OBJ:NP,DEOBJ:PP Sym ArgNbr Loc Nhum
expedier X X X X
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 19 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA)
The concept lattice
12 802 concepts
I need to filter
How to select the most relevant concepts?
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 20 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA)
Concept selection indices
I introduced in [Klimushkin et al., 2010]I select relevant conceptsI in concept lattices built on noisy data
Stability I How much does a concept depend on individualmembers in extent/intent?
Separation I How well does a concept sort out verb and frames itcovers from other verb and frames.
Probability I What is the probability of a concept intent/extent to bea concept intent/extent by chance?
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 21 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA)
Which indices to select the best classes?
Method:Using fixed combination of indices
I select N, (N ∈ {1500, 1000, 500}) concepts from concept lattice withhighest index combination
I align classes translated from Verbnet with these concepts
I select FCA concepts with associated Verbnet class
I compare obtained 〈verb, Verbnet class〉 associations with a reference
Best combination of indices:
I 〈verb, VN class〉 associations are closest to reference
I concepts associated to VN classes cover large proportion of verbs
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 22 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA)
Best combination of concept selection indices
stability + separation
I F2 = 25.16
I close to upper bound (no selection)
I coverage 98.04%
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 23 / 55
Acquiring a Verbnet like Classification for French.
Clustering Methods
Formal Concept Analysis (FCA)
Final classification method
1) use FCA to build classes grouping French verbs and SCFs2) select 1500 concepts where stability + separation is highest3) align translated Verbnet classes with selected concepts4) keep FCA concepts aligned with a translated Verbnet class5) associate these FCA concepts with the Verbnet class thematic role sets
Effectively we obtain a classification associating:
I groups of French verbsI groups of subcategorisation framesI sets of thematic roles
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 24 / 55
Acquiring a Verbnet like Classification for French.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 32 / 55
Acquiring a Verbnet like Classification for French.
Evaluation and Comparison
Outline
5 Evaluation and ComparisonEvaluating Semantic Verb Classes wrt. Existing ReferenceEvaluating Syntactic-Semantic Verb Classes wrt. Corpus AnnotationsSummary
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 33 / 55
Acquiring a Verbnet like Classification for French.
Evaluation and Comparison
Evaluation
Goal: evaluate both FCA and IGNGF wrt.
I groups of verbsI associations with syntactic frames – 〈verb, frame〉 pairsI associations with thematic grids – 〈verb, thematic role set 〉 pairsI associations with both syntactic frames and thematic grids –〈verb, syntactic frame, thematic role set〉 triples
Other question:
I Which features work best for what classification technique?
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 33 / 55
Acquiring a Verbnet like Classification for French.
Evaluation and Comparison
Resources for evaluation
V-gold by [Sun et al., 2010]
I groups ≈160 verbs in 16 Levin classes
VN class French translations in goldrole setamalgamate-22.2 incorporer; associer; reunir; melanger; meler; unir; assembler;
I VerbsI in Verbnet classes from V-gold translated to FrenchI 2100 verbs
I FeaturesI scf: subcategorisation framesI sem/synt: additional syntactic and/or semantic featuresI grid: translated classes a verb is a member of (IGNGF only)
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 36 / 55
Acquiring a Verbnet like Classification for French.
Acquiring a Verbnet like Classification for French.
Conclusion
Future Work
Improve classifications
I Better associations with syntactic frames:
FCA I attribute (scf) based selection indicesI exploit hierarchical structure
IGNGF I cluster labeling depending on individual framesI towards creating overlapping classifications
I Better associations with thematic grids:I better methods of aligning clusters and translated Verbnet classesI explore other methods of associating verbs/frames with thematic role
sets.
I Better evaluation method:I How significant is comparison with < 10% reference data?I Use unsupervised evaluation measures (eg. cumulated micro precision
[Lamirel et al., 2011a]).
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 52 / 55
Acquiring a Verbnet like Classification for French.
Conclusion
Future Work
Polysemy
I How to adequately represent it?
I How to evaluate?
Explore fully unsupervised approach
I using distributional data – eg. LexSchem
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 52 / 55
Acquiring a Verbnet like Classification for French.
Conclusion
Publications
Ingrid Falk, Claire Gardent, and Jean-Charles Lamirel.Classifying French Verbs Using French and English Lexical Resources.In Proceedings of the 50th annual meeting of the ACL, July 2012.
Ingrid Falk and Claire Gardent.Combining Formal Concept Analysis and Translation to Assign Frames andThematic Grids to French Verbs.In Concept Lattices and their Applications, October 2011.
Ingrid Falk and Claire Gardent.Bootstrapping a Classification of French Verbs Using Formal ConceptAnalysis.In Interdisciplinary Workshop on Verbs, November 2010.
Ingrid Falk, Claire Gardent, and Alejandra Lorenzo.Using Formal Concept Analysis to Acquire Knowledge about Verbs.In Concept Lattices and Their Applications, October 2010.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 53 / 55
Acquiring a Verbnet like Classification for French.
Associations with frames and thematic role sets (moredetailed)
〈verb, frame〉 pairs in corpus: recall 59.59 for IGNGF, 88.69 for FCA.
FCA better reflects associations with frames and grids in SRL gold.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 54 / 55
Acquiring a Verbnet like Classification for French.
IGNGF vs. FCA
Differences
I crisp, non-overlapping, no hierarchical structure
I features can be weighted (not only binary):
weight of feature f for verb v 7−→W fv ∈ [0, 1]
Analogy
[Lamirel, 2010]: A cluster c where for all maximal features f :
FPc(f ) = 1 and FRc(f ) = 1
=⇒ c is formal concept:
I extent: verbs in c
I intent: maximal features for cIngrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 55 / 55
Acquiring a Verbnet like Classification for French.
M.-H. Candito, B. Crabbe, and M. Falco.Dependances syntaxiques de surface pour le francais.Technical report, Universite de Paris 7, 2009.
Bernhard Ganter and Rudolph Wille.Formal concept analysis: Mathematical foundations.Springer, Berlin-Heidelberg, 1999.
Mikhail Klimushkin, Sergei Obiedkov, and Camille Roth.Approaches to the selection of relevant concepts in the case of noisydata.In Leonard Kwuida and Baris Sertkaya, editors, Formal ConceptAnalysis, volume 5986 of Lecture Notes in Computer Science,chapter 18, pages 255–266. Springer Berlin / Heidelberg, Berlin,Heidelberg, 2010.
J. C. Lamirel, P. Cuxac, and R. Mall.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 55 / 55
Acquiring a Verbnet like Classification for French.
A new efficient and unbiased approach for clustering qualityevaluation.In QIMIE’11, PaKDD, Shenzen, China, 2011.
J.-C. Lamirel, R. Mall, P. Cuxac, and G. Safi.Variations to incremental growing neural gas algorithm based on labelmaximization.In Neural Networks (IJCNN), The 2011 International Joint Conferenceon, pages 956 –965, 2011.
Jean-Charles Lamirel.A new multi-viewpoint and multi-level clustering paradigm for efficientdata mining tasks.In Kimito Funatsu, editor, New Fundamental Technologies in DataMining, INTECH E-Book Series, pages chapitre 15, pp. 283–304.INTECH Open Access Publisher, 2010.
Karin Kipper Schuler.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 55 / 55
Acquiring a Verbnet like Classification for French.
VerbNet: A Broad-Coverage, Comprehensive Verb Lexicon.PhD thesis, University of Pennsylvania, 2006.
Lin Sun, Anna Korhonen, Thierry Poibeau, and Cedric Messiant.Investigating the cross-linguistic potential of VerbNet-styleclassification.In Proceedings of the 23rd International Conference on ComputationalLinguistics, COLING ’10, pages 1056–1064, Stroudsburg, PA, USA,2010. Association for Computational Linguistics.
Robert S. Swier and Suzanne Stevenson.Unsupervised semantic role labellin.In EMNLP, pages 95–102, 2004.
Ingrid Falk Acquiring a Verbnet like Classification for French. Nancy 13/06/2012 55 / 55