Coupling Semi-Supervised Learning of Categories and Relations by Andrew Carlson, Justin Betteridge, Estevam R. Hruschka Jr. and Tom M. Mitchell School of Computer Science Carnegie Mellon University presented by Thomas Packer

Dec 19, 2015

Transcript
Page 1

Coupling Semi-Supervised Learning of Categories and Relations

by Andrew Carlson, Justin Betteridge, Estevam R. Hruschka Jr. and Tom M. Mitchell

School of Computer Science, Carnegie Mellon University

presented by Thomas Packer

Page 2

Bootstrapped Information Extraction

• Semi-Supervised:
– Seed knowledge (predicate instances & patterns)
– Pattern learners (use learned instances)
– Instance learners (use learned patterns)

• Feedback Loop:
– Rel1(X, Y)
– Sent1(X, Y), Rel0(X, Y) → Pat1
– Pat1: Sent2(A, B) → Rel1(A, B)
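The feedback loop can be sketched in a few lines of Python. This is a minimal illustration, not the authors' system: the corpus is assumed to be already reduced to (arg1, context, arg2) triples, and the function names and toy sentences are hypothetical.

```python
# Toy corpus: each "sentence" is pre-parsed into (arg1, context, arg2).
# The triples below are illustrative data, not taken from the paper.
CORPUS = [
    ("Steve Ballmer", "is the CEO of", "Microsoft"),
    ("Larry Page", "is the CEO of", "Google"),
    ("Larry Page", "co-founded", "Google"),
    ("Sergey Brin", "co-founded", "Google"),
]
SEEDS = {("Steve Ballmer", "Microsoft")}  # Rel0: seed instances


def learn_patterns(corpus, instances):
    """Pattern learner: keep every context that links a known instance pair."""
    return {context for x, context, y in corpus if (x, y) in instances}


def learn_instances(corpus, patterns):
    """Instance learner: keep every pair that occurs with a known pattern."""
    return {(a, b) for a, context, b in corpus if context in patterns}


def bootstrap(corpus, seed_instances, iterations):
    """Alternate the two learners, feeding each one the other's output."""
    instances = set(seed_instances)
    patterns = set()
    for _ in range(iterations):
        patterns |= learn_patterns(corpus, instances)
        instances |= learn_instances(corpus, patterns)
    return instances, patterns
```

With one iteration the loop learns only the pattern "is the CEO of". With two iterations it also picks up "co-founded" and then promotes ("Sergey Brin", "Google"), showing how an unconstrained loop can amplify a loose pattern.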

Page 3

Challenges and Previous Solutions

• Semantic drift: The feedback loop amplifies errors and ambiguities.

• Semi-supervised learning often suffers from being under-constrained.

• Multiple mutually-exclusive predicate learning: Positive examples of one predicate are also negative examples of others.

• Category and predicate learning: Arguments must be of certain types.

Page 4

Does More Look Harder?

Page 5

Approach

• Simultaneous bootstrapped training of multiple categories and multiple relations.

• Growing related knowledge provides constraints to guide continued learning.

• Ontology Constraints:
– Mutually exclusive predicates imply negative instances and patterns.
– Hypernyms imply positive instances.
– Relation argument type constraints imply positive category and negative relation instances.

Page 6

Mutual Exclusion Constraint

• “city” and “scientist” categories are mutually exclusive.

• If “Boston” is an instance of “city”, then it is also a negative instance of “scientist”.

• If “mayor of arg1” is a pattern for “city”, then it is also a negative pattern for “scientist”.
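The mutual exclusion rule can be sketched as a filter over candidate instances or patterns. This is a hedged illustration: the data structures and the function names `negatives_implied` and `filter_candidates` are assumptions made for this example, not the paper's implementation.

```python
# Pairs of categories declared mutually exclusive in the ontology.
MUTUALLY_EXCLUSIVE = {frozenset({"city", "scientist"})}


def exclusive_with(predicate):
    """Return every predicate declared mutually exclusive with `predicate`."""
    others = set()
    for pair in MUTUALLY_EXCLUSIVE:
        if predicate in pair:
            others |= pair - {predicate}
    return others


def negatives_implied(predicate, item):
    """A promoted item (instance or pattern) is negative for exclusive predicates."""
    return {(other, item) for other in exclusive_with(predicate)}


def filter_candidates(predicate, candidates, promoted):
    """Drop candidates already promoted for a mutually exclusive predicate.

    `promoted` maps predicate name -> set of promoted items (hypothetical layout).
    """
    blocked = set()
    for other in exclusive_with(predicate):
        blocked |= promoted.get(other, set())
    return [c for c in candidates if c not in blocked]
```

For example, once "Boston" is promoted for "city", `filter_candidates("scientist", ["Boston", "Marie Curie"], {"city": {"Boston"}})` keeps only "Marie Curie". The same filter applies unchanged to patterns such as "mayor of arg1".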

Page 7

Hypernym Constraints

• “athlete” is a hyponym of “person”.

• If “John McEnroe” is a positive instance of “athlete”, then it is also a positive instance of “person”.
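One way to picture the hypernym rule is as upward propagation through the category hierarchy. The sketch below assumes a simple child-to-parent dictionary with no cycles; the names `HYPERNYM_OF`, `ancestors`, and `promote` are hypothetical.

```python
# child category -> parent category (assumed acyclic)
HYPERNYM_OF = {"athlete": "person"}


def ancestors(category, hypernym_of):
    """Walk up the hierarchy and list every hypernym of `category`."""
    chain = []
    while category in hypernym_of:
        category = hypernym_of[category]
        chain.append(category)
    return chain


def promote(instance, category, knowledge, hypernym_of):
    """Add `instance` to `category` and to all of its hypernyms."""
    for cat in [category] + ancestors(category, hypernym_of):
        knowledge.setdefault(cat, set()).add(instance)
    return knowledge
```

Calling `promote("John McEnroe", "athlete", {}, HYPERNYM_OF)` records the instance under both "athlete" and "person".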

Page 8

Type Checking Constraints

• The “ceoOf()” relation must have arguments of type “person” and “company”.

• If “bicycle” is not a “person” then “ceoOf(bicycle, Microsoft)” is a negative instance of “ceoOf()”.

• If “ceoOf(Steve Ballmer, Microsoft)” is true, then “Steve Ballmer” is a positive instance of “person”. “Microsoft” is handled similarly, as a positive instance of “company”.
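Both directions of the type-checking rule can be sketched as follows. This is an illustration under assumed data structures: `ARG_TYPES` and the set of (category, item) negative pairs are hypothetical layouts, not the authors' code.

```python
# relation name -> required category of each argument
ARG_TYPES = {"ceoOf": ("person", "company")}


def is_type_consistent(relation, args, negatives):
    """Reject a relation candidate if any argument is a known negative
    instance of the category its position requires."""
    return all(
        (arg_type, arg) not in negatives
        for arg_type, arg in zip(ARG_TYPES[relation], args)
    )


def implied_category_instances(relation, args):
    """A promoted relation instance implies a positive category instance
    for each argument."""
    return set(zip(ARG_TYPES[relation], args))
```

Given `negatives = {("person", "bicycle")}`, the candidate `("bicycle", "Microsoft")` fails the check for "ceoOf", while promoting `("Steve Ballmer", "Microsoft")` implies ("person", "Steve Ballmer") and ("company", "Microsoft").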

Page 9

Coupled Bootstrap Learner

Page 10

Knowledge Constraints Make Extraction Easier

Page 11

Knowledge Constraints Make Extraction Easier

Page 12

Conclusion

• The results clearly show improvements from the added constraints.

• Could probably benefit from:
– adding probabilistic reasoning
– a larger corpus
– higher thresholds
– more contrastive categories
– other techniques discussed in this class

Page 13

Questions