Bolette Sandford Pedersen, Sanni Nimb*, Anna Braasch University of Copenhagen, * Danish Society for Language and Literature Merging specialist taxonomies and folk taxonomies in wordnets - a case study of plants, animals, and foods in the Danish wordnet
20
Embed
Bolette Sandford Pedersen, Sanni Nimb*, Anna Braasch University of Copenhagen, * Danish Society for Language and Literature Merging specialist taxonomies.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Bolette Sandford Pedersen, Sanni Nimb*, Anna Braasch
University of Copenhagen, * Danish Society for Language and Literature
Merging specialist taxonomies and folk taxonomies in wordnets
- a case study of plants, animals, and foods in the Danish wordnet
LREC 2010 2
• Introduction to DanNet Food taxonomies in DanNet based on DDO: ideal cases and problems
• Interrelating natural and functional taxonomies in DanNet
• Conclusions
Outline
LREC 2010
• Joint work: University of Copenhagen & Society for Danish Language and Literature
• Monolingually-based approach:
Danish lexical sources based on corpora:
The Danish Dictionary (DDO); SIMPLE-DK
• International frameworks: Princeton WordNet, EuroWordNet
(umbelliferous plant) rod (tuber) stilk (stalk) indvolde (entrails)
LREC 2010
Tricky cases (≈ false friends)
Examples of these are such as frugt (fruit) nød (nut) bær (berry)
In principle, these cases should evoke two (unrelated) synsets, one in each taxonomy
if you actually encode also the botanical ones
LREC 2010
Excerpt from Food taxonomy (offals and meat)
LREC 2010
Offals and their relations
LREC 2010
Conclusions (1)
Foods taxonomies are typically folk taxonomies emerged spontaneously depending on the goods available and on cooking traditions of a particular region.
Inspired by and related to botanical and zoological taxonomies
The fact that terms are taken over from these natural taxonomies causes problems that require a consistent framework
Monolingual dictionaries do not always have a clear strategy
LREC 2010
Conclusions (2)
We have developed a framework that enables to distinguish and interrelate between the natural taxonomies and the functional taxonomies of the network.
Even if you have monolingual lexical resources that are inconsistent in this respect, the monolingual starting point is important in order to capture correct conceptual structures of a given language