Apertium RDF: an experience in generating linguistic linked open data
Jorge Gracia
Ontology Engineering Group (OEG), Universidad Politécnica de Madrid (UPM)
1st Summer Datathon on Linguistic Linked Open Data, Cercedilla (Spain), 15-19th June 2014
16/06/2015 1 Presenter name
• Motivation
• The Apertium platform
• Representing translations in RDF
• Building the Apertium RDF graph
• Traversing the graph
• Linking with external sources
• Conclusions
Motivation
Current multilingual lexica and electronic dictionaries
• Proprietary formats
• Non-standard APIs
• Disconnected from other resources
Motivation
GOAL: to expose translations contained in bilingual dictionaries as Linked Data on the Web
Joint effort by
The Apertium platform
Apertium
Apertium [http://www.apertium.org] is an open source platform for machine translation. Its bilingual dictionaries are available in XML.
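As a rough illustration of what "available in XML" means in practice, the sketch below reads translation pairs from a small inline fragment. The fragment is hypothetical sample data shaped like an Apertium bilingual dictionary entry (an `<e>` entry holding a `<p>` pair with left `<l>` and right `<r>` sides and `<s n="..."/>` tags); the helper names `side` and `read_pairs` are assumptions made for this example, not part of Apertium.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data, shaped like a bilingual dictionary fragment.
SAMPLE_DIX = """
<dictionary>
  <section id="main" type="standard">
    <e><p><l>dog<s n="n"/></l><r>perro<s n="n"/></r></p></e>
    <e><p><l>house<s n="n"/></l><r>casa<s n="n"/><s n="f"/></r></p></e>
  </section>
</dictionary>
"""


def side(element):
    """Return (lemma, tags) for one <l> or <r> element."""
    lemma = (element.text or "").strip()
    tags = [s.get("n") for s in element.findall("s")]
    return lemma, tags


def read_pairs(xml_text):
    """Collect ((lemma, tags), (lemma, tags)) for every <p> pair in the text."""
    root = ET.fromstring(xml_text)
    pairs = []
    for pair in root.iter("p"):
        left, right = pair.find("l"), pair.find("r")
        if left is None or right is None:
            continue  # skip malformed pairs instead of failing
        pairs.append((side(left), side(right)))
    return pairs


pairs = read_pairs(SAMPLE_DIX)
for (src, src_tags), (tgt, tgt_tags) in pairs:
    print(src, src_tags, "->", tgt, tgt_tags)
```

Real dictionary files contain more element types than this fragment covers, so a production reader would need to handle those as well.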
Apertium
Afrikaans <-> Dutch
Breton --> French
Catalan <-> Italian
Welsh <-> English
Danish <-- Norwegian
English <-> Catalan
English <-> Spanish
English <-> Galician
Esperanto <-- Catalan
Esperanto <-> English
Esperanto <-- Spanish
Esperanto <-- French
Spanish <-> Aragonese
Spanish <-> Asturian
Spanish <-> Catalan
Spanish <-> Galician
Spanish <-> Italian
Spanish <-> Portuguese
Spanish <-> Romanian
Basque --> English
Basque --> Spanish
French <-> Catalan
French <-> Spanish
Serbo-Croatian <-> English
Serbo-Croatian <-> Macedonian
Serbo-Croatian <-> Slovenian
Indonesian <-> Malaysian
Icelandic <-> Swedish
Icelandic --> English
Kazakh <-> Tatar
Macedonian <-> Bulgarian
Macedonian --> English
Norwegian Nynorsk <-> Norwegian Bokmål
Occitan <-> Catalan
Occitan <-> Spanish
Portuguese <-> Catalan
Portuguese <-> Galician
Northern Sami --> Norwegian
Conclusions
• Apertium data on the Web following Semantic Web standards
• Common entry point for all the Apertium dictionaries
• Direct and indirect translations can be easily obtained via SPARQL
• Confidence degree for indirect translations
• Linked with BabelNet
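The difference between direct and indirect translations can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the entries are hypothetical toy data, the graph is an in-memory dictionary rather than the published RDF graph queried via SPARQL, and the "support" count (number of pivot words that lead to a candidate) is a naive stand-in chosen for this example, not the confidence measure used in Apertium RDF.

```python
from collections import defaultdict

# Hypothetical toy entries: (lang_a, word_a, lang_b, word_b), treated as bidirectional.
ENTRIES = [
    ("en", "dog", "es", "perro"),
    ("en", "dog", "ca", "gos"),
    ("es", "perro", "pt", "cão"),
    ("ca", "gos", "pt", "cão"),
    ("es", "perro", "pt", "cachorro"),
]


def build_graph(entries):
    """Map each (lang, word) node to the set of nodes it translates to."""
    graph = defaultdict(set)
    for lang_a, word_a, lang_b, word_b in entries:
        graph[(lang_a, word_a)].add((lang_b, word_b))
        graph[(lang_b, word_b)].add((lang_a, word_a))
    return graph


def direct(graph, lang, word, target_lang):
    """Translations found in a dictionary that links both languages."""
    neighbours = graph.get((lang, word), ())
    return sorted(w for (l, w) in neighbours if l == target_lang)


def indirect(graph, lang, word, target_lang):
    """Translations reached through one pivot word, with a naive support count."""
    support = defaultdict(set)
    for pivot in graph.get((lang, word), ()):
        if pivot[0] == target_lang:
            continue  # that would be a direct translation, not a pivot
        for cand_lang, cand_word in graph.get(pivot, ()):
            if cand_lang == target_lang:
                support[cand_word].add(pivot)
    return {cand: len(pivots) for cand, pivots in support.items()}


graph = build_graph(ENTRIES)
print(direct(graph, "en", "dog", "es"))
print(direct(graph, "en", "dog", "pt"))
print(indirect(graph, "en", "dog", "pt"))
```

In this toy data there is no English-Portuguese dictionary, so the direct lookup finds nothing, while the pivot lookup reaches Portuguese through Spanish and Catalan. A candidate reached through more pivots gets a higher support count.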
Related reading… http://kdictionaries.com/kdn/kdn23.pdf