Dr. Hemant Darbari Programme Co-ordinator Applied Artificial Intelligence Group, & ACTS Advanced Computing Training School C-DAC, Pune [email protected]TAG Based Parsing TAG Based Parsing for for Machine Translation - Machine Translation - English to Indian Language English to Indian Language WELCOME
58
Embed
Dr. Hemant Darbari Programme Co-ordinator Applied Artificial Intelligence Group, & ACTS Advanced Computing Training School C-DAC, Pune [email protected].
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Dr. Hemant DarbariProgramme Co-ordinator
Applied Artificial Intelligence Group, & ACTS Advanced Computing Training School
Parsing Process in TAG: An OverviewParsing Process in TAG: An Overview
Workflow of TAG ParserWorkflow of TAG Parser
Generation Process in MANTRAGeneration Process in MANTRA
Generation Process in MANTRA for Multlingual TranslationGeneration Process in MANTRA for Multlingual Translation
Sample Outputs of MANTRASample Outputs of MANTRA
Samples of Constructions Solved through TAG Samples of Constructions Solved through TAG
Issues Regarding Structural Differences and Translation AccuracyIssues Regarding Structural Differences and Translation Accuracy
System specifications System specifications
MANTRA: AchievementsMANTRA: Achievements
MANTRA: IntroductionMANTRA: Introduction
MANTRAMANTRA
MANTRAMANTRA is an acronym of is an acronym of
MAMAchichiNNe assisted e assisted TRATRAnslation tool.nslation tool.
A Tree Adjoining Grammar (TAG) based Machine Translation System of A Tree Adjoining Grammar (TAG) based Machine Translation System of
Applied AI Group of C-DAC, PuneApplied AI Group of C-DAC, Pune
MANTRA translates English documents into Hindi and other Indian MANTRA translates English documents into Hindi and other Indian
Languages, such as Oriya <O>, Tamil <T>, Urdu <U>, Marathi <M> & Languages, such as Oriya <O>, Tamil <T>, Urdu <U>, Marathi <M> &
Bangla <B>Bangla <B>
MANTRA covers the following domains: MANTRA covers the following domains: Administration, Finance, Administration, Finance, Agriculture, Small Scale Industries, Information Technology and Agriculture, Small Scale Industries, Information Technology and Healthcare, Tourism and Proceedings and documents of Rajya SabhaHealthcare, Tourism and Proceedings and documents of Rajya Sabha
Parsing Process in TAG -Parsing Process in TAG -
An OverviewAn Overview
TAG Stands for Tree Adjoining Grammars
• The formalism of this grammar is based on investigation and research of Arvind Joshi (1987)
• Tree is the basic building blocks of this formalism
• In contrast to other formalism, where dependencies are defined between elements of rule (node), in TAG dependencies are defined between different trees .
The adjective like all, both etc takes singular noun form in sentence rather than the plural.
Ex: Rajasthan State Transport Corporation (RSTC) has bus services to all the major destinations of north India..
Relative pronoun sentence has syntax variation output
Ex: Bikaner is also one major hub for the tourists looking for an adventurous Camel ride, which gives an insight into the exquisite lifestyle of remote Rajasthan.
English to Oriya
Honorific Problem:
It is not possible to provide honorific mark at the contextual behavior.
Ex: The majestic Ashoka pillar records visit of emperor Ashoka to Sarnath.
English to Oriya
Accuracy in Translation from English to Oriya is 50%Accuracy in Translation from English to Oriya is 50%
Postposition not joined to the root
Jaipur , popularly-known-as the Pink-City , is the capital of Rajasthan-state , India
Position of clause
Kaziranga National Park is best known for the one-horned Rhinoceros.
English to Marathi
Accuracy in Translation from English to Marathi is 30%Accuracy in Translation from English to Marathi is 30%
English to Urdu
Urdu is a inflectional or isolating language like Hindi. Basically, the variations in the lexical choices are major features in Urdu.
Problem identified in syntactic level
Arrangement of clausesActivisation of the passive sentence
Accuracy in Translation from English to Urdu is 40%Accuracy in Translation from English to Urdu is 40%
System Specification in MANTRASystem Specification in MANTRA
Available Platforms
Technology
Web Based Solution
(Internet)
Java, EJB
Enterprise Solution
(Intranet)
VC++
Desktop solutions
(Standalone)
VC++
Desktop solutionsDesktop solutions
StandaloneStandalone
SQL versions
(Normal, Encrypted)
My SQL versions
(Normal, Encrypted)
Access version
(Normal)
SQL Express version
(Normal)
MSDE version
(Normal)
MANTRA: AchievementsMANTRA: Achievements
MANTRA Technology MANTRA Technology is a recipient is a recipient
of the Computer world Smithsonian of the Computer world Smithsonian
Award and is a part of theAward and is a part of the
“1999 Innovation Collection” “1999 Innovation Collection” in the in the
National Museum for American National Museum for American
History.History.
MANTRA: Achievements
Launched on 14th Sept 2007 by Honorable Minister of Home Affairs, GOI
MANTRA: Achievements
Papers to be Laid on the Table [PLOT]
List Of Business [LOB]
Parliamentary Bulletin Part-I
MANTRA: Achievements
Launched on 29th August 2007 by Honorable Vice-President of India