Web-Based Concordancer to L earn Usage of English Expre ssions Takashi Yamanoue Kyushu Institute of Technol ogy, Japan Toshiro Minami Kyushu Institute of Informa tion Sciences & Kyushu University, Japan Ian Ruxton Kyushu Institute of Technol ogy, Japan ICITA2002@Bathurst (02.11.25-2 8)
23
Embed
Web-Based Concordancer to Learn Usage of English Expressions Takashi Yamanoue Kyushu Institute of Technology, Japan Toshiro Minami Kyushu Institute of.
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Web-Based Concordancer to Learn Usage of English Expressions
Takashi Yamanoue Kyushu Institute of Technology, JapanToshiro Minami Kyushu Institute of Information Sciences &
Kyushu University, JapanIan Ruxton Kyushu Institute of Technology, Japan
ICITA2002@Bathurst (02.11.25-28)
Contents
Motivation System Design and Implementation Examples and Experiments for
Corpus linguistics Time-consuming Needs hard work in order to make a good
corpus Copyright problem
(often) Outdated from the beginning Tools are mainly for experts
Difficult to use for ordinary learners
A Solution
Use of “Web-Corpus”= Using Web Documents as a Corpus
Maintenance free: Exists as it is Always new, reflects current status of
languagesA lot of applications/services are available on
the Internet
System Design and Implementation( WebLEAP )
Features
With WebLEAP(Web Language Evaluation Assistant Program) , we can get some information for: comparing expressions, deciding if the expression is right or wrong, and finding out wrong parts.
System Organization
WebLEAP
See/watch the TV/movie
“the TV” “see the TV” “watch the TV”
216,107 873* 1,593
“the movie” “see the movie” “watch the movie”
472,776 13,638 6,666*
common.
Not common
Examples and Experiments for Evaluation
ExperimentsPerson’s Name
Place Names
“Bertrand Russell” “Burtrand Russell”
12,575 3
Not only for Historical names
“Jenolan caves” “Genolan caves”
992 2
Choose one phrase or word that should be corrected.