Jan 12, 2016
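Lexical Chunks in Chinese Learners’ Writing (WECCL)
Chinese title: 中国学习者英语笔语中的词块能力研究 (A Study of Lexical Chunk Competence in Chinese Learners’ Written English)
Xu Jiajin (许家金)
National Research Centre for Foreign Language Education, Beijing Foreign Studies University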
Contents
• Chunks
• How to extract chunks in WS4?
• Comparisons of chunks
• Interpretations
• Other areas of collocation
Chunk
• 词块 (lexical chunk)
• http://home.henannu.edu.cn/fl/clweek/pujz/chunks.ppt
• Related terms: chunk / lexical bundle / n-gram / multi-word unit (expression) / formulaic sequence / prefabs
• Cluster (词丛) in WordSmith
Why are lexical chunks important?
• Break language into units on a probabilistic basis
• Recurrent
• Psychologically real
• Form-function composite
• Can we build the entire edifice of language solely on lexical chunks?
• If not, what are the bricks & mortar of language?
Lexical grammar
• Lexico-grammar
• Pattern grammar
• Construction grammar
• Collostruction
• etc
Research question
• Are there any differences in chunk use between Chinese learners of English and native speakers (NS)?
• What are some of the possible underlying reasons?
Lexical chunk extraction in WS4
• Step 01: Clusters in [Concord], focusing on individual words
Automatic extraction of lexical chunks in WS4
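To illustrate what a node-word cluster search does, here is a minimal Python sketch. It is not WS4's own procedure: the tokenizer, the function names and the sample sentence are all assumptions made for this example. It counts every n-word sequence that contains a chosen node word.

```python
import re
from collections import Counter

def tokenize(text):
    """Lowercase the text and keep runs of letters/apostrophes as tokens.
    This is a simplifying assumption, not WS4's tokenization."""
    return re.findall(r"[a-z']+", text.lower())

def node_clusters(tokens, node, n=3):
    """Count every n-word sequence that contains the node word."""
    counts = Counter()
    for i in range(len(tokens) - n + 1):
        gram = tokens[i:i + n]
        if node in gram:
            counts[" ".join(gram)] += 1
    return counts

# Invented sample text, for illustration only.
sample = "on the other hand we think that on the other hand they disagree"
clusters = node_clusters(tokenize(sample), "hand", n=3)
for cluster, freq in clusters.most_common(3):
    print(freq, cluster)
```

Because the search is tied to one node word, it only finds chunks around words the researcher already suspects are interesting.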
• Step 02: Clusters in [WordList]
• Index the corpus data before computing clusters
• Step 03: How to index a corpus
• 1. [Settings-Index] First assign a name to the text(s) to be indexed
• 2. [Make/Add to Index] Then index the selected text(s)
• Result: a .tokens and .types file pair
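To show conceptually why an index separates tokens from types, here is a minimal Python sketch. The slides do not describe the actual layout of WS4's .tokens and .types files, so the structure below is an assumption for illustration: the distinct word forms (types) are stored once, and the running text (tokens) is stored as a sequence of references to them.

```python
def build_index(tokens):
    """Return (types, token_ids): the distinct word types in order of first
    appearance, and the running text encoded as positions in that list."""
    type_ids = {}
    token_ids = []
    for tok in tokens:
        if tok not in type_ids:
            type_ids[tok] = len(type_ids)
        token_ids.append(type_ids[tok])
    types = list(type_ids)  # dicts keep insertion order in Python 3.7+
    return types, token_ids

# Invented sample text, for illustration only.
types, token_ids = build_index("the cat saw the dog and the cat ran".split())
print(len(token_ids), "tokens,", len(types), "types")
```

Keeping word order in the token sequence is what makes it possible to compute multi-word clusters later; a plain frequency list would have lost that information.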
• Step 04: Extracting clusters in WordList
• [WordList-Open] *.tokens
• [Compute-Clusters]
• [File-Save]/[File-Save As]
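The cluster computation in this step can be approximated outside WS4 with a few lines of Python. This is a rough sketch under stated assumptions, not WS4's algorithm: the function names, the minimum-frequency cut-off of 2 and the sample text are all chosen for this example. It builds one frequency list per cluster length.

```python
from collections import Counter

def ngram_counts(tokens, n):
    """Count all contiguous n-word sequences in a token list."""
    return Counter(" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def cluster_lists(tokens, sizes=range(2, 7), min_freq=2):
    """Return {n: [(cluster, freq), ...]}, keeping only clusters that occur
    at least min_freq times, sorted by descending frequency."""
    result = {}
    for n in sizes:
        counts = ngram_counts(tokens, n)
        result[n] = [(c, f) for c, f in counts.most_common() if f >= min_freq]
    return result

# Invented sample text, for illustration only.
tokens = "as a result of this as a result of that".split()
lists = cluster_lists(tokens)
for n, clusters in lists.items():
    print(n, clusters)
```

A minimum frequency matters because most long word sequences occur only once, and only recurrent sequences count as chunks.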
13
• 2-8 word cluster lists (多词词表, multi-word lists) of WECCL and LOCNESS
• weccl index 2-word clusters
• …
• weccl index 6-word clusters
• locness index 2-word clusters
• …
• locness index 6-word clusters
• Step 05: Generate a keyword list to test whether chunk overuse and underuse are statistically significant
• [Keyword list] - [New]
• Reference corpus
• Chi-square test
• Log-likelihood test
• Keyness p value ≤ .05
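For readers who want to see the log-likelihood test spelled out, here is a minimal Python sketch. It uses the two-term form common in corpus linguistics; the slides do not state how WS4 computes keyness internally, so do not read this as WS4's implementation. The frequencies and corpus sizes are invented, and 3.84 is the chi-square critical value for p = .05 with one degree of freedom.

```python
import math

CRITICAL_P05 = 3.84  # chi-square critical value, 1 degree of freedom, p = .05

def log_likelihood(freq_a, size_a, freq_b, size_b):
    """Two-term log-likelihood for one chunk in corpus A versus corpus B."""
    total = size_a + size_b
    expected_a = size_a * (freq_a + freq_b) / total
    expected_b = size_b * (freq_a + freq_b) / total
    ll = 0.0
    if freq_a > 0:  # a zero frequency contributes nothing to the sum
        ll += freq_a * math.log(freq_a / expected_a)
    if freq_b > 0:
        ll += freq_b * math.log(freq_b / expected_b)
    return 2 * ll

def keyness(freq_a, size_a, freq_b, size_b):
    """Return (LL score, label) describing corpus A relative to corpus B."""
    ll = log_likelihood(freq_a, size_a, freq_b, size_b)
    if ll < CRITICAL_P05:
        return (ll, "no significant difference")
    if freq_a / size_a > freq_b / size_b:
        return (ll, "overused")
    return (ll, "underused")

# Hypothetical counts: a chunk occurring 120 times in a 100,000-word learner
# corpus and 40 times in a 100,000-word reference corpus.
score, label = keyness(120, 100000, 40, 100000)
print(round(score, 2), label)
```

The score only says that a difference is significant. The direction (overuse or underuse) has to be read from the relative frequencies, which is why the sketch compares them separately.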
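Similar studies
• Guan Bo (管博) & Zheng Shutang (郑树堂), 2005, A study of small words in Chinese college students' spoken English, 《外语教学与研究》 (Foreign Language Teaching and Research), No. 6.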
Other tools for extracting lexical chunks
• kfNgram (William Fletcher) √
• Collocation Extraction
• C-ngram
• N-gram PhraseExtract (Tom Cobb’s page)
• PIE: Phrases in English (Fletcher)
A quick summary
• Chunk defined
• How to extract chunks from a corpus with WS4
• How to compare chunks using keyword lists
• Interpretation
• A bibliography of chunk research
• Aspects of collocation study
www.ddyyx.com/wqf.rar
../lmc.rar
../wlf.rar
../xjj.rar
www.corpus4u.com
Thank you!