Top Banner
Language corpora for grammar and vocabulary instruction. Jon Smart Department of Linguistics University of Pittsburgh
26
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Using corpora in instruction

Language corpora for grammar and vocabulary instruction.

Jon Smart

Department of Linguistics

University of Pittsburgh

Page 2: Using corpora in instruction

Language corpora

¤ Originally corpora were paper-based and were searched by hand.

¤ Modern corpora are computer databases that can be searched or analyzed using software programs.

¤ Some corpora are available as online searchable databases.

¤ Many Modern grammar references and some ESL textbooks are based on/informed by corpora.

Page 3: Using corpora in instruction

Corpora in language learning

Corpora can offer language learners the following benefits:

¤ More accurate descriptions of the language than teachers’/textbook writers’ intuitions (Biber & Reppen, 2002, Conrad, 1999).

¤  Exposure to contextualized, meaningful language and “real” language data (Reppen, 2010).

¤  Language input for a specific registers (Conrad, 2000, Flowerdew, 2009, 2012).

¤ A reference tool for autonomous learning (Johns, 1991).

Page 4: Using corpora in instruction

Application to language teaching

1.  Corpus research informs syllabus design (e.g., word frequency, register information for ESP).

2.  Textbooks and materials informed by language corpora

3.  Learners use corpora or corpus data directly as learning tools.

Page 5: Using corpora in instruction

Textbooks/materials

¤ Grammar & Beyond, Reppen et al,, 2012

¤ Real Grammar, Conrad & Biber, 2009

¤ Touchstone Series, McCarthy, McCarten, &

Sandiford, 2004/2006

¤ Natural Grammar, Thornbury, 2004 [example]

Page 6: Using corpora in instruction
Page 7: Using corpora in instruction
Page 8: Using corpora in instruction

Corpora as learning tools

¤  Data-driven learning pioneered by Johns (1991) having relatively advanced language learners search corpora he built using micro-computers.

¤  Learner-focused approach to identifying patterns/trends in language learning.

¤  Can be done either by having learners conduct their own searches of corpora using a search tool (e.g., a concordancer) OR by pre-selecting materials from corpora to help learners discover language patterns.

Page 9: Using corpora in instruction

Data-driven learning

¤  Has been used in several areas of language teaching: vocabulary, grammar, discourse, writing, translation studies, etc.

¤  Research on both hands-on and paper-based DDL, but empirical evidence not entirely clear (see Boulton 2010, 2012 for summaries of empirical research).

¤  Can use existing corpora or you can have students create their own (valuable for ESP contents).

Page 10: Using corpora in instruction

Hands-on DDL

Two strategies for doing hands-on concordancing with your students:

1.  Build your own corpus of documents relevant to your course(s).

2.  Use an already existing, web-based corpus or corpora.

Page 11: Using corpora in instruction

Build your own corpus What you need to get started:

¤ Collection of language texts (saved as individual text files (.txt) or similar).

¤ Concordancer (a search interface for your texts) ¤ AntConc (free) ¤ Wordsmith ¤ MonoConc

Page 12: Using corpora in instruction

Using an existing corpus

¤  Several important, powerful corpora available online. Here are a few

¤  Lextutor – very valuable for vocabulary study. Lextutor.ca

¤ Michigan Corpus of Academic Spoken English (MICASE)

¤ Michigan Corpus of Upper-Level Student Papers (MICUSP)

¤  Sacodeyl EU-sponsored project, Variety of languages.

¤ Collection of corpora at Brigham Young University. corpus.byu.edu.

Page 13: Using corpora in instruction

Using an existing corpus

¤  Several important, powerful corpora available online (all free, but some require registration) Here are a few

¤  Lextutor – very valuable for vocabulary study. Lextutor.ca

¤ Michigan Corpus of Academic Spoken English (MICASE)

¤ Michigan Corpus of Upper-Level Student Papers (MICUSP)

¤  Sacodeyl EU-sponsored project, Variety of languages.

¤ Collection of corpora at Brigham Young University. corpus.byu.edu.

Page 14: Using corpora in instruction

BYU Corpus interface

Page 15: Using corpora in instruction

Paper-based DDL

¤  Using a concordancer or web-interface can be daunting for students.

¤  Teachers can use corpora to build teaching activities that use authentic language materials but are not overwhelming for their students. Focus on language, not learning to search corpora.

Page 16: Using corpora in instruction

Steps in a DDL activity

1.  Illustration: looking at data.

2.  Interaction: discussion and sharing observations and opinions.

3.  Intervention: optional step to provide learners with hints or clearer guides for induction.

4.  Induction: making one’s own rule for a particular feature.

Flowerdew, 2009

Page 17: Using corpora in instruction

Let’s look at some examples.

¤  Teach vocabulary (word meaning, collocations, differences in meaning, parts of speech)

¤  Grammar (valuable for looking at differences between forms)

¤  Corpora are especially useful for teaching lexico-grammatical information.

Page 18: Using corpora in instruction

CAN (modal): how does it function

¤  Students examine a corpus to find examples of different functions of the modal CAN.

¤  CAN has four functions in English: ¤  Ability

¤  Possibility

¤  Permission

¤  Request

•  Find examples of each function.

Page 19: Using corpora in instruction

Examples of CAN

Page 20: Using corpora in instruction

Expanded context

Page 21: Using corpora in instruction

A number of vs. The number of

¤  With respect to these approaches, __________ number of countries in the world have revised their education systems in general.

¤  And after decades of progress against malnutrition, __________ number of its victims is rising.

¤  In Brazil, for instance, in 1991, __________ number of people 60 years old or older was of 10.7 million.

¤  __________ number of lifecycle assessment software packages have been released in recent years to help designers.

Page 22: Using corpora in instruction

Audience Participation Oh No!

Page 23: Using corpora in instruction

Avaunt: What does it mean?

¤  “Avaunt and quit my sight, thy face is dirty, and thy hands unwashed; Avaunt, avaunt! I say!”

¤  “Avaunt, ye malicious enchanters, avaunt, ye wizard tribe! For I am Don Quixote de la Mancha, against whom your wicked arts avail not.”

¤  Now I comprehend the meaning of thy damned visit; Avaunt, thou hag of darkness, and quit my

¤  He called me villain! bade me avaunt!

Page 24: Using corpora in instruction

Grammercy: What does it mean?

¤  “Ha! Salisbury, good knight, and true,” returned the Frenchman. "I knew not thou wert here. Gramercy for thy caution, else had it fared with me right hardly.”

¤  “Thou shalt have a royal guard of salvages to escort thee whither thou wilt go." "Gramercy for thy courtesy good my Valiant," replied Winslow in the same tone.

¤  “My good lord,” cried Tristram, “gramercy of your goodness which ye showed me in your marches, and of your nobleness in calling me unto your aid, for it is great honour to me that ye ask this, and I will do all for you to the utmost of my strength.”

Page 25: Using corpora in instruction

Considerations for using corpora

¤  What “meta-language” do students need to use corpora or complete inductive activities?

¤  How much do students need to understand about the online tools?

¤  What level is best to have students work with corpora?

¤  What vocabulary/grammar is best taught deductively or inductively?

¤  Research on DDL outcomes is limited.

Page 26: Using corpora in instruction

Thank you?

¤  Feel free to contact me at [email protected]