Using Corpora for Teaching Chinese Dr. Adam Kilgarriff Lexical Computing Ltd Leeds University UK.

Slides:



Advertisements
Similar presentations
Multilingual Education from Traditional to e-Learning ZuoRui SHEN China Agricultural University Beijing , China APAN Meeting,
Advertisements

The Cambridge Learner Corpus, English Profile, the Sketch Engine and the Kelly Project Adam Kilgarriff Lexical Computing Ltd
Finding multiwords of more than two words Adam Kilgarriff, Pavel Rychly, Vojtech Kovar, Vıt Baisa Lexical Computing Ltd; Masaryk Univ., Cz.
1 Corpora for all Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of Leeds and Sussex.
MIP - Politecnico di MilanoCopyright prof. M. Garetti 1 AIM 2004 Conference Education for IM MSc engineer in the 21th century Item: Learning Methodologies.
L EARNERS ’ D ICTIONARY Deny A. Kwary
The Sketch Engine -What is The Sketch Engine? -What is a corpus? -Looking at the BASE and the BAWE corpora. -How can this help.
Making useful wordlists for ELT Topical vocabulary from the WWW Simon Smith & Scott Sommers Ming Chuan University, Taipei Adam Kilgarriff, Lexical Computing.
Talking about your homework News story? –What made you choose…? One of your words? –What made you choose…? (Give your vocabulary books to another student.
1 Corpora for the coming decade Adam Kilgarriff Lexical Computing Ltd.
Today Writing: using the comma –Writing task Corpus linguistics talk, Part 2 Re-organize groups –Group news discussion.
Corpora and Language Teaching
Considerations in Selecting Teaching Materials for K-12 Chinese Classes Xiaoning Wang ChinaSprout, Inc. April 27, 2007.
1 Vocab Assessment & Corpora and Concordancing Major vocabulary assessment tools Major corpora and concordancers.
Speaking Of all the four skills (speaking , listening, reading, and writing) speaking seems intuitively the most important. Most foreign language learners.
What's on the Web? The Web as a Linguistic Corpus Adam Kilgarriff Lexical Computing Ltd University of Leeds.
Memory Strategy – Using Mental Images
Labels: automation Adam Kilgarriff. Auckland 2012Kilgarriff / Labels: automation2 Which words are:  Most distinctive of business English?  Most often.
Tomaž Erjavec 1, Adam Kilgarriff 2, Irena Srdanović Erjavec 3 1 Jožef Stefan Institute, Slovenia 2 Lexical Computing Ltd. and University of Leeds, UK 3.
Using Corpora for Teaching Chinese Dr. Adam Kilgarriff Lexical Computing Ltd Leeds University UK.
1 Linguistic evidence within and across languages, word frequency lists and language learning Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass.
Using corpora for bespoke language teaching
1 Corpora, Language Technology and Maltese Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd University of Sussex.
Terminology, translation, and PRESEMT; word frequency lists and KELLY 1 Adam Kilgarriff Lexical Computing Ltd SKEW-2, March 2011Kilgarriff: PRESEMT and.
GDEX: Automatically finding good dictionary examples in a corpus Adam Kilgarriff, Miloš Husák, Katy McAdam, Michael Rundell, Pavel Rychlý Lexical Computing.
1 Corpora, Dictionaries, and points in between in the age of the web Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of.
Class 3 Corpora in language teaching. Current trends in FLT  Communicative Language Teaching  Trends within CLT authentic language contextualised language.
1 Corpora, Language Technology and Maltese Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd University of Sussex.
A Brief Introduction My Chinese name is Cao Rui, which means I am bright. In fact, I am good at communicate with others. My Chinese name is Cao Rui, which.
Why We Need Corpora and the Sketch Engine Adam Kilgarriff Lexical Computing Ltd, UK Universities of Leeds and Sussex.
1 Comparable Corpora Within and Across Languages, Word Frequency Lists and the KELLY Project Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass.
Comparable Corpora BootCaT (CCBC) (or: In Praise of BootCaT) Adam Kilgarriff, Jan Pomikalek, Avinesh PVS Lexical Computing Ltd. Work Supported by EU FP7.
1 Using Corpora in Language Research -also Introduction to the Sketch Engine (WS15) part 1 Adam Kilgarriff Lexical Computing Ltd Universities of Leeds.
1 Introduction to Linguistics Teacher: Simon Smith ( 史尚明 ) – “Dr Smith”, “Simon” or “ 老師 ”: OK – “Smith” or “Teacher”: not OK This semester’s course: –
Corpora and Concordancers in ESL/EFL Class: Truly Authentic Language for Language Learning. and opening.
Corpus Evaluation Adam Kilgarriff Lexical Computing Ltd Corpus evaluationPortsmouth Nov
Using Corpora in Language Research Adam Kilgarriff Lexical Computing Ltd Universities of Leeds January 2013Adam Kilgarriff.
Malta, May 2010Kilgarriff: Corpora by Web Services1 Corpora by Web Services Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities.
The Rise of the Asian Giants: China and India compared Martin Wolf, Associate Editor & Chief Economics Commentator, Financial Times Leverhulme Centre for.
October 3 rd, Present Perfect Continuous Its structure: Subject [ I, she, he, you, we …] Auxiliary verb [Have / Has] Auxiliary verb Been main verb.
New RCLayout. Do product layout 3 improvements All products Local databases New functionalities.
The Sketch Engine as Infrastructure for Large Scale Text Collections for Humanities Research Adam Kilgarriff Lexical Computing Ltd. & Univ of Leeds, UK.
Here’s what the article says: “Ivan Vazov” Primary School has been approved to work on another Comenius multilateral project “Art on the Move”. “The.
SURVEY RESULTS. NUMBERS OF QUESTIONNAIRES COMPLETED  ITALY 186  AUSTRIA 132  POLAND 163  ENGLAND 151  SPAIN 139  TURKEY 182  ROMANIA 52.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
Grammar is to Meaning as the Law if to Good Behaviour Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of Leeds and Sussex.
Corpus search What are the most common words in English
What is Academic Research and Where Does It Come From? Database v Internet.
GDEX: Automatically finding good dictionary examples in a corpus Auckland 2012Kilgarriff: GDEX1.
Exploring Variation in Lexis and Genre in the Sketch Engine Adam Kilgarriff Lexical Computing Ltd., UK Supported by EU Project PRESEMT.
An introduction for parents Jane Williams. To be a lifelong learner there a certain skills and attributes a person needs in order to be a successful lifelong.
OBJECTIVES: Learners will understand there is a digital disparity among students Learners will understand what causes digital inequality Learners will.
Present Perfect Present Perfect Continuous 6 Th class theory presentation Natalia A.Maximova School #328.
21 st century Teaching and Learning District Educator Deborah Harris EDU620: Meeting Individual Student Needs With Technology Instructor: Adriane Wheat.
What is a Corpus? What is not a corpus?  the Web  collection of citations  a text Definition of a corpus “A corpus is a collection of pieces of language.
Class 3 Corpora in language teaching. Trends in FLT  Communicative Language Teaching  Trends within CLT authentic language contextualised language focus.
GDEX: Automatically finding good dictionary examples in a corpus Kivik 2013Kilgarriff: GDEX1.
GDEX: Automatically finding good dictionary examples in a corpus.
What is Academic Research and Where Does It Come From? Database v Internet.
INTERNATIONAL RELATIONS
Using language corpora in developing Arabic lessons & syllabuses
Just over 6,500 The Importance of Languages
Making useful wordlists for ELT
A self-introduction Mark Wyatt.
Corpora and Concordancers in ESL/EFL Class:
30 December, 2018 IBC – Exam Information.
Revision Obama and Brown.
A self-introduction Mark Wyatt.
Claro ScanPen Reader By Claro Software Limited
Corpora, Language Technology and Maltese
Presentation transcript:

Using Corpora for Teaching Chinese Dr. Adam Kilgarriff Lexical Computing Ltd Leeds University UK

The argument  English  Biggest language-teaching market  For thirty years  Corpora  We have learnt how to use them  Chinese  Learn from English

Global politics  My son Boris’s history course  The American century  The 20 th Century  The 21 st century  Chinese?  Economics  Culture  Confucius Institutes ( 孔子學院 ), 2004

The role of corpora in language teaching  Word lists  Dictionaries  Textbooks  In the classroom

Word lists  Define a syllabus  Define different levels of competence  Testing

“What words shall we teach?”  Guess  Copy  Count  Teach the commonest

English word lists  Since 1940s  For US school teaching  Thorndike and Lorge, 1944  For English as a Foreign Language  West’s General Service List, 1953  Very well done  Dominant until recently  Early 1990s: British National Corpus  My lists (1995)  200 downloads per month ever since

Kelly project  European Union Lifelong Learning Program  “Word cards”  Nine languages  4 big languages  Arabic Chinese English Russian  5 smaller  Greek Italian Norwegian Polish Swedish  Word cards for all 36 pairs  Which words? use corpora

Kelly database  Nine-language cliques (English members): hospital library music sun theory

Dictionaries  What words to include  (word lists) and  What to say about them  COBUILD project (1980s)  If you do not use corpora you distort  Everyone accepts

Learners need  Phraseology, collocations  Find in corpora  See brochure  Put into dictionaries  examples  From corpora, or they distort

Learner corpora

 Find out what learners do  Strengths  Weaknesses  At different levels  Longitudinal  How learner learns over time

Course design, textbooks  Make sure what you teach is true  Teach commonest things first

Example: English verbs  Which is commonest form/tense  Base  play “let’s play football”  -ing  playing “I’m playing with my friends”  Past  played “He played well yesterday”  Past participle  played “We’ve played six times”

In the classroom  “Data driven learning”  Tom Johns, Birmingham  Students look at concordances  Discover language facts for themselves  It’s tough  Concordances are not easy to read  Motivated, advance learners  Simon Smith, “build your own corpus”  Uses WebBootCaT, Sketch Engine

The role of corpora in teaching Chinese  Word lists ?  No (but see McEnery and Xiao)  Dictionaries ?  Just very recently  Textbooks ?  no  In the classroom ?  Simon Smith, courses using Sketch Engine

Chinese corpora  Newspaper  Gigawords  From 2003 with updates  Simp and Trad  LIVAC  Live updates  Web corpora

LIVAC, Hong Kong

Web Corpora  From Lexical Computing  Brand new  Simplified  zhTenTen; 2.1 billion words zhTenTen  Traditional  TaiwanWaC; 100m words TaiwanWaC  Segemented using Stanford Univ tools

部落格