Today
Writing: using the comma
– Writing task
Corpus linguistics talk, Part 2
Re-organize groups
– Group news discussion



1.
2. He left the scene of the accident and tried to forget that it had happened.
3. Oil which is lighter than water rises to the surface.
4. Madame de Stael was an attractive gracious lady.
5. Nice is a word with many meanings and some of them are contradictory.
6. Taxicabs that are dirty are illegal in some cities.
7. The uninvited guest wore a dark blue tweed suit.
8. I hope that some day he will learn how to be polite.
8. Mark Twain's early novels I believe stand the test of time.
9. Write the editor of the Atlantic 8 Arlington Street Boston Massachusetts
10. He replied "I have no idea what you mean."
11. After a good washing and grooming the pup looked like a new dog.
12. Men who are bald are frequently the ones who are the most authoritative on the subject of baldness.
13. Hello Kitty cellphones which are very popular in Japan have not really caught on in Taiwan.

Introduction to corpus linguistics Simon Smith & Adam Kilgarriff

Plan for today
Short review of corpus basics
4 ages of corpus research
– From pre-computer age to SkE
Functions of SkE
Demonstration of SkE in use

Quiz
What’s a (linguistic) corpus?
What does the Latin word mean?
What are corpora?
What’s the BNC?
How big is the British National Corpus?
What is the advantage of having a very big corpus?
What can corpora be used for?

5 major uses for linguistic corpora
Language learning and teaching
Theoretical research on language and linguistics
Literary research and analysis
Language technology
Lexicography (= dictionary making)
– Cobuild, Longman, …
– All learner dictionaries now use corpora

How do you make a dictionary? (What sources can you use?)
Use your own knowledge of words
Ask all your friends for their knowledge
Consult other dictionaries – and copy them
Read thousands of books – and take lots of notes
Use a corpus

Four ages of corpus research (in lexicography)
Taiwan, Dec 2006. Kilgarriff, Lexical Computing
Age 1: Pre-computer
Age 2: KWIC concordance (KWIC = ?)
Age 3: Corpus query tools, e.g. Sketch Engine

Age 1: Pre-computer
First Oxford English Dictionary (1860): 20 million index cards
– a word (usually rare) and a citation

Age 2: KWIC Concordance

Age 2 (~ ): KWIC Concordances
Using computers
List of lines which contain a keyword
The keyword is in the middle
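A KWIC display of the kind described on this slide can be sketched in a few lines of Python. This is only an illustration, not how Sketch Engine or any other concordancer is implemented: the function name `kwic`, the `width` parameter, and the sample text are all made up for the example.

```python
import re

def kwic(text, keyword, width=30):
    """Return one concordance line per hit, with the keyword in the middle.

    `width` (an assumed parameter) is the number of characters of context
    shown on each side of the keyword.
    """
    lines = []
    pattern = r"\b" + re.escape(keyword) + r"\b"
    for m in re.finditer(pattern, text, flags=re.IGNORECASE):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        # Right-align the left context and left-align the right context
        # so that every keyword lands in the same column.
        lines.append(f"{left:>{width}} {m.group(0)} {right:<{width}}")
    return lines

# Invented sample text, not taken from any real corpus.
sample = ("The party lasted all night. She joined a political party. "
          "He was a party to the agreement.")

for line in kwic(sample, "party", width=25):
    print(line)
```

Because every keyword starts in the same column, the lines can be scanned quickly, which is the point of the "keyword in the middle" layout.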

The coloured pens method
1 political association
2 social event
3 group of people
4 person in an agreement/dispute
5 to be party to something...

Age 2: limitations
As corpora get bigger: too much data
– 50 lines for a word: read all
– 500 lines: could read all, takes a long time
– 5000 lines: impossible


Why do corpora keep getting bigger? (anyone?)
Improvements in technology
– Price of storage is going down
– Speed of access is going up
Representativeness
– Small corpus → many examples of common words, maybe
– But not enough examples of unusual words

Lexical distribution
What’s the most common word in English?
What % does it make up of a whole corpus?
The 100 most common words make up __% of all the words in a corpus?
The 7500 most common words make up __%
Answers:
– The, 5%, 45% and 90%
So:
– you need massive corpora, if you want to really represent rare words properly
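The coverage figures in this quiz can be checked on any text with a short script. The sketch below is a minimal illustration: the helper names `tokenize` and `coverage` and the toy sentence are assumptions for the example, and a toy text will not reproduce the 5%, 45% and 90% figures, which describe a large corpus.

```python
from collections import Counter
import re

def tokenize(text):
    """Very rough tokenizer (an assumption): lowercase runs of letters."""
    return re.findall(r"[a-z']+", text.lower())

def coverage(tokens, top_n):
    """Share of all tokens accounted for by the top_n most frequent words."""
    if not tokens:
        return 0.0
    counts = Counter(tokens)
    covered = sum(count for _, count in counts.most_common(top_n))
    return covered / len(tokens)

# Toy text for illustration only; use a real corpus file in practice.
toy_text = "the cat sat on the mat and the dog sat on the log"
tokens = tokenize(toy_text)

print("most common word:", Counter(tokens).most_common(1)[0])
for n in (1, 3, 100):
    print(f"top {n} words cover {coverage(tokens, n):.0%} of the text")
```

Running the same functions on a corpus of millions of words should show the pattern from the slide: a handful of very common words covers much of the text, while most distinct words are rare.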

Limitation of KWIC analysis
As corpora get bigger: too much data
– 50 lines for a word: read all
– 500 lines: could read all, takes a long time
– 5000 lines: no
Instead, look at a Word Sketch from Sketch Engine
– a statistical summary of word usage
– shows most common collocates



Functions of SkE
KWIC concordance
– Sorting, filtering etc
Word sketch
Automatic thesaurus
Sketch difference
– discriminate near-synonyms

Lexical approach to language learning
Lewis (1993) and Schmitt (2000) say
– the vocab is stored in the brain in collocations
– Bacon is stored near eggs
– 蛋 (egg) is stored near 炒飯 (fried rice)
– scotch is stored with whisky
Saying strong car or powerful tea or broken house seems very “foreign”

From www.teachingenglish.org: a lexical approach activity, based on a story text

News task
4 sentences
News story must be from the current week
Please include the date when you print it
Make two lists of adjectives:
– (+) exciting; dramatic; unusual…
– (-) dull; complicated; bloodthirsty…
Choose the best story from your group
– I’m not very keen on that story because…
– I prefer this story because…

Collocations and sentences
5 words
Use the SkE beta
Say which corpus you used
3 collocations for each word
– State the frequency
– State the salience (顯著性)
Example sentence from SkE should use one of the collocations you chose
If you don’t understand the sentence, don’t use it!
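To see how a salience score differs from a raw frequency, here is a small sketch using logDice, a statistic that Sketch Engine documents for its word sketches. Whether the SkE beta used in this task reports logDice or another score is an assumption here, so treat the numbers SkE shows you as the ones to write down. All counts below are invented for illustration.

```python
import math

def log_dice(f_xy, f_x, f_y):
    """logDice = 14 + log2(2 * f_xy / (f_x + f_y)).

    f_xy: how often the two words occur together
    f_x, f_y: how often each word occurs on its own
    The theoretical maximum is 14 (the words always occur together).
    """
    if f_xy <= 0:
        raise ValueError("the pair must occur at least once")
    return 14 + math.log2(2 * f_xy / (f_x + f_y))

# Hypothetical counts for collocates of one headword (not real corpus data).
headword_freq = 1000
candidates = {
    # collocate: (co-occurrence count, collocate's own frequency)
    "rare_partner": (100, 600),
    "common_word": (150, 50000),
}

for collocate, (f_xy, f_y) in candidates.items():
    score = log_dice(f_xy, headword_freq, f_y)
    print(f"{collocate}: frequency={f_xy}, salience={score:.2f}")
```

In this made-up example the second collocate has the higher frequency but the lower salience, because it is a very common word that occurs with almost everything. That is why the task asks for both numbers.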

Before this week’s reading, ask:
How many different cuisines can you name, from around the world?
Which cuisine do you think is the healthiest?