2English Profile is…A long-term collaborative research programme aiming to:understand what the CEFR actually means for Englishinvestigate what learner English is really likedevelop detailed reference descriptions for each CEFR levelCEF levels language-independent. This has been done for some other languages.‘I can use simple phrases and sentences to describe where I live and people I know’ – but what specific vocab do they need/use for this? What grammar forms do they need. What functional skills?Vocab strand is most developed, though work going on on other strands.2
4English Vocabulary Profile Assign levels: first A1, A2, B1, B2, now C1 and C2Not just by word but at sense level (= each meaning)Finding out what learners at each level CAN doReflecting current classroom practice and materialsBritish English and American English versions4
5Sources used for A1-B2 levels Spoken and written native speaker corporaCambridge ESOL Vocabulary Lists (KET & PET)Breakthrough, Waystage, Threshold & Vantage syllabiCambridge Readers wordlistsWordlists in current coursebooks & vocabulary booksHindmarsh English LexiconBut the most important source of all is ….5
6Cambridge Learner Corpus Joint development by CUPand Cambridge ESOLOver 43 million wordsOver 150,000 exam scriptsGrows each year by2-3 million wordsAll CEFR levels130 first languages203 countriesOver 20 million words coded for errorsUpdate this slide each year.6
7Corpus-informed research • provides real language data• highlights what is frequent• clearly shows typical patterns: structures, collocations, phrases, phrasal verbs, idioms• captures change over time, as a corpus is constantly updatedI was just going to remind people about the strength of corpus-informed work…
13Insights from 2010 Validation The A1-B2 “Wordlists” have been validated by:researchers in Cambridge, Nottingham, Miami and Tokyostudents tested on phrasal verbs in Tokyo, Bilbao and BrnoCUP authors, editors and lexicographersCambridge ESOL item writers and test developersteachers around the world, via Word of the Week feedbackAmendments and additions have been made as a result.13
14Ron Martínez phrasal expressions Embedded phrase:make your way / 100m in BNCwas within ROUTE sense of way A2 - phrase pulled out, with two senses at B2 and C2We made our way down the river. B2He managed to make his way in the film industry. C2This leads in to the slide with way phrases.
16Scope of the C levels Additional senses of words already in A1-B2 New words supported by corpus evidenceExtension of many word familiesLots of additional phrases and phrasal verbsFrequent idioms, where there is evidence
17Core results for know, with the C1 phrases on the next slide
19Extra sources for C levels IELTS data in Cambridge Learner CorpusAcademic English native speaker corporaAcademic Word List (Coxhead, 2000)Academic Formulas List (Ellis & Simpson-Vlach, 2010)Phrasal expressions analysis (Martínez, 2010)
21Idioms in the EVP Only a handful to B2, eg break the ice Additional category search for A1-C2 versionInclusion dictated by NS frequency and CLCNot included: raining cats and dogsIncluded: behind closed doorsIdioms policy. Raining cats and dogs is a useful one to cite: lots of examples in CLC from 1993, less frequent now… and not useful/current for NS.
23Launch plans for EVP DJK A1-C2 levels sample on EP website A-Z A1-B2 levels available to Network PartnersA-Z A1-B2 levels public release in Sept 2011A-Z A1-C2 release in 2012Data contributors qualify for free access
24Cambridge English Profile Corpus CLC is a unique resource but is solely written exam tasks.The EP Corpus will contain:Spoken learner language (2 million words)Samples of non-exam written material (8 million words)Data collection is via the English Profile website.We need you to get involved and submit student data!24
26English Profile: get involved! • Visit the websiteSign up to free Word of the weekContribute to the Data collection project:CEPC - The Cambridge English Profile Corpus - learner data from around the world26