RESEARCH DESIGN & CORPUS COMPILATION. Corpus design is an intrinsic and fundamental part of the analysis. It is guided by the research question (RQ) and affects the results.



Design criteria are interpretative and must be explicit: say why you chose the texts you did, and how and why you organised them in the way you did. Different purposes = different corpora.

Corpus design: What? Which? When? Where? How? Why?

What: Choosing discourse type(s). There are epistemological considerations and there are practical considerations. General vs. topical.

Which: Choosing variables. You need to hold at least one variable constant or your corpora are not really comparable, e.g. same time period, different newspapers; or same kind of newspaper, different time periods.
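As a rough illustration of the constant-variable requirement, the sketch below lists the variables that two corpora share. The metadata fields (`period`, `newspaper_type`) and their values are hypothetical, chosen only for this example; a real design would record whatever variables the RQ calls for.

```python
def constant_variables(corpus_a, corpus_b):
    """Return the names of variables that have the same value in both corpora."""
    shared = corpus_a.keys() & corpus_b.keys()
    return sorted(name for name in shared if corpus_a[name] == corpus_b[name])


# Hypothetical design: same time period, different kinds of newspaper.
broadsheets = {"period": "2010-2012", "newspaper_type": "broadsheet"}
tabloids = {"period": "2010-2012", "newspaper_type": "tabloid"}

held_constant = constant_variables(broadsheets, tabloids)
if held_constant:
    print("Comparable; held constant:", ", ".join(held_constant))
else:
    print("No variable is held constant: the corpora are not really comparable.")
```

An empty result is a warning sign: if nothing is held constant, any difference you find could be due to any of the variables.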

Comparison: You are comparing and looking for patterns. One occurrence of anything is not enough. A pattern is: a) a figure that emerges from a homogeneous background by means of differentiation; b) the accumulation of similar things; c) recurring regularities of form.

Comparative analysis: Looking at DIFFERENCE and SIMILARITY, across corpora and within corpora.

Parameters across corpora: mode (written vs. spoken); discourse type (e.g. factual vs. fiction); time (diachronic studies); variety (e.g. British English vs. American English); geography (e.g. national vs. local newspapers); political tendency (e.g. Democrat vs. Republican speeches); individual (e.g. George Eliot vs. Thomas Hardy)...

Parameters within corpora: sub-corpora (e.g. headlines vs. articles; news vs. comment); specific lexical items (e.g. moral vs. ethic; boy vs. girl; immigrant vs. asylum seeker vs. refugee...).

Collections of texts, not one text: the integral output of a source-unit (e.g. a whole edition of a newspaper); the corpus of works by one author (not a single text).

Topic-based corpus: Search-term(s) based collection. You gather texts by searching a database for all the texts containing the search term(s). Identify the list of search terms carefully, so that the coverage of the topic is as complete as possible.

Time-based corpus: Historical linguistics; diachronic change/stability of language; modern diachronic analysis. See the MD-CADS issue of Corpora for examples (Partington 2010).

Research questions: All the choices we make in the corpus design and data collection phase (e.g. what to collect, how to collect it, from which platform, in which format) depend on the RQ!

Practical considerations availability access collection speed storage format


RQ example 1: How are Muslims represented in the British press? What are the appropriate search terms? muslim*, moslem*, islam*...? Consider synonyms and near-synonyms, alternative spellings, etc.
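To make the wildcard search terms concrete, here is a minimal sketch of how terms such as muslim* could be turned into regular expressions and used to select texts. It assumes the texts are already available as plain strings; the helper names and the sample sentences are invented for illustration, and a newspaper database will have its own query syntax.

```python
import re


def wildcard_to_regex(term):
    """Turn a search term such as 'muslim*' into a whole-word regular expression."""
    stem = re.escape(term.rstrip("*"))
    suffix = r"\w*" if term.endswith("*") else ""
    return re.compile(r"\b" + stem + suffix + r"\b", re.IGNORECASE)


def matches_any(text, terms):
    """Return True if the text contains at least one of the search terms."""
    return any(wildcard_to_regex(term).search(text) for term in terms)


search_terms = ["muslim*", "moslem*", "islam*"]

# Invented sample sentences, for illustration only.
articles = [
    "Local Muslims open a new community centre.",
    "The council approved the new bus timetable.",
    "A lecture on Islamic art drew a large crowd.",
]

selected = [text for text in articles if matches_any(text, search_terms)]
print(len(selected), "of", len(articles), "articles selected")
```

Adding or removing one term from `search_terms` changes which texts end up in the corpus, which is why the list of terms needs to be justified and recorded.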

RQ example 2: How is religion represented in the British press? How many terms do I need to add? How many terms can I add?

RQ example 3: How much attention does the British press give to religion? A search-term based corpus will not tell you. How will you find out? How will you delimit the work? (By narrowing and defining the RQ a bit more, e.g. by defining a time period or the type of newspapers under consideration.)

Storage: FOLDERS: folder and file names are a repository of information, a sort of level 0 of mark-up. FILES become our definition of what counts as a text, i.e. the unit of analysis.

Best practice: Distribute information between FOLDER and FILE according to the structure of your corpus (and to your RQ). Avoid having more than 2 or 3 levels of folders. Keep names short but dense with information.

Example 1: Do newspapers use the same language 20 years apart? Which of the British broadsheets has changed the most?

Storage for example 1:
CORPUS
  year 1
    Newspaper1
      y1_n1_f1
      y1_n1_f2
      y1_n1_f3
      y1_n1_f4
      ...
    N2
    N3
  year 2
    N1
    N2
    N3
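The naming scheme above can be sketched in a few lines: the file name carries year, newspaper and file number, so the metadata can be recovered from the name alone. This is only one possible reading of the scheme; the folder names `year1` and `N1`, the `.txt` extension and both helper functions are assumptions made for illustration.

```python
from pathlib import PurePosixPath


def build_path(year, newspaper, file_number):
    """Build a path such as CORPUS/year1/N1/y1_n1_f1.txt."""
    name = f"y{year}_n{newspaper}_f{file_number}.txt"
    return PurePosixPath("CORPUS") / f"year{year}" / f"N{newspaper}" / name


def parse_name(path):
    """Recover year, newspaper and file number from the file name alone."""
    year, newspaper, file_number = PurePosixPath(path).stem.split("_")
    return {
        "year": int(year[1:]),
        "newspaper": int(newspaper[1:]),
        "file": int(file_number[1:]),
    }


path = build_path(1, 1, 3)
print(path)
print(parse_name(path))
```

Because the file name repeats what the folders say, a file that is moved or copied out of its folder can still be identified.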

Example 2: How are science and religion represented in political discourse?

Solution 1: Science corpus, Religion corpus. Solution 2: Democrat corpus, Republican corpus.

How much? The bigger the better, BUT the size also depends on the purpose! I ask for a minimum of 100,000 words.
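A quick way to check the size requirement is to count the words in your texts. The sketch below assumes a word is any whitespace-separated token, which is a simplification (corpus tools may count differently); the function names and the toy corpus are invented for illustration.

```python
MINIMUM_WORDS = 100_000  # the minimum asked for above


def count_words(texts):
    """Count whitespace-separated tokens across all texts."""
    return sum(len(text.split()) for text in texts)


def meets_minimum(texts, minimum=MINIMUM_WORDS):
    """Return True if the corpus reaches the minimum size."""
    return count_words(texts) >= minimum


# Invented toy corpus, far too small to pass the check.
toy_corpus = [
    "Corpus design is guided by the RQ.",
    "Different purposes, different corpora.",
]
print(count_words(toy_corpus), "words; large enough:", meets_minimum(toy_corpus))
```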

The transformation of texts into textual resources is a process of interpretation, and compilers therefore have the responsibility typically associated with an editor. The questions we ask (and those we do not ask) affect the answers we can get, so it is important to keep track of our expectations and choices and the reasons behind them.

Epistemological reflexivity: you need to ask yourself: How has the research question defined and limited what can be ‘found’? How has the design of the study and the method of analysis ‘constructed’ the data and the findings? How could the research question have been investigated differently? To what extent would this have given rise to a different understanding of the phenomenon under investigation?

Reflexivity is an unavoidable aspect of research: epistemological reflexivity encourages us to reflect upon the assumptions (about the world, about knowledge) that we have made in the course of the research, and it helps us to think about the implications of such assumptions for the research and its findings (Nightingale and Cromby, 1999: 228).

Principles of accountability and replicability: These principles are important in research, and you need to learn to ask yourself how your research follows them. We will be looking at all these issues again and in more detail.

Exam: The exam includes: a first draft, consisting of the abstract and corpus description, presented to the group; a final draft, consisting of the abstract and a copy of your corpus and its description, sent to me; a presentation on the day of the exam. Don’t forget you need to have proof of B2 competence in English to be able to register for the exam.