INTRODUCTION: RESEARCH AREA 1. Chinese Semantics 2. Semantic difference related to syntax 3. Module Attribute Representation of Verbal Semantics (MARVS)

AIM/JUSTIFICATION Chinese has many near-synonymous verbs, which are difficult for foreign learners and a potential problem for machine translation. MARVS is a representational framework for distinguishing the meanings of Chinese verbs through their syntactic structures. This study aims to offer some help to learners of Chinese and a small contribution to machine translation by using MARVS to identify the distinctions between "Guan" and "Kan" (both mean "to look" in Chinese).

REFERENCES
(1) Huang, Chu-Ren, Kathleen Ahrens, Li-Li Chang, Keh-Jiann Chen, Mei-Chun Liu, and Mei-Chih Tsai. The Module-Attribute Representation of Verbal Semantics: From Semantics to Argument Structure. International Journal of Computational Linguistics and Chinese Language Processing.
(2) Liu, Mei-Chun. Corpus-based Lexical Semantic Study of Verbs of Doubt: Huayi and Cai in Mandarin. Concentric.
(3) Liu, Mei-Chun. From Collocation to Event Information: The Case of Mandarin Verbs of Discussion. Language and Linguistics 4 (3).
(4) Chiang, Ting-Yi, Ming-Hui Chou, and Mei-Chun Liu. A Frame-based Approach to Polysemous Near-synonymy: The Case with Mandarin Verbs of Expression. Journal of Chinese Language and Computing 15 (3).
(5) Chung, Siaw-Fong, and Kathleen Ahrens. Forthcoming. MARVS Revisited: Operationalizing Sense Frequency and MI Values. Language and Linguistics: Lexicon, Grammar and Natural Language Processing.
(6) Liu, Mei-Chun, and Ting-Yi Chiang. 2008. The Construction of Mandarin VerbNet: A Frame-based Study of Statement Verbs. Language and Linguistics 9.2.
(7) Wang, Juan. A Corpus Based Study on the Chinese Near-Synonymous Verbs of Running.

RESEARCH QUESTIONS Can the two near-synonymous Chinese verbs of looking be used interchangeably in all contexts? If not, how do their distributions differ? How can the differences be explained using the MARVS model? (Two more steps will be introduced here.)

MATERIALS/INSTRUMENTS Module Attribute Representation of Verbal Semantics (MARVS): Verb, Sense, Eventive Information. Share Sense and Mutual Information Value. Event Module and Event-Internal Attribute; Role Module and Role-Internal Attribute.
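The slide names a mutual information value without defining it. One common definition in corpus linguistics is pointwise mutual information, log2 of P(verb, word) / (P(verb) P(word)). Whether the study uses exactly this definition is an assumption, and the function name and the counts below are hypothetical. A minimal sketch:

```python
import math

def pmi(pair_count, verb_count, word_count, total):
    """Pointwise mutual information (base 2) of a verb and a co-occurring word.

    pair_count: how often the verb and the word occur together
    verb_count, word_count: how often each occurs on its own
    total: size of the sample the counts are drawn from
    All four values are hypothetical inputs, not figures from the study.
    """
    if min(pair_count, verb_count, word_count, total) <= 0:
        raise ValueError("all counts must be positive")
    p_pair = pair_count / total
    p_verb = verb_count / total
    p_word = word_count / total
    return math.log2(p_pair / (p_verb * p_word))

# Made-up counts: the pair occurs 10 times among 10,000 tokens.
score = pmi(pair_count=10, verb_count=100, word_count=50, total=10_000)
```

A positive score means the verb and the word co-occur more often than chance would predict, a score near zero means they are roughly independent, and a negative score means they co-occur less often than chance.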

METHODOLOGY ---SOURCE The corpus I plan to use was developed by the Center for Chinese Linguistics at Peking University. It includes both modern and classical Chinese data. The modern Chinese data contain both spoken and written material, but the spoken material makes up only a small percentage (0.04%). I am therefore also looking for a Chinese corpus dominated by spoken data.

METHODOLOGY ---PROCEDURE First, all instances of each of the two verbs will be retrieved from the corpus. Second, the instances of each verb will be classified into different types of syntactic patterns. Third, the aspectual type associated with each verb will be examined. Finally, I will use the modified MARVS model to explain the differences between the verbs.
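The first three steps of the procedure can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sentences are made-up toy data in pinyin, already segmented into words, and the three pattern labels are a hypothetical simplification of the syntactic and aspectual classification, not the categories the study will use.

```python
from collections import Counter

# Hypothetical toy data: pre-segmented sentences in pinyin.
# Real input would be concordance lines retrieved from the corpus.
SENTENCES = [
    ["wo", "kan", "le", "yi", "ben", "shu"],
    ["ta", "zai", "kan", "dianshi"],
    ["women", "guan", "richu"],
    ["ta", "kan", "zhe", "wo"],
]

# Mandarin aspect markers in pinyin: perfective, durative, experiential.
ASPECT_MARKERS = {"le", "zhe", "guo"}

def find_instances(sentences, verb):
    """Step 1: collect (sentence, position) for every occurrence of the verb."""
    return [(s, i) for s in sentences for i, w in enumerate(s) if w == verb]

def classify(sentence, index):
    """Steps 2 and 3, simplified: label an instance by the word after the verb."""
    nxt = sentence[index + 1] if index + 1 < len(sentence) else None
    if nxt is None:
        return "V (no complement)"
    if nxt in ASPECT_MARKERS:
        return f"V + {nxt}"
    return "V + object"

def pattern_counts(sentences, verb):
    """Tally how often each pattern occurs for one verb."""
    return Counter(classify(s, i) for s, i in find_instances(sentences, verb))

kan_patterns = pattern_counts(SENTENCES, "kan")
guan_patterns = pattern_counts(SENTENCES, "guan")
```

Comparing the two counters side by side gives the kind of distributional contrast that the final, qualitative step would then try to explain.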

METHODOLOGY ---TYPE OF DATA AND ANALYSIS Data collection: written and spoken data. Initial stage: a quantitative approach to analyze the data and identify their common features. Later stage: a qualitative approach to explain the differences under the guidance of the modified MARVS.

A Corpus-Based Study on the Chinese Near-Synonymous Verbs of Looking Using Modified MARVS, by Sun He

ANTICIPATED PROBLEMS/LIMITATIONS OF THE STUDY The data I can collect will probably be skewed toward written material, since spoken data are hard to access. The results will therefore not be very comprehensive.

WHAT DO YOU EXPECT TO FIND? Kan probably has an event focus on both the starting point and the endpoint of the event, while Guan does not. Guan tends to co-occur with art-related words, while Kan can co-occur with a wider variety of nouns.