Viswanatha Naidu Y IIIT-Hyderabad

Presentation transcript:

Corpus-Based Study of Relative Clauses in Hindi & Telugu
Transfer grammar rules for relative clauses
Viswanatha Naidu Y, IIIT-Hyderabad (vnaidu@research.iiit.ac.in)
SCONLI-3, 20-02-09

OUTLINE
Introduction
Hindi relative clause overview
Telugu relative clause overview
Transfer grammar & rules
Examples
Conclusion
20-02-2009 SCONLI-3, India

Relative clause overview
A relative clause restricts or qualifies the meaning of the noun in an NP.
Languages like English mark relative clauses explicitly with relative pronouns.
Relative pronouns occur clause-initially, either alone or as part of a PP or NP.

Types of relative clause
There are two types of relative clauses:
1. Restrictive clause
2. Non-restrictive clause

Restrictive vs. non-restrictive
Restrictive:
Helps the hearer/reader to identify the referent of the noun phrase.
The relative pronoun can be dropped.
Non-restrictive:
Serves to give the hearer an added piece of information about an already identified entity.
Provides additional information.
Used with proper nouns.
How does the hearer/reader know the distinction? Very often the distinction is expressed intonationally, and also orthographically with punctuation markers.

Restrictive clause:
The young linguist whom I saw in the conference lives in Hyderabad.
Non-restrictive clause:
Chomsky, who arrived for the conference, lives in Cambridge, Massachusetts.

The accessibility hierarchy is defined (Comrie, 1977) as:
Subject > Direct object > Non-direct object > Possessive
It is easier to relativize subjects than any of the other positions.
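The hierarchy can be read as an ordered list in which earlier positions are easier to relativize. The following is a minimal Python sketch of that reading only; the names ACCESSIBILITY_HIERARCHY and is_easier_to_relativize are hypothetical and do not come from the slides.

```python
# Positions on the accessibility hierarchy, most accessible first.
# The order is taken from the slide; the identifiers are illustrative.
ACCESSIBILITY_HIERARCHY = [
    "subject",
    "direct object",
    "non-direct object",
    "possessive",
]


def is_easier_to_relativize(position_a, position_b):
    """Return True if position_a sits higher on the hierarchy than position_b."""
    rank = ACCESSIBILITY_HIERARCHY.index
    return rank(position_a) < rank(position_b)


# Subjects are claimed to be easier to relativize than any other position.
print(is_easier_to_relativize("subject", "possessive"))
```

A list index is enough here because the slide gives a single linear ordering; a finer-grained hierarchy would need a richer representation.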

Hindi also has the restrictive / non-restrictive distinction.
Restrictive clause, e.g.:
jo billi mere gara mem hai vaha cUhe se DartA hai
Rel cat.3.sg my house.nom in is that rat Abl afraid is
‘The cat in my house is afraid of the rat.’
Non-restrictive clause:
Chandrababu Naidu, jo Andhra Pradesh kA mukhya mantrI thA, Ajkal yahAM hai.
Chandrababu Naidu REL Andhra Pradesh GEN chief.M minister.M was.sg nowadays here is
‘Chandrababu Naidu, who was the CM of AP, is here nowadays.’

Hindi relative clauses overview
Two types of relative clauses:
1. Correlative clause
2. Participle clause

Correlative clause vs. participle clause
Correlative clause:
Has explicit relative markers.
These markers are preceded by the nouns.
The markers appear in the initial position of the clause.
Participle clause:
Has no explicit markers.
Two types of participle relatives: present participle and past participle.
Modifies the head noun as a relative clause does.

Correlative clauses
In correlatives any NP position can be relativized: subjects, objects, indirect objects, and obliques, including instruments, locatives, etc.
Instrumental relativization:
jis cAku se maine murgI ko kAtA vaha bahut tej thA
REL knife.M with I.nom hen.M Accu cut that very sharp was.M
‘The knife with which I cut the hen was very sharp.’

Present participle & past participle
Present participle:
All verbs yield present participle forms.
They have two functions: adjectival and adverbial.
They inflect for person, number, and gender.
They appear in the form (verb + -tA + huA).
Past participle:
Only a restricted set of verbs yield past participle forms, which indicate achievement.
They have two functions: adjectival and adverbial.
They inflect for person, number, and gender.
They appear in the form (verb + -A + huA).
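The two surface patterns, (verb + -tA + huA) and (verb + -A + huA), can be sketched as simple string templates. This is only an illustration under strong assumptions: it produces the uninflected masculine singular shape, it assumes a regular consonant-final stem, and it does not check whether a verb belongs to the restricted set that allows a past participle. The function names are hypothetical.

```python
def present_participle(stem):
    """Build the pattern verb + -tA + huA (masculine singular shape only)."""
    return f"{stem}tA huA"


def past_participle(stem):
    """Build the pattern verb + -A + huA (masculine singular shape only).

    Assumption: the caller has already checked that the verb is one of the
    restricted set that yields a past participle.
    """
    return f"{stem}A huA"


# "dauD" (run) is used because the form dauDA huA occurs in the
# subject relativization example of this presentation.
print(present_participle("dauD"))
print(past_participle("dauD"))
```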

Hindi participle clauses do not allow relativization of instruments, locatives, etc.; only subjects and objects can be relativized.
Subject relativization:
dauDA huA laDkA acAnak ruk gayA
running was boy suddenly stop came
‘The boy who was running suddenly came to a stop.’
Instrumental relativization:
*mere dwArA kelA kAtA huA cAkU le jAo
I by banana cut did knife.M take away
‘Take away the knife with which I cut the banana.’
Locative relativization:
*mere dwArA baitI huI kurcI bahut mAngI hai.
I by sit.F did chair very costly is
‘The chair in which I sat is costly.’

Telugu relative clause overview
Telugu also has two types of relative clauses:
Correlative clauses
Participle clauses
However, correlatives are not common in Telugu (Bh. Krishnamurthy); they are used in more formal speech. My own corpus work agrees with this: I hardly found any correlatives.
eppudu Akalaite appudu tinAli.
whenever hungry that+time eat
This can be expressed more naturally using participles.

Participle relative clauses
Participles are formed by means of a non-finite construction.
They do not inflect for person, number, or gender.
E.g.:
ADutunna abbAyi bAwunnAdu.
play.v.adj.conti boy.Nom good.3.sg.M
‘The boy who is playing is good.’
Participle relativization is more frequent in Telugu and reaches further down the accessibility hierarchy (oblique forms can also be relativized).

MT ARCHITECTURE
ANALYSIS -> TRANSFER -> GENERATOR
ANALYSIS: POS tagger, chunker, morph analyzer, tokenizer, parser, NER, WSD, etc.
TRANSFER: transfer grammar (1. lexical grammar, 2. structural grammar), transliteration
GENERATOR: default features, TL-specific features, word generator, etc.
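The three-stage architecture can be pictured as a chain of functions. The sketch below is a toy under stated assumptions, not the system described in the talk: analysis is reduced to whitespace tokenization, transfer applies a list of structural rules in order, and generation just joins the tokens. No lexical transfer is performed, so the output is still made of Hindi words. All function names are hypothetical.

```python
def analyse(sentence):
    """Stand-in for the analysis stage: whitespace tokenization only."""
    return sentence.split()


def transfer(tokens, rules):
    """Apply each structural transfer rule in order to the token list."""
    for rule in rules:
        tokens = rule(tokens)
    return tokens


def generate(tokens):
    """Stand-in for the generator: join tokens back into a string."""
    return " ".join(tokens)


def translate(sentence, rules):
    """Chain the three stages: analysis -> transfer -> generator."""
    return generate(transfer(analyse(sentence), rules))


def drop_relative_markers(tokens):
    """Example structural rule: delete the Hindi markers jo and vo."""
    return [t for t in tokens if t not in ("jo", "vo")]


print(translate("jo kal AyI_thI AndhI vo bahut nuksAn kargayI",
                [drop_relative_markers]))
```

Passing the rules as a list keeps the transfer stage independent of any particular rule set, which mirrors the separation between the transfer module and the transfer grammar on the slide.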

Transfer grammar
One of the approaches to Machine Translation.
Captures ‘structural differences’ between SL & TL.
Aims to develop the MT system.

ORGANIZATION OF TRANSFER GRAMMAR
Input
Removing specific features of grammar in SL
Bridging the gap between SL & TL: lexical & structural transfer rules for nouns, verbs, adjectives, adverbs, prepositions, conjunctions, and miscellaneous
Final adjustment by the generator according to the TL features
Output

Relative clause rules
Hindi: जो आँधी कल आयी थी वो बहुत नुकसान कर गयी
jo AmdhI kal AyI thI vah bahut nuksAn kar gayI
Rel storm.F yesterday come.Perf.F.sg Past.F.sg that much damage.M do go.Perf.F.sg
Telugu: నిన్న వచ్చిన తుఫాను చాలా నష్టము చేసింది
ninna vaccina tufAnu cAlA nasTam cesindi
yesterday.Nom come.past.verbal.adj storm.Nom much damage did.Non-Mascu.3.sg
‘The storm that raged yesterday did a great deal of damage.’

Main clause to main clause mapping
Hindi: वो बहुत नुकसान कर गयी (vo bahut nuksAn kar_gayI)
Telugu: Null చాలా నష్టము చేసింది (Null cAlA nasTam cesindi)
So far, in the main clause to main clause mapping, there are no problems except the correlative marker.
Let’s see the subordinate clause mapping.

Subordinate clause mapping
Hindi: जो आँधी कल आयी-थी (jo AndhI kal AyI-thI)
Telugu: Null నిన్న తుఫాను వచ్చింది (Null ninna tufAnu vaccindi)
Here we can clearly see that there is a gap between SL and TL in three aspects:
No relative marker in TL
The tensed SL subordinate verb becomes a non-finite form
Word order change

RULES
1. Change of word order according to TL
2. Deleting the relative markers (jo & vo)
3. Converting the finite into non-finite

Hindi: जो आँधी कल आयी थी वो बहुत नुकसान करगयी
Telugu: Null నిన్న వచ్చిన తుఫాను చాలా నష్టము చేసింది

1. Change of word order according to TL
Hindi: jo AndhI kal AyI_thI vo bahut nuksAn kargayI
Telugu: Null ninna vaccina tufAnu cAlA nasTam cesindi

Hindi: jo kal AyI_thI AndhI vo bahut nuksAn kargayI
Telugu: Null ninna vaccina tufAnu cAlA nasTam cesindi

2. Deleting the relative markers (jo & vo)
Hindi: जो कल आयी थी आंधी वो बहुत नुकसान करगयी
Telugu: Null నిన్న వచ్చిన తుఫాను చాలా నష్టము చేసింది

Hindi: kal AyI_thI AndhI bahut nuksAn kargayI
Telugu: ninna vaccina tufAnu cAlA nasTam cesindi

3. Converting the finite into non-finite
Hindi: कल आयी थी आंधी बहुत नुकसान करगयी
Telugu: నిన్న వచ్చిన తుఫాను చాలా నష్టము చేసింది

Hindi: kal Aya+(Past.V.adj) AndhI bahut nuksAn kargayI
Telugu: ninna vaccina tufAnu cAlA nasTam cesindi

Hindi: कल आय्(Past.v.adj) आंधी बहुत नुकसान करगयी
Telugu: నిన్న వచ్చిన తుఫాను చాలా నష్టము చేసింది
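The three rule applications walked through for this sentence can be sketched as token-list operations in Python. This is a minimal illustration, not the rule formalism used in the actual system. It assumes that the head noun is the token right after jo, that the relative clause ends just before vo, and that the finite-to-non-finite mapping is a one-entry lookup table; all names in the code are hypothetical.

```python
REL_MARKER = "jo"     # relative marker
COREL_MARKER = "vo"   # correlative marker

# Assumed lookup for rule 3; a real system would use a morph analyzer/generator.
NONFINITE_FORMS = {"AyI_thI": "Aya+(Past.V.adj)"}


def reorder(tokens):
    """Rule 1: move the head noun after the rest of the relative clause."""
    start = tokens.index(REL_MARKER)
    end = tokens.index(COREL_MARKER)
    marker, head, rest = tokens[start], tokens[start + 1], tokens[start + 2:end]
    return tokens[:start] + [marker] + rest + [head] + tokens[end:]


def delete_markers(tokens):
    """Rule 2: delete the relative markers (jo & vo)."""
    return [t for t in tokens if t not in (REL_MARKER, COREL_MARKER)]


def to_nonfinite(tokens):
    """Rule 3: replace a finite verb by its non-finite (verbal adjective) form."""
    return [NONFINITE_FORMS.get(t, t) for t in tokens]


source = "jo AndhI kal AyI_thI vo bahut nuksAn kargayI".split()
step1 = reorder(source)
step2 = delete_markers(step1)
step3 = to_nonfinite(step2)
for step in (step1, step2, step3):
    print(" ".join(step))
```

Each printed line corresponds to one intermediate Hindi form in the walkthrough; lexical substitution into Telugu is left out of the sketch.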

REQUIRED RESOURCES
LARGE AMOUNT OF CORPUS REPRESENTING ALL DOMAINS
BROAD COVERAGE MORPH ANALYZER, GENERATOR
BROAD COVERAGE E-BILINGUAL LEXICONS
LARGE AMOUNT OF POS & PARSED TREE BANK CORPUS
LARGE AMOUNT OF PARALLEL CORPORA
Apart from these, there is a great need for a well-trained LINGUIST for modeling the language(s), and a Computer Scientist for implementing the model(s).

Thank you for your patience