Presentation is loading. Please wait.

Presentation is loading. Please wait.

Postgraduate Diploma in Translation Lecture 1 Computers and Language.

Similar presentations


Presentation on theme: "Postgraduate Diploma in Translation Lecture 1 Computers and Language."— Presentation transcript:

1 Postgraduate Diploma in Translation Lecture 1 Computers and Language

2 Feb 2005 -- MRDiploma in Translation - Lecture 12 Course Information Web http://www.cs.um.edu.mt/~mros/diptran Lecturers mike.rosner@um.edu.mt ray.fabri@um.edu.mt D. Arnold et al (1994) Machine Translation: an Introductory Guide. See website. H. Somers (2003). Computers and Translation, a Translator’s Guide. See website.

3 Feb 2005 -- MRDiploma in Translation - Lecture 13 Computers and Language Computational Linguistics Emphasis on mechanised linguistic theories. Grew out of early Machine Translation efforts Natural Language Processing Computational models of language analysis, interpretation, and generation. Language Engineering emphasis on large-scale performance example: Google

4 Feb 2005 -- MRDiploma in Translation - Lecture 14 CL: Two Main Disciplines COMP SCILINGUISTICS

5 Feb 2005 -- MRDiploma in Translation - Lecture 15 Linguistics Phonetics: The study of speech sounds Phonology: The study of sound systems Morphology: The study of word structure Syntax: The study of sentence structure Semantics: The study of meaning Pragmatics: The study of language use

6 Feb 2005 -- MRDiploma in Translation - Lecture 16 Grammar Rules: Prescriptive versus Descriptive Prescriptive Grammar Rules for and against certain uses Proscribed forms that are in current use “don’t end a sentence with a preposition” Subjective Descriptive Grammar Rules characterizing what people actually say Goal to characterize all and only that which speakers find acceptable Objective

7 Feb 2005 -- MRDiploma in Translation - Lecture 17 Noam Chomsky Noam Chomsky’s work in the 1950s radically changed linguistics, making syntax central. Chomsky has been the dominant figure in linguistics ever since. Chomsky invented the generative approach to grammar.

8 Feb 2005 -- MRDiploma in Translation - Lecture 18 Generative Grammar: Key Points A language is a (possibly infinite) set of sentences. Grammar is finite. Grammar of a particular language expresses linguistic knowledge of that language Theory of Grammar includes mathematical definition of what a grammar is. The “Theory of Grammar” is a theory of human linguistic abilities. [source: Sag & Wasow]

9 Feb 2005 -- MRDiploma in Translation - Lecture 19 Theories of Sentence and Word Structure: Rewrite Rules Rules can be used to specify the sentences of a language. Rules have the form LHS  RHS LHS may be a sequence of symbols RHS may be a sequence of symbols or words. Lexicon specifies words and their categories

10 Feb 2005 -- MRDiploma in Translation - Lecture 110 A Simple Grammar/Lexicon grammar: S  NP VP NP  N VP  V NP lexicon: V  kicks N  John N  Bill S NP N Johnkicks NPV VP N Bill

11 Feb 2005 -- MRDiploma in Translation - Lecture 111 Formal v. Natural Languages Formal Languages Arithmetic 3290 1 1010101 Logic  x man(x)  mortal(x) URL http://www.cs.um.edu.mt Natural Languages English John saw the dog German Johann hat den hund gesehen Maltese Ġianni ra kelb

12 Feb 2005 -- MRDiploma in Translation - Lecture 112 Points of Similarity A language is considered to be a (possibly infinite) set of sentences. Sentences are sequences of words. Rules determine which sequences are valid sentences. Sentences have a definite structure. Sentence structure related to meaning.

13 Feb 2005 -- MRDiploma in Translation - Lecture 113 Points of Difference Formal Languages The grammar defines the language Restricted application Non ambiguous Natural Languages The language defines the grammar Universal application Highly ambiguous

14 Feb 2005 -- MRDiploma in Translation - Lecture 114 Ambiguity Morphological Ambiguity en-large-ment Lexical Ambiguity the sheep is in the pen Syntactic Ambiguity small animals and children laugh Semantic Ambiguity every girl loves a sailor Pragmatic Ambiguity can you pass the salt? The management of ambiguity is central to the success of CL in general and MT in particular.

15 Feb 2005 -- MRDiploma in Translation - Lecture 115 Computer Science The study of basic concepts Information Data Algorithm Program The application of these concepts to practical tasks. Implementation of computational models from other fields.

16 Feb 2005 -- MRDiploma in Translation - Lecture 116 Information Information is an theoretical concept invented by Shannon in 1948 to measure uncertainty. The units of this measure are called bits. Length – metres Weight – kilos Information – bits 1 bit is the amount of uncertainty inherent to a situation when there are exactly two possible outcomes. Example: for breakfast I will have coffee or I will have tea (nothing else). When I tell you that I have tea, I have conveyed one bit of information. The greater the number of possible outcomes, the more bits of infomation involved in the statement that indicates the actual outcome.

17 Feb 2005 -- MRDiploma in Translation - Lecture 117 Data A formalized representation of facts or concepts suitable for communication, interpretation, or processing by people or automated means. Example: a telephone directory Unlike information, which is abstract, data is concrete Data has a certain level of structure. In the telephone directory, for example, we have the structure of a list of entries, each of which has a name, an address, and a number.

18 Feb 2005 -- MRDiploma in Translation - Lecture 118 Algorithm A well defined procedure for the solution of a given problem in a finite number of steps Abstract Designed to perform a well-defined task. Finite description length. Guaranteed to terminate.

19 Feb 2005 -- MRDiploma in Translation - Lecture 119 Algorithm for Chocolate Cake

20 Feb 2005 -- MRDiploma in Translation - Lecture 120 Program to Add X and Y subtract 1 from X add 1 to Y X = 0? Read X and Y X = 2, Y = 3 yesno Output Y

21 Feb 2005 -- MRDiploma in Translation - Lecture 121 Computer Program A set of instructions, written in a specific programming language, which a computer follows in processing data, performing an operation, or solving a logical problem. Concrete A program can implement an algorithm. More than one program may implement the same algorithm. Not all programs express good algorithms!

22 Feb 2005 -- MRDiploma in Translation - Lecture 122 Instructions vs. Execution Steps 1.Read X 2.Read Y 3.X = X-1 4.Y = Y+1 5.If X = 0 then Print(X) else goto 3 How many instructions? How many execution steps?

23 Feb 2005 -- MRDiploma in Translation - Lecture 123 Algorithms and Linguistics Does linguistic theory make sense without implementing the concepts? Linguistic theory provides linguistic knowledge in the form of grammar rules theories about grammar rules Putting knowledge to some use involves processing issues: parsing generation

24 Feb 2005 -- MRDiploma in Translation - Lecture 124 Computational Linguistics – Issues How are a grammar and a lexicon represented? How is the structure of a given sentence actually discovered? How can we actually generate a sentence to express a particular meaning? How can linguistic theory be made concrete enough to test algorithmically? Can an artificial system learn a language with limited exposure to grammatical sentences?

25 Feb 2005 -- MRDiploma in Translation - Lecture 125 Non computational theories can be misleading Representational details omitted. Computer memory requirements omitted. Nature of individual steps may be unclear. Difficult to test. Potentially unimplementable

26 Feb 2005 -- MRDiploma in Translation - Lecture 126 Example of a Non Computational Model

27 Feb 2005 -- MRDiploma in Translation - Lecture 127 Computers and Language Twin Goals Scientific Goal: Contribute to Linguistics by adding a computational dimension. Technological Goal: Develop machinery capable of handling human language that can support “language engineering”

28 Feb 2005 -- MRDiploma in Translation - Lecture 128 Computers and Language Tools & Resources Grammar Formalisms, e.g. Definite Clause Grammars Parsing Algorithms sentence  structure Generation Algorithms structure  sentence Statistical Methods Linguistic Corpora

29 Feb 2005 -- MRDiploma in Translation - Lecture 129 Computers and Language: Applications Information Retrieval/Extraction Document Classification Question Answering Style and Spell Checking Integrated Multimodal Tasks Machine Translation


Download ppt "Postgraduate Diploma in Translation Lecture 1 Computers and Language."

Similar presentations


Ads by Google