11-736 Seminar on Endangered Languages Alan W Black, Robert Frederking, Lori Levin, Laura Tomokiyo Language Technologies.

Presentation transcript:

Seminar on Endangered Languages Alan W Black, Robert Frederking, Lori Levin, Laura Tomokiyo Language Technologies Institute Fall 2014

[Intro: see class announcement]

Administrivia
Please register!
Slides will be available on website
Regular meeting time?
Readings, presentations; 15 weeks, incl. today
For next week, read handout from “Language Death” (David Crystal)
  –Individual projects in Tech for Endangered Languages
Everyone signed in?

What are Endangered Languages?
[From here on, REF’s version, drawn largely from “Language Death” (Crystal)]
The size of the problem
  –How do you count languages? How do you count Arabic? Flemish?
  –How do you estimate unknown languages?
  –How do you evaluate the status of a small language? Not just size (Zuni)

Real minority languages: Spanish vs. Apache
CHINESE, MANDARIN
SPANISH
ENGLISH
300M??? Arabic? [if all local versions included together]
…
HAITIAN CREOLE
FRENCH
…
ADYGHE (Russia): 600th
NAVAJO
NYORE (Kenya): 1000th
12693 APACHE, WESTERN
6413 ZUNI: 3000th; less than 10k
775 TLINGIT
…
4 PAWNEE
…
0 AFRO-SEMINOLE CREOLE (

Why should we care?
We need diversity
Languages express identity
  –[REF/German]
Languages are repositories of history
  –And culture
Languages contribute to sum of human knowledge
  –Ethnobiology, technology
Languages are interesting

And it's not hopeless
Welsh
Maori
Hawaiian
Cornish (may have been extinct)

Description vs. Documentation vs. Revitalization
Linguistic Description
Language Documentation
  –“Sleeping” languages: Miami
Language Revitalization

What can LT do about it?
Build MT systems?

LingWear for the Information Warrior
New Ideas
  –The pre-development of appropriate interlingua representations for domains of interest facilitates generation into a new language within two weeks.
  –The development of new MT engines (e.g. learnable transfer rules) and improved multi-engine integration supports rapid deployment of MT for a new language with scarce resources.
  –Gisting and summarization in the source language followed by MT is better than vice versa.
Carnegie Mellon University School of Computer Science: A. Waibel, L. Levin, A. Lavie, R. Frederking
Impact
  –Allow military and relief organizations to converse in limited domains of interest with the local population in an area of conflict and/or disaster
  –Allow military and other operatives in the field to assimilate foreign language information they encounter on-the-move
  –Rapidly port and deploy the technology into new languages with scarce resources
Schedule
  –9/2000: Baseline MT systems functional.
  –12/2000: Baseline text summarizer functional.
  –9/2001: Port to second language complete.
  –9/2002: Port to third language complete.

Diplomat: Rapid-deployment, wearable, speech-to-speech translation
Develop a new language in weeks, graceful improvement for months or years.
Speech in, speech out.
Use user interaction to cope with errors.
Human factors of working with computer-naïve users.
Languages: English and {Croatian, Haitian Creole, Spanish, some Korean, (Arabic)}

AVENUE: Low Cost MT for Minor Languages
Speakers of electronically underrepresented languages can participate in the information age.
Policy makers can access ideas, viewpoints, and information from the developing world.
MT for unforeseen translation needs: e.g., humanitarian aid.
Documentation and preservation of endangered languages.

What can LT do about it?
Build MT systems?
  –AVENUE experience: the community didn’t want state-of-the-art MT, but project had been funded to build MT!
Haitian Creole was difficult, but still not small enough to be endangered!
New language technology methods that the community can use themselves to save their language?

What can LT do about it?
Three of the main ways languages die:
  –children don’t learn them; [REF/German]
  –teenagers forget them;
  –young adults forget them
Spelling checkers, SMS, etc.
Navajo.org [Dene] (but in English!)
  –Winnebago [Hochunk] website errors
Etc.?

Only the beginning…