Toshiba (China) R&D Center LOU Xiaoyan, LI Jian Research and Development Center, Toshiba China Suggestions on Tone and Word Boundary of Mandarin for SSML.

Slides:



Advertisements
Similar presentations
TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational.
Advertisements

Tones Review / 四声复习 As you know and we have discussed many times, accurate tones are extremely important in Chinese. The ability to pronounce tones correctly,
University of Michigan Flint Zhong, Yan
Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.
Speech Synthesis Markup Language SSML. Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted.
1 SSML The Internationalization of the W3C Speech Synthesis Markup Language SpeechTek 2007 – C102 – Daniel C. Burnett.
Pinyin Foundation (1) Pinyin Foundation (1) 拼音基础.
SSML extensions for multi-language usage Davide Bonardo W3C Workshop on Internationalizing SSML Crete, May 2006.
Communicating with Robots using Speech: The Robot Talks (Speech Synthesis) Stephen Cox Chris Watkins Ibrahim Almajai.
Analyzing Students’ Pronunciation and Improving Tonal Teaching Ropngrong Liao Marilyn Chakwin Defense.
5 Time and Introduction Module (L9-L10) Time and Introduction Quiz 2 Review Chinese IAB (IA +IB)
5 Time and Introduction Module (L9-L10) Time and Introduction test Review Chinese IAB (IA +IB)
4 Residence and Family Module (L7-L8) Residence Family Quiz 1 Review Chinese IAB (IA +IB)
SPOKEN LANGUAGE SYSTEMS MIT Computer Science and Artificial Intelligence Laboratory Mitchell Peabody, Chao Wang, and Stephanie Seneff June 19, 2004 Lexical.
Chapter 3: Formal Translation Models
Chapter 15 Speech Synthesis Principles 15.1 History of Speech Synthesis 15.2 Categories of Speech Synthesis 15.3 Chinese Speech Synthesis 15.4 Speech Generation.
1 Speech synthesis 2 What is the task? –Generating natural sounding speech on the fly, usually from text What are the main difficulties? –What to say.
Copyright © Lumivox International Co., Ltd. Unit 1 Greeting Lesson 1 Hello ! 你好! Unit 1 Greeting Lesson 1 Hello ! 你好! Level.
Position Paper for W3C Workshop on Internationalizing SSML The Usage of Part-Of-Speech for Resolving Multiple Pronunciations in SSML Myoung-Wan.
Speech Synthesis Markup Language -----Aim at Extension Dr. Jianhua Tao National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese.
Prof. Li Universidad Del Este. Review of Greetings.
Building High Quality Databases for Minority Languages such as Galician F. Campillo, D. Braga, A.B. Mourín, Carmen García-Mateo, P. Silva, M. Sales Dias,
1 SSML Extensions for TTS in Indian Languages II workshop on Internationalizing SSML May 2006, Greece Nixon Patel and Kishore Prahallad Bhrigus.
JEITA Speech Group1 Issues of SSML in Japanese Wataru IMATAKE (ANIMO LIMITED) Makoto AKABANE (Sony Computer Entertainment Inc.) Kazuyo TANAKA (Tsukuba.
Goals, Objectives, Rules.  All submissions to Turnitin will follow MLA formatting:  Header (if the piece is two or more pages)  Heading  Font size.
HTML Essentials Markup ( Part I ). Why Markup ? Markup gives meaning and structure to your web page Creates a relationship between the elements.
How IPA is Used in SSML and PLS Paolo Baggia, Loquendo Wed. August 9 th, 2006.
Speech & Language Development 1 Normal Development of Speech & Language Language...“Standardized set of symbols and the knowledge about how to combine.
Comprehension: To Understand Making Instructional Adaptations in Comprehension Instruction Presented by Pam Jones COPESD MiBLSi Conference 2008.
Background Infants and toddlers have detailed representations for their known vocabulary items Consonants (e.g., Swingley & Aslin, 2000; Fennel & Werker,
Document Type Definitions Kanda Runapongsa Dept. of Computer Engineering Khon Kaen University.
PrepTalk a Preprocessor for Talking book production Ted van der Togt, Dedicon, Amsterdam.
Nasal endings of Taiwan Mandarin: Production, perception, and linguistic change Student : Shu-Ping Huang ID No. : NA3C0004 Professor : Dr. Chung Chienjer.
The development of Chinese characters
1 W3C Workshop on Internationalizing SSML SSML Extension for Korean Workshop : 2005/11/02 (Wed) Sang-Jin Kim
SSML 1.1: The Internationalization of SSML Daniel C. Burnett August 9, 2006.
Language Joviltė Beržanskytė PSbns Content: Elements of language Language development The Influence of language to thinking Do animals use language?
Overview of CSSML Yan Jun, Department Manager Anhui USTC iFLYTEK Co., Ltd University of Science & Tech of China.
World Languages Mandarin English Challenges in Mandarin Speech Recognition  Highly developed language model is required due to highly contextual nature.
Phonemic Awareness = Phonics. Phonemic Awareness w The understanding that spoken words are made up of a series of discrete sounds Is different from Phonics:
PETRA – the Personal Embedded Translation and Reading Assistant Werner Winiwarter University of Vienna InSTIL/ICALL Symposium 2004 June 17-19, 2004.
1 A Study on Implementation of Southern-Min Taiwanese Tone Sandhi System Iu n Un-gian Lau Kiat-gak Li Sheng-an.
1 ADVANCED MICROSOFT WORD Lesson 14 – Editing in Workgroups Microsoft Office 2003: Advanced.
Word Study. What do you need to know? Write down the following information!
The eXtensible Markup Language (XML). Presentation Outline Part 1: The basics of creating an XML document Part 2: Developing constraints for a well formed.
890060: Chinese Beginners Course
© 2013 by Larson Technical Services
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
An Introduction to S3ML Beijing InfoQuick SinoVoice Speech Technology Corp. CHEN Ming, LV Shinan, LI Xiulin.
Unit 1: Wǒ zì jǐ 我自己 Myself
INTRODUCTION JavaScript can make websites more interactive, interesting, and user-friendly.
1 UNIT TEST 1 PRACTICE PINYIN 1.Initials / Finals (listening) 2.Sound distinguishing/comparing 3.Pinyin tones 4.Spelling rules Language in use 1.Phrase.
Quick Overview on Tones
StAIR design Learning “hello” and “goodbye” in Chinese Cep811 Siming Hu.
PLS for SSML Paolo Baggia Loquendo Workshop II on Internationalizing SSML.
L197 Beginners' Chinese Module Team, The OpenUniversity Learning Chinese Characters – with ink, keyboard and mobile Apps Department of Languages The Open.
Natural Language Processing Vasile Rus
Lesson 1 Dialogue 1 Grammar University of Michigan Flint Zhong, Yan.
Being a Writer at St Leonard’s
Second Grade Chinese Literacy September 15, 2016
Ni hao, Mandarin – Workshop 2 CESC Asian Studies Term 1, 2015
Kuiper and Allan Chapter 6.2
Kuiper and Allan Chapter 6.2
Research on the Modeling of Chinese Continuous Speech Recognition
Write Clear Text and Messages Lecture-11
Pinyin pinyin is a phonetic system of the Chinese language. It adopts the roman alphabet to represent phonetic sounds in Mandarin Chinese.
University of Michigan Flint Zhong, Yan
Introduction to Pinyin
DEMO CLASS by Frank Arellano LEARNING CHINESE Demo Class for beginners.
Presentation transcript:

Toshiba (China) R&D Center LOU Xiaoyan, LI Jian Research and Development Center, Toshiba China Suggestions on Tone and Word Boundary of Mandarin for SSML

Toshiba (China) R&D Center Outline Tone Word boundary

Toshiba (China) R&D Center Tone (cont…) Importance As important as phonemes in tonal language Same syllables with different tones take different meaning: 妈 (mā) 麻 (má) 马 (mă) 骂 (mà) Sandhi phenomenon in tonal language 你好 ni3 hao3  ni2 hao3 Synthesis with correct tone help listener catch the meaning of speech Non-markup behavior Tone can be achieved by looking up dictionary or applying rules. Errors may occur, especially in dealing with sandhi

Toshiba (China) R&D Center Suggestion on Tone (cont…) Our suggestions Using Pinyin sequence as the value of phoneme element Using number 1, 2, 3, 4 and 5 standing for tone “yin ping”, “yang ping”, “shang sheng”, “qu sheng” and neutral tone in Mandarin: Text: 大都 (dàdoū) Pinyin sequence+tone: /da 4/dou 1/ Solution1: new tone element (optional), with required attribute detail: 大都 Solution 2: new value “t” and “pt”of alphabet attribute in phoneme element 大都

Toshiba (China) R&D Center Note on Tone Markup Possible influence on SSML1.0 Solution 1: Tone element cannot be followed by other element, and can be enclosed by p, s, w(if defined) element Solution 2: phoneme element is modified, the relation to other elements should not change The tone strings given by markup cannot be changed in the text normalization step in the result of looking up the lexicon. Tone markup should be neglected, when Value error of tone Unmatched length of tone sequence

Toshiba (China) R&D Center Outline Tone Word boundary

Toshiba (China) R&D Center Word Boundary (cont…) Word is the basic unit for sentence parsing and understanding. Chinese sentences are composed of sequence of Chinese characters without blanks or spaces to specify word boundaries. Difficulties: Complex words, such as reduplications, derived words, such as “ 简简单单 ”(very easily), “ 非物质 ”(immateriality) Proper nouns, such as location name, person name The ambiguous word segmentations. A: 上海 是 个 大都会。 (Shanghai is a metropolis) B: 上海人 大都 会 那么 说。 (Most Shanghainese will say that) Non-markup behavior Determine the boundary using language-specific knowledge Errors may occur

Toshiba (China) R&D Center Suggestions on Word Boundary (cont…) New element w is suggested 都会 An optional attribute detail is also recommended to mark phrases 上海人大都会 Here, the phrase is split into three words, and the number of Chinese characters of these words are 3, 2 and 1.

Toshiba (China) R&D Center Suggestion on Word Boundary (cont…) Legal values of the optional attribute detail Not bigger than the length of the contained text 上海 上海 Default value is the length of the contained text 上海 When the sum of value is smaller than the length of the contained text, the left part is regarded as a word 上海人大都会 The first 3 Chinese characters “ 上海人 ”are regarded as one word and the left “ 大都会 ” are regarded as another word When the sum of value is bigger than the length of the contained text, this markup should be neglected

Toshiba (China) R&D Center Possible Influence on SSML 1.0 Influence on speech synthesizing steps Word segmentation is suggested to be done before parse text and analysis structure Relation between SSML 1.0 markups and word segmentation markup w (needs more discussion) p, s element can be followed by w element; w element can be followed by audio, emphasis, phoneme, prosody, say-as, sub, voice and t(if defined) 上海 上海 大都 会

Toshiba (China) R&D Center Thank you!