When to Code WHEN NOT TO CODE James D. (jj) Johnston Chief Scientist DTS, Inc.

Slides:



Advertisements
Similar presentations
My list of 10 worst mistakes in Audio (some of which are pet peeves) J. D. (jj) Johnston Neural Audio Kirkland, Wa, USA.
Advertisements

Improving audibility as a foundation for better speech understanding Pamela Souza, PhD Northwestern University Evanston, IL.
Why do we hear what we hear? James D. Johnston Chief Scientist DTS, Inc.
Audiograms What are they? Why have them? How do I read them?
Why do we hear what we hear? James D. Johnston Chief Scientist, DTS, Inc.
Analogue to Digital Conversion (PCM and DM)
April 25, 2005ECE 457 Cellular Communication ECE 457 Spring 2005.
Chapter Two Fundamentals of Data and Signals
Chapter Two Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User's Approach Sixth Edition.
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
Increasing Hearing Ability – Using Frequency Shifting and Channel Control Yuan Alex Gao CMPT
1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
A Software Defined Radio Implementation for Voice Transmission over Wireless Ad-hoc Networks Jason Tran SURF-IT 2009 Fellow Mentors: Dr. Homayoun Yousefi’zadeh.
Chapter 2 Fundamentals of Data and Signals
EET 450 Chapter 18 – Audio. Analog Audio Sound is analog Consists of air pressure that has a variety of characteristics  Frequencies  Amplitude (loudness)
© 2006 Cisco Systems, Inc. All rights reserved. 2.2: Digitizing and Packetizing Voice.
1 New Technique for Improving Speech Intelligibility for the Hearing Impaired Miriam Furst-Yust School of Electrical Engineering Tel Aviv University.
1 Chapter 2 Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User’s Approach.
Repeater Etiquette for the Newly Licensed By AG4BR Satellite Bob aka Baghdad Boobie.
Cellular Telephones How Do They Work and Are They Safe? Meagan Morrell May 3, 2000.
1 Live Sound Reinforcement Equalizers and other signal processing equipment.
Fundamentals of Digital Communication
DIGITAL VOICE NETWORKS ECE 421E Tuesday, October 02, 2012.
Chapter 5. Sound Intensity (db) = 20 log (P1/P2)
Data Communications & Computer Networks, Second Edition1 Chapter 2 Fundamentals of Data and Signals.
© 2006 Cisco Systems, Inc. All rights reserved. QOS Lecture 2 - Introducing VoIP Networks.
Digital multiplexers In TDM a group of analog signals are sampled sequentially in time at a common sampling rate and then multiplexed for transmission.
Copyrights © All rights Reserved. Asterisk and VoIP issues Chetan Vaity March 2007.
Technician License Course Chapter 5 Operating Station Equipment Lesson Plan Module 11: Transmitters, Receivers and Transceivers.
LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.
Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.
Dynamic Range and Dynamic Range Processors
The Care and Feeding of Loudness Models J. D. (jj) Johnston Chief Scientist Neural Audio Kirkland, Washington, USA.
Chapter 3.2 Speech Communication Human Performance Engineering Robert W. Bailey, Ph.D. Third Edition.
Pulse Code Modulation Pulse Code Modulation (PCM) : method for conversion from analog to digital waveform Instantaneous samples of analog waveform represented.
Definition and Coordination of Signal Processing Functions for telephone connections involving automotive speakerphones Scott Pennock Senior Hands-Free.
© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations.
1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.
Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.
RAZAN YOUSEF AL-ALI SECTION: B COURSE NO: RAZAN YOUSEF AL-ALI SECTION: B COURSE NO: 3143.
1 Chapter 2 Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User’s Approach.
Parts of a Wave Crest Wavelength Trough Normal Rest Position Frequency = 2 waves per second.
CDMA TECHNOLOGY DEFINITION OF CDMA TECHNOLOGY A coding scheme, used as a modulation technique, in which multiple channels are independently coded for.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
CHAPTER 3 DELTA MODULATION
How to use your hand-held radio effectively in an emergency Tactical Use Of Radios Part 1 Advancement Module 01.
Noise. Noise When someone loses their hearing, it can be devastating for them and annoying for others around then. People with poor hearing are often.
Chapter Two Fundamentals of Data and Signals Data Communications and Computer Networks: A Business User's Approach Eighth Edition.
Frequency and Bandwidth A means of quantifying and interpreting media capacity.
Workable Presentations 20 Tips (more or less) to a Successful Presentation Created by S. L. Shea Dept of Family & Comm. Medicine Southern Illinois University.
Destructive Interference – Active Noise Reduction (ANR) Koss Noise Reduction Headphones.
Encoding How is information represented?. Way of looking at techniques Data Medium Digital Analog Digital Analog NRZ Manchester Differential Manchester.
Overview of Digital Video Compression Multimedia Systems and Standards S2 IF Telkom University.
CDMA Systems. 2 How does CDMA work? Each bit (zero or one) is spread into N smaller pulses/chips (a series of zeros and ones). The receiver which knows.
Chapter Five Making Connections Efficient: Multiplexing and Compression Data Communications and Computer Networks: A Business User’s Approach Eighth Edition.
TRANSMITTER FUNDAMENTALS P-117. Audio Frequency Definition Acoustic, mechanical, or electrical frequencies corresponding to normally audible sound waves.
Sami Khorbotly, Ph.D., IEEE member Assistant Professor
1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.
DIGITAL COMMUNICATION. Introduction In a data communication system, the output of the data source is transmitted from one point to another. The rate of.
Radio Equipment. Review: On the Transmitter Side The purpose of radio communications is to transfer information from one point to another. The information.
Lecture 2.4. Multiplexing. Learning Outcomes Discuss the concept of Multiplexing Explain & calculate frequency-division multiplexing. Explain & calculate.
Codec 2 open source speech codec
GSM Speech Coding To send a voice across a radio network, we have to turn our voice into a digital signal. GSM uses a method called RPE-LPC (Regular Pulse.
The Importance of In-Mask Communications
Chapter Five Making Connections Efficient: Multiplexing and Compression Data Communications and Computer Networks: A Business User’s Approach Eighth Edition.
Digital Communications Chapter 13. Source Coding
Cellular Telephone Networks
Govt. Polytechnic Dhangar(Fatehabad)
Presentation transcript:

When to Code WHEN NOT TO CODE James D. (jj) Johnston Chief Scientist DTS, Inc

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. What is a coder, or a codec A coder or codec is a signal processing method that reduces bit rate. There are two (broadly speaking) kinds of codec: –Source Coders –Perceptual Coders

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. What is a Source Coder A source coder uses a signal model of the source to reduce the bit rate –It does this by reducing redundancy, and/or by adding noise –A source coder’s success depends entirely on its ability to model the source. Not all sources meet the same model Not all coders can handle disparate models The more constrained the source, the more efficient the codec –But you’d better not have the wrong source, then.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Some points about Source Coders A source coder is highly dependent on its source model, adaptive or fixed –The stronger the model, the more compression –The more general the model, the less the reduction in bit rate (in general) When a source coder does not match the actual source –Compression fails Bit rate rises (or) Quality falls –When a source coder is extremely specific, it may fail completely on some source material Adaptive codecs (most are) generally adapt to the most energetic signal presented as input –If noise is louder, it will code the noise better than the speech!

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Low Rate Voice Coders The way that a low rate voice coder reduces the bit rate is by having a very specific speech model. –Such codecs fail miserably on music –Such codecs fail miserably when exposed to too much background noise In short, if you use a very-low-rate voice codec, YOU MUST MAKE SURE THAT IT GETS CLEAN VOICE INPUT –We all hear this on our cell phones on a daily basis

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. What about Special Codecs It is possible to make a codec work somewhat better on noise at a cost to speech performance Unfortunately, this cuts both ways.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. What’s the point for Emergency Services? 1.Don’t use an extremely low rate speech codec 2.Really! Don’t use a low-rate speech codec 3.See Rule #2! You have no control over background noise, and the background noise in an emergency situation is likely to be both non-speech-like as well as intense.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Digital Transmission in the Emergency Services Analog transmission gets noisy when you get to the limits of communication. –The noise gives you warning that you are losing contact. Digital transmission JUST GOES AWAY –What’s more, with time-domain redundancy, it will “stay away” for a while If you use digital transmission, make sure that the user will be alerted to loss of communication.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. What about perceptual codecs? A perceptual codec attempts to discover what the most audible signal elements are, and sends them –It uses a human hearing model that assumes normal listening levels. Do I need to say any more? –Ok, It uses a human hearing model that assumes normal listening levels.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Implications for Emergency Services Perceptual coders must make assumptions about the listening level Perceptual coders must also make assumptions about what the “message” is at low rates The combination of lots of noise, loud noise, and speech is not a good combination.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. The Ear vs. Noise At high levels, the ear’s ability to separate frequencies in a signal is badly degraded –This makes speech even harder to understand –Making it LOUDER may make the result worse A situation where the listener is in a high noise environment, the talker has lots of noise, and a low- rate codec is in use is very, very likely to create communications problems. BE CAREFUL!

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Now what? HIGH rate speech codecs (say muLaw at 56 kb/s) have no model to speak of, and are more robust. Straight PCM is probably an excessive rate for mobile radio use The tradeoff between rate, delay, time redundancy, and “the cliff” must be carefully examined.

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. So now what? GET THE CLEANEST SPEECH SIGNAL YOU CAN A.Close talking mike B.Good AGC C.Heavily filtered bandwidth, Hz or so i.Yes, this eliminates fricatives, that’s what Alpha, Baker … is for. D.Use as much noise cancelling as you can

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Then Carefully test any codecs for –Tandeming properties in noise –DRT/Articulation performance in noise –Overload or “lockup” issues in noise

Information on this slide is the confidential property of DTS. Any unauthorized copying is strictly prohibited. Copyright 2009 DTS, Inc. Finally: BE Careful OUT THERE!