CS 414 - Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.

Slides:



Advertisements
Similar presentations
EET260: A/D and D/A conversion
Advertisements

Tamara Berg Advanced Multimedia
Digital Coding of Analog Signal Prepared By: Amit Degada Teaching Assistant Electronics Engineering Department, Sardar Vallabhbhai National Institute of.
Analogue to Digital Conversion (PCM and DM)
Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.
4.2 Digital Transmission Pulse Modulation (Part 2.1)
Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.
Multimedia communications EG-371Dr Matt Roach Multimedia Communications EG 371 and EG 348 Dr Matthew Roach Lecture 2 Digital.
Digital Data Transmission ECE 457 Spring Information Representation Communication systems convert information into a form suitable for transmission.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
CSc 461/561 CSc 461/561 Multimedia Systems Part A: 1. Audio.
© 2006 Cisco Systems, Inc. All rights reserved. 2.2: Digitizing and Packetizing Voice.
Fundamental of Wireless Communications ELCT 332Fall C H A P T E R 6 SAMPLING AND ANALOG-TO-DIGITAL CONVERSION.
EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.
Chapter 4 Digital Transmission
Digital Audio Multimedia Systems (Module 1 Lesson 1)
Digital Communication Techniques
1/21 Chapter 5 – Signal Encoding and Modulation Techniques.
Formatting and Baseband Modulation
Digital audio. In digital audio, the purpose of binary numbers is to express the values of samples that represent analog sound. (contrasted to MIDI binary.
A/D Conversion No. 1  Seattle Pacific University Analog to Digital Conversion Based on Chapter 5 of William Stallings, Data and Computer Communication.
Digital Audio What do we mean by “digital”? How do we produce, process, and playback? Why is physics important? What are the limitations and possibilities?
Digital Audio Watermarking: Properties, characteristics of audio signals, and measuring the performance of a watermarking system نيما خادمي کلانتري
Review: The application layer. –Network Applications see the network as the abstract provided by the transport layer: Logical full mesh among network end-points.
Multimedia Technology Digital Sound Krich Sintanakul Multimedia and Hypermedia Department of Computer Education KMITNB.
Chapter 6 Basics of Digital Audio
COMP Representing Sound in a ComputerSound Course book - pages
CSC361/661 Digital Media Spring 2002
AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.
Media Representations - Audio
Pulse Code Modulation (PCM)
COMMUNICATION SYSTEM EEEB453 Chapter 5 (Part IV) DIGITAL TRANSMISSION.
10/6/2015 3:12 AM1 Data Encoding ─ Analog Data, Digital Signals (5.3) CSE 3213 Fall 2011.
© 2006 Cisco Systems, Inc. All rights reserved. Optimizing Converged Cisco Networks (ONT) Module 2: Cisco VoIP Implementations.
British Computer Society (BCS)
The Physical Layer Lowest layer in Network Hierarchy. Physical transmission of data. –Various flavors Copper wire, fiber optic, etc... –Physical limits.
Basic Encoding Techniques
1 Introduction to Information Technology LECTURE 6 AUDIO AS INFORMATION IT 101 – Section 3 Spring, 2005.
Pulse Code Modulation PCM is a method of converting an analog signal into a digital signal. (A/D conversion) The amplitude of Analog signal can take any.
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.
Encoding and Simple Manipulation
Digital Audio III. Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent.
1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.
4.2 Digital Transmission Pulse Modulation Pulse Code Modulation
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 4 – Visual Perception and Digital Image Representation Klara Nahrstedt Spring 2012.
Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.
Transmission Media. Quiz: ANGEL->Lessons->Quiz 2 Figure 1Figure 2 Figure 3Figure 4.
Multimedia Sound. What is Sound? Sound, sound wave, acoustics Sound is a continuous wave that travels through a medium Sound wave: energy causes disturbance.
1 What is Multimedia? Multimedia can have a many definitions Multimedia means that computer information can be represented through media types: – Text.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 4 – Audio and Digital Image Representation Klara Nahrstedt Spring 2010.
Advanced Computer Networks
Digital Audio I. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul.
Fundamentals of Multimedia Chapter 6 Basics of Digital Audio Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.
1 st semester 1436 / Modulation Continuous wave (CW) modulation AM Angle modulation FM PM Pulse Modulation Analog Pulse Modulation PAMPPMPDM Digital.
Pulse Code Modulation (PCM) Analog voice data must be translated into a series of binary digits before they can be transmitted. With Pulse Code Modulation.
Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.
Digital Audio (2/2) S.P.Vimal CSIS Group BITS-Pilani
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 4 – Visual Perception and Digital Image Representation Klara Nahrstedt Spring 2014.
Computer Communication & Networks
The Physics of Sound.
Chapter 4: Second generation Systems-Digital Modulation
Analog to digital conversion
Topics discussed in this section:
UNIT – III I: Digital Transmission.
UNIT II.
4.1 Chapter 4 Digital Transmission Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Quantization and Encoding
CS 4594 Data Communications
Digital Control Systems Waseem Gulsher
Analog to Digital Encoding
Presentation transcript:

CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014

CS Spring 2014 Administrative Form Groups for MPs  Deadline January 27 (today!!) to TA

Why is sound logarithmic? Last lecture we had dB = 10*log (I/I 0 ), where I 0 is threshold of hearing Question: why is sound (loudness) behaving logarithmically? One answer: Human senses obey Weber- Fletcher law, where it was measured that the response of the sense machinery is logarithm of an input. This applies to hearing, eye sensitivity, temperature sense, etc. CS Spring 2014

Weber-Fletcher Law Weber law:  Studied responses of humans to physical stimulus in quantitative fashion  Just-noticeable difference between two stimuli is proportional to magnitude of stimuli Fechner law:  Constructed psychophysical scale to describe relationship between physical magnitude of stimulus and perceived intensity  Formalized scaling and specified that perceived loudness/brightness is proportional to log10 (actual intensity measured with an accurate non-human instrument)

Integrating Aspects of Multimedia CS Spring 2014 Image/Video Capture Image/Video Information Representation Media Server Storage Transmission Compression Processing Audio/Video Presentation Playback Audio/Video Perception/ Playback Audio Information Representation Transmission Audio Capture A/V Playback

Today Introduced Concepts Analog to Digital Sound Conversion  Sampling, sampling rate, Nyquist Theorem  Quantization, pulse code modulation, differentiated PCM, Adaptive PCM Signal-to-Noise Ratio Data Rates CS Spring 2014

Key Questions How can a continuous wave form be converted into discrete samples? How can discrete samples be converted back into a continuous form? CS Spring 2014

Lifecycle from Sound to Digital to Sound Source:

Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz – 8KHz]

Digital Representation of Audio Must convert wave form to digital  sample  quantize  compress

Sampling (in time) Measure amplitude at regular intervals How many times should we sample? CS Spring 2014

Nyquist Theorem For lossless digitization, the sampling rate should be at least twice the maximum frequency response. In mathematical terms: f s > 2*f m where f s is sampling frequency and f m is the maximum frequency in the signal CS Spring 2014

Limited Sampling But what if one cannot sample fast enough? CS Spring 2014

Limited Sampling Reduce signal frequency to half of maximum sampling frequency  low-pass filter removes higher-frequencies  e.g., if max sampling frequency is 22kHz, must low-pass filter a signal down to 11kHz CS Spring 2014

Sampling Ranges Auditory range 20Hz to kHz  must sample up to to 44.1kHz  common examples are kHz, kHz, kHz, kHz, and 44.1 KHz Speech frequency [200 Hz, 8 kHz]  sample up to 16 kHz  but typically 4 kHz to 11 kHz is used CS Spring 2014

Quantization CS Spring 2014

Sampling and 4-bit quantization Source: CS Spring 2014

Quantization CS Spring

Quantization Typically use  8 bits = 256 levels  16 bits = 65,536 levels How should the levels be distributed?  Linearly? (PCM)  Perceptually? (u-Law)  Differential? (DPCM)  Adaptively? (ADPCM)

Bit Rate Bits per second Bits per sample X samples per second 16 X 44.1 = Kbps (Mono) 16 X 44.1 X 2 = Kbps (Stereo) CS Spring 2014

Nyquist Limit max data rate = 2 H log 2 V bits/second, where H = bandwidth (in Hz) V = discrete levels (bits per signal change) Shows the maximum number of bits that can be sent per second on a noiseless channel with a bandwidth of H, if V bits are sent per signal  Example: what is the maximum data rate for a 3kHz channel that transmits data using 2 levels (binary) ?  Solution: H = 3kHz; V = 2; max. data rate = 2x3,000xln2=6,000bits/second CS Spring 2014

Pulse Code Modulation Pulse modulation  Use discrete time samples of analog signals  Transmission is composed of analog information sent at different times  Variation of pulse amplitude or pulse timing allowed to vary continuously over all values PCM  Analog signal is quantized into a number of discrete levels CS Spring

PCM Quantization and Digitization CS Spring 2014 Digitization Quantization

Linear Quantization (PCM) Divide amplitude spectrum into N units (for log 2 N bit quantization) Sound Intensity Quantization Index CS Spring 2014

Non-uniform Quantization CS Spring 2014

Perceptual Quantization (u-Law) Want intensity values logarithmically mapped over N quantization units Sound Intensity Quantization Index CS Spring 2014

Differential Pulse Code Modulation (DPCM) What if we look at sample differences, not the samples themselves?  d t = x t -x t-1  Differences tend to be smaller Use 4 bits instead of 12, maybe? CS Spring 2014

Differential Pulse Code Modulation (DPCM) Changes between adjacent samples small Send value, then relative changes  value uses full bits, changes use fewer bits  E.g., 220, 218, 221, 219, 220, 221, 222, 218,.. (all values between 218 and 222)  Difference sequence sent: 220, +2, -3, 2, -1, -1, -1,  Result: originally for encoding sequence numbers need 8 bits;  Difference coding: need only 3 bits CS Spring 2014

Adaptive Differential Pulse Code Modulation (ADPCM) Adaptive similar to DPCM, but adjusts the width of the quantization steps Encode difference in 4 bits, but vary the mapping of bits to difference dynamically  If rapid change, use large differences  If slow change, use small differences CS Spring 2014

Signal-to-Noise Ratio (metric to quantify quality of digital audio) CS Spring 2014

Signal To Noise Ratio Measures strength of signal to noise SNR (in DB)= Given sound form with amplitude in [-A, A] Signal energy = A 0 -A

Quantization Error Difference between actual and sampled value  amplitude between [-A, A]  quantization levels = N e.g., if A = 1, N = 8, = 1/4 CS Spring 2014

Compute Signal to Noise Ratio Signal energy = ; Noise energy = ; Noise energy = Signal to noise = Every bit increases SNR by ~ 6 decibels CS Spring 2014

Example Consider a full load sinusoidal modulating signal of amplitude A, which utilizes all the representation levels provided The average signal power is P= A 2 /2 The total range of quantizer is 2A because modulating signal swings between –A and A. Therefore, if it is N=16 (4-bit quantizer), Δ = 2A/2 4 = A/8 The quantization noise is Δ 2 /12 = A 2 /768 The S/N ratio is (A 2 /2)/(A 2 /768) = 384; SNR (in dB) 25.8 dB CS Spring 2014

Data Rates Data rate = sample rate * quantization * channel Compare rates for CD vs. mono audio  8000 samples/second * 8 bits/sample * 1 channel = 8 kBytes / second  44,100 samples/second * 16 bits/sample * 2 channel = 176 kBytes / second ~= 10MB / minute CS Spring 2014

Comparison and Sampling/Coding Techniques CS Spring 2014

Summary CS Spring 2014