CSC361/661 Digital Media Spring 2002


Sound Processing

How Sound Is Produced Air vibration: molecules in the air are disturbed, one bumping against another. An area of high pressure moves through the air as a wave. Thus a wave representing the changing air pressure can be used to represent sound.

How Sound Is Perceived The cochlea, an organ in our inner ears, detects sound. The cochlea is joined to the eardrum by three tiny bones. It consists of a spiral of tissue filled with liquid and thousands of tiny hairs. The hairs get smaller as you move down into the cochlea. Each hair is connected to a nerve which feeds into the auditory nerve bundle going to the brain. The longer hairs resonate with lower frequency sounds, and the shorter hairs with higher frequencies. Thus the cochlea serves to transform the air pressure signal experienced by the eardrum into frequency information which can be interpreted by the brain as sound.

Pulse Code Modulation PCM is the most common type of digital audio recording. A microphone converts a varying air pressure (sound waves) into a varying voltage. Then an analog-to-digital converter samples the voltage at regular intervals. Each sampled voltage gets converted into an integer of a fixed number of bits.
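The sampling-then-integer-conversion steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `pcm_encode`, the 440 Hz test tone, and the choice to scale a signal in the range -1.0 to 1.0 onto signed integers are all hypothetical, not taken from the slides.

```python
import math

def pcm_encode(signal, sample_rate, duration, bits):
    """Sample signal(t) (values assumed in [-1.0, 1.0]) at regular
    intervals and convert each sample to a signed integer of `bits` bits."""
    max_code = 2 ** (bits - 1) - 1            # e.g. 127 for 8 bits
    n_samples = round(sample_rate * duration)
    samples = []
    for i in range(n_samples):
        t = i / sample_rate                   # time of the i-th sample
        v = signal(t)                         # stands in for the microphone voltage
        samples.append(round(v * max_code))   # fixed-size integer code
    return samples

def tone(t):
    # Hypothetical input: a 440 Hz sine wave.
    return math.sin(2 * math.pi * 440 * t)

samples = pcm_encode(tone, sample_rate=8000, duration=0.01, bits=8)
print(len(samples), samples[:8])
```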

Digitization of Sound Digitization has two steps: sampling and quantization. Sampling: Most humans can’t hear anything over 20 kHz. The sampling rate must be more than twice the highest frequency component of the sound (Nyquist theorem). CD quality is sampled at 44.1 kHz. Frequencies over 22.05 kHz are filtered out before sampling is done. Quantization: Telephone-quality sound uses 8-bit samples. CD-quality sound uses 16-bit samples (65,536 quantization levels) on two channels for stereo. Show how in Cool Edit. Show difference in quality. Show computation for file size. Compare to size on disk.
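The file size computation mentioned above follows directly from sampling rate, sample size, and channel count. The sketch below covers uncompressed PCM data only (no file header). The function name is hypothetical, and the 8 kHz telephone sampling rate is an assumption, since the slide gives only the 8-bit sample size.

```python
def pcm_size_bytes(sample_rate, bits_per_sample, channels, seconds):
    """Size in bytes of uncompressed PCM audio data."""
    return sample_rate * bits_per_sample * channels * seconds // 8

# One minute of CD-quality stereo: 44.1 kHz, 16 bits, 2 channels.
cd_minute = pcm_size_bytes(44100, 16, 2, 60)

# One minute of telephone-quality mono: 8 bits at an assumed 8 kHz.
phone_minute = pcm_size_bytes(8000, 8, 1, 60)

print(cd_minute, phone_minute)
```

The size reported on disk will usually be slightly larger because of the file header and file-system block rounding.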

Encoder Design A – B. Apply bandlimiting filter to remove high frequency components. C. Sample at regular time intervals. D. Quantize each sample.

Sampling Error (Undersampling) If you undersample, one frequency will alias as another. For CD quality, frequencies above 22.05 kHz are filtered out, and then the sound is sampled at 44.1 kHz. This is depicted on the next slide. Figure from Multimedia Communications by Fred Halsall, Addison-Wesley, 2001.
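To make aliasing concrete, the sketch below computes which frequency an undersampled tone appears as, and checks that a 7 kHz cosine sampled at 8 kHz yields the same samples as a 1 kHz cosine. The function name and the example frequencies are illustrative assumptions.

```python
import math

def alias_frequency(f, sample_rate):
    """Frequency (Hz) that a pure tone of frequency f appears as
    after sampling at sample_rate without a bandlimiting filter."""
    return abs(f - sample_rate * round(f / sample_rate))

# A 30 kHz tone sampled at 44.1 kHz is indistinguishable from a lower tone.
print(alias_frequency(30000, 44100))

# Sampled at 8 kHz, a 7 kHz cosine and a 1 kHz cosine give the same samples.
fs = 8000
high = [math.cos(2 * math.pi * 7000 * i / fs) for i in range(16)]
low = [math.cos(2 * math.pi * 1000 * i / fs) for i in range(16)]
same = all(math.isclose(h, l, abs_tol=1e-9) for h, l in zip(high, low))
print(same)
```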

Quantization Interval If $V_{max}$ is the maximum positive and negative signal amplitude and n is the number of binary bits used, then the magnitude of the quantization interval, q, is defined as follows: $q = \frac{2V_{max}}{2^n}$. For example, what if we have 8 bits and the values range from –1000 to +1000?
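The example can be worked through numerically. The sketch assumes the usual definition of the interval as the full signal range, 2·Vmax, divided by the 2^n available levels. The function name is hypothetical.

```python
def quantization_interval(vmax, bits):
    """Size of one quantization interval: full range 2*vmax over 2**bits levels."""
    return 2 * vmax / 2 ** bits

q = quantization_interval(1000, 8)   # 2000 / 256
print(q)
```

With 8 bits and a range of –1000 to +1000 this gives q = 7.8125.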

Quantization Error (Noise) Any values within a quantization interval will be represented by the same binary value. Each code word corresponds to a nominal amplitude value that is at the center of the corresponding quantization interval. The actual signal may differ from the code word by up to plus or minus q/2, where q is the size of the quantization interval.
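The q/2 bound can be checked with a small linear quantizer. This is a sketch, not a reference design: the function name is hypothetical, and it assumes code words are numbered upward from the most negative interval, with each nominal value at the center of its interval as described above.

```python
import math

def quantize(v, vmax, bits):
    """Return (code word index, nominal amplitude) for a value v in [-vmax, vmax]."""
    levels = 2 ** bits
    q = 2 * vmax / levels
    index = math.floor((v + vmax) / q)
    index = min(max(index, 0), levels - 1)   # clamp v == +vmax into the top interval
    nominal = -vmax + (index + 0.5) * q      # center of the interval
    return index, nominal

q = 2 * 1000 / 2 ** 8
worst = max(abs(v - quantize(v, 1000, 8)[1]) for v in range(-1000, 1001))
print(worst, q / 2)
```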

Quantization Intervals and Resulting Error

Results of Insufficient Quantization Levels Insufficient quantization levels result from not using enough bits to represent each sample. Insufficient quantization levels force you to represent more than one amplitude with the same value. This introduces quantization noise. Dithering can improve the quality of a digital file with a small sample size (relatively few quantization levels).
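The effect of too few bits can be measured as the root-mean-square error between a signal and its quantized version. The sketch below uses a hypothetical sine-wave test signal in the range -1 to 1 and a linear quantizer. It does not implement dithering; it only shows how the noise grows as the bit count shrinks.

```python
import math

def rms_quantization_error(bits, n_samples=1000):
    """RMS error when a test sine wave in [-1, 1] is linearly quantized."""
    levels = 2 ** bits
    q = 2.0 / levels
    total = 0.0
    for i in range(n_samples):
        v = math.sin(2 * math.pi * 5 * i / n_samples)
        index = min(int((v + 1.0) / q), levels - 1)
        nominal = -1.0 + (index + 0.5) * q    # center of the interval
        total += (v - nominal) ** 2
    return math.sqrt(total / n_samples)

for bits in (4, 8, 16):
    print(bits, rms_quantization_error(bits))
```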

Linear Vs. Non-Linear Quantization In linear quantization, each code word represents a quantization interval of equal length. In non-linear quantization, the intervals are of unequal length: more code words are used to represent samples at some levels, and fewer for samples at other levels. For sound, it is more important to have a finer-grained representation (i.e., more code words) for low amplitude signals than for high because low amplitude signals are more sensitive to noise. Thus, non-linear quantization is used.
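One widely used form of non-linear quantization is μ-law companding. The slides do not name a specific scheme, so treat this as an assumed example. The signal is compressed with a logarithmic curve, quantized linearly, and expanded again on playback, which gives small amplitudes finer intervals. The helper names and the value μ = 255 are assumptions for illustration.

```python
import math

MU = 255  # assumed companding parameter

def mu_law_compress(x, mu=MU):
    """Map x in [-1, 1] to [-1, 1], stretching small amplitudes."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)

def mu_law_expand(y, mu=MU):
    """Inverse of mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(mu)) / mu, y)

def quantize_unit(v, bits):
    """Linear quantizer on [-1, 1]; returns the nominal (interval-center) value."""
    levels = 2 ** bits
    q = 2.0 / levels
    index = min(int((v + 1.0) / q), levels - 1)
    return -1.0 + (index + 0.5) * q

x = 0.01                                        # a low-amplitude sample
linear = quantize_unit(x, 8)
companded = mu_law_expand(quantize_unit(mu_law_compress(x), 8))
print(abs(x - linear), abs(x - companded))
```

For this low-amplitude sample the companded version lands much closer to the original than the linearly quantized one, at the cost of coarser intervals near full scale.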

Sound Editing See Tutorial for: choosing sampling rate and bit depth; recording sound. See Studio Plugin Overview for information about multi-track recording. See Noise Reduction Overview for information about noise reduction. Show Cool Edit tutorial. Show how to cut, paste, do a loop.

Fourier Analysis

Fourier Transform It is possible to take any periodic function of time x(t) and resolve it into an equivalent infinite summation of sine waves and cosine waves with frequencies that start at 0 and increase in integer multiples of a base frequency $f_0 = 1/T$, where T is the period of x(t). Mathematically, we can say the same thing with this equation: $$x(t) = a_0 + \sum_{k=1}^{\infty} \left[ a_k \cos(2\pi k f_0 t) + b_k \sin(2\pi k f_0 t) \right]$$ This equation does NOT tell how to compute the Fourier transform, that is, how we get the coefficients $a_1 \ldots a_\infty$ and $b_1 \ldots b_\infty$. “Periodic” means it repeats. You can use the length of the sound file as the period. Any point on the wave can be computed as a sum of sine and cosine waves.
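As an illustration of the summation, the sketch below evaluates a truncated series x(t) = a0 + Σ [a_k cos(2πk·t/T) + b_k sin(2πk·t/T)] from given coefficients. The function name is hypothetical. The test signal is a square wave of amplitude 1, whose known coefficients are b_k = 4/(πk) for odd k and zero otherwise.

```python
import math

def fourier_series(t, a0, a, b, period):
    """Evaluate a truncated Fourier series at time t.
    a[k-1] and b[k-1] are the cosine and sine coefficients of harmonic k."""
    f0 = 1.0 / period                       # base frequency
    total = a0
    for k, (ak, bk) in enumerate(zip(a, b), start=1):
        angle = 2 * math.pi * k * f0 * t
        total += ak * math.cos(angle) + bk * math.sin(angle)
    return total

# Square wave of period 1: only odd sine harmonics are non-zero.
harmonics = 99
a = [0.0] * harmonics
b = [4 / (math.pi * k) if k % 2 else 0.0 for k in range(1, harmonics + 1)]

value = fourier_series(0.25, 0.0, a, b, period=1.0)
print(value)   # close to 1, the square wave's value at t = 0.25
```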

Discrete Fourier Transform We can’t do an infinite summation on a computer. For digitally sampled input we can do the summation using the same number of frequency samples as there are time input samples. We can pretend that x(t) is periodic and that the period is the same length as the recording (or sound segment). The base frequency will be 1/length of recording (or sound segment).
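A direct, unoptimized version of this summation is sketched below using complex exponentials, which pack the sine and cosine terms into one number per frequency. It produces n frequency samples from n time samples. The function name and the test signal are illustrative assumptions.

```python
import cmath
import math

def dft(x):
    """Naive discrete Fourier transform: n output bins from n input samples."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]

# 32 samples containing exactly 4 cycles of a sine wave.
n = 32
signal = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]
spectrum = dft(signal)

# Bin k corresponds to k times the base frequency (1 / length of recording).
magnitudes = [abs(c) for c in spectrum[: n // 2]]
peak_bin = magnitudes.index(max(magnitudes))
print(peak_bin)
```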

Difference Between Discrete Fourier Transform and Discrete Cosine Transform The discrete cosine transform uses real numbers. This is all you need for image representation. The Fourier Transform uses complex numbers, which have a real and an imaginary part.

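Recall the definition of the Discrete Cosine Transform For an N × N pixel image $p(x, y)$, the DCT is an array of coefficients $DCT(u, v)$, $0 \le u, v \le N-1$, where (in the standard form) $$DCT(u, v) = \frac{2}{N} C(u)\, C(v) \sum_{x=0}^{N-1} \sum_{y=0}^{N-1} p(x, y) \cos\left[\frac{(2x+1)u\pi}{2N}\right] \cos\left[\frac{(2y+1)v\pi}{2N}\right]$$ where $C(k) = 1/\sqrt{2}$ if $k = 0$ and $C(k) = 1$ otherwise. This tells how to compute the Discrete Cosine Transform.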
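A direct implementation of a two-dimensional DCT is sketched below. It assumes the common orthonormal form with a 2/N scale factor and a 1/√2 weight on the zero-frequency terms. The slides' own normalization may differ, and the function name is hypothetical.

```python
import math

def dct2(p):
    """Naive 2-D DCT of an N x N image given as a list of lists of real numbers."""
    n = len(p)

    def c(k):
        # Weight for the zero-frequency (DC) term.
        return 1 / math.sqrt(2) if k == 0 else 1.0

    out = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            s = 0.0
            for x in range(n):
                for y in range(n):
                    s += (p[x][y]
                          * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                          * math.cos((2 * y + 1) * v * math.pi / (2 * n)))
            out[u][v] = (2 / n) * c(u) * c(v) * s
    return out

# A flat image has all of its energy in the DC coefficient.
flat = [[1.0] * 4 for _ in range(4)]
coeffs = dct2(flat)
print(coeffs[0][0])
```

Only real arithmetic is involved, which is the contrast with the Fourier transform drawn above.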

Versions of the Fourier Transform Fourier Transform -- infinite summation. Discrete Fourier Transform -- a sum of n waves derived from n samples; O(n^2) complexity. Fast Fourier Transform -- a fast version of the discrete Fourier transform, O(n log2 n) complexity; a disadvantage is that, because it is applied to finite segments of the signal, it requires a windowing function. See http://www.dataq.com/applicat/articles/an11.htm and http://www.chipcenter.com/eexpert/bmasta/bmasta001.html
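The speed-up comes from splitting the input into even- and odd-indexed halves and reusing their transforms. The sketch below is a textbook recursive radix-2 FFT, one of several FFT variants, and it assumes the input length is a power of two. It is checked against a naive O(n^2) DFT. Both function names are hypothetical.

```python
import cmath

def dft(x):
    """Naive O(n^2) discrete Fourier transform, used here as a reference."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 0 or n & (n - 1):
        raise ValueError("length must be a power of two")
    if n == 1:
        return [complex(x[0])]
    even = fft(x[0::2])                     # transform of even-indexed samples
    odd = fft(x[1::2])                      # transform of odd-indexed samples
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out

data = [1, 2, 3, 4, 0, 0, 0, 0]
fast = fft(data)
slow = dft(data)
print(max(abs(f - s) for f, s in zip(fast, slow)))
```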

Windowing Functions A windowing function minimizes the effect of phase discontinuities at the borders of segments. Hanning, Hamming, Blackman, and Blackman-Harris are often used.
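Two of the named windows have simple closed forms. The sketch below uses the common symmetric definitions, Hanning as 0.5 − 0.5·cos(2πn/(N−1)) and Hamming as 0.54 − 0.46·cos(2πn/(N−1)). The function names are hypothetical. Multiplying a segment by the window tapers its ends toward zero before the transform is taken.

```python
import math

def hanning(n):
    """Symmetric Hanning (Hann) window of length n."""
    if n == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]

def hamming(n):
    """Symmetric Hamming window of length n."""
    if n == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]

def apply_window(segment, window):
    """Taper a segment by multiplying it sample-by-sample with a window."""
    return [s * w for s, w in zip(segment, window)]

segment = [1.0] * 5
tapered = apply_window(segment, hanning(5))
print(tapered)
```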

Fourier Analysis in Cool Edit Can be used to filter certain frequencies. The window size and function are adjustable. Go to Transform/Filters/FFT to filter frequencies. Go to Analyze/Frequency Analysis to see an analysis of the frequency.