Audio Compression: ADPCM; ATRAC (MiniDisc); MPEG Audio –3 layers, referred to as layers I, II, and III –The third layer is mp3.


The steps of MP3 Input signal in digital form (sampled). Split signal into separate frequency bands corresponding to the ear’s critical bands. Separately calculate the ear’s response to each band. Increase the quantisation step in bands where we can mask quantisation noise.

The steps of MP3 Code resultant bit stream using Huffman coding. Output the result.

Sampling Sample rate of 44.1 kHz using 16 bits per channel in stereo. This gives 1,411,200 bps. Blocks of 512 samples are taken. Converted to frequency domain using modified DCT. Split into 32 equal-width bands.
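The figures on this slide can be checked with a few lines of arithmetic. This is a minimal sketch using only the values quoted above; the variable names are illustrative, and the band width assumes the 32 bands evenly divide the range from 0 Hz to half the sample rate.

```matlab
Fs            = 44100;   % sample rate in Hz
bitsPerSample = 16;      % bits per sample, per channel
nChannels     = 2;       % stereo

% Uncompressed (PCM) bit rate for both channels together.
pcmBitRate = Fs * bitsPerSample * nChannels;   % 1,411,200 bits per second

% Duration of one 512-sample block.
blockLen         = 512;
blockDuration_ms = 1000 * blockLen / Fs;       % about 11.6 ms

% Width of each of the 32 equal-width bands (assumed to span 0 to Fs/2).
nBands       = 32;
bandWidth_Hz = (Fs / 2) / nBands;

fprintf('PCM bit rate: %d bps\n', pcmBitRate);
fprintf('Block length: %.2f ms\n', blockDuration_ms);
fprintf('Band width:   %.2f Hz\n', bandWidth_Hz);
```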

Psycho-acoustics and critical bands The human ear can hear from 20 Hz to 20,000 Hz in frequency.
Y=chirp([0:1/44100:4], 20, 4, 20000, 'logarithmic');
sound(Y, 44100)
The ear’s sensitivity changes with frequency. Most sensitive about Hz speech MW radio.
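The chirp function used on this slide comes from a signal-processing toolbox and may not be installed everywhere. A similar 20 Hz to 20,000 Hz logarithmic sweep can be sketched with base functions only, assuming the instantaneous frequency follows f(t) = f0*k^t; the phase is then the integral of that frequency. The variable names and the 0.9 amplitude are choices made for this sketch, not part of the original demo.

```matlab
Fs = 44100;          % sample rate in Hz
T  = 4;              % sweep duration in seconds
f0 = 20;             % start frequency in Hz
f1 = 20000;          % end frequency in Hz

t = 0:1/Fs:T;
k = (f1/f0)^(1/T);   % frequency ratio per second

% Instantaneous frequency rises exponentially from f0 to f1.
instFreq = f0 * k.^t;

% Phase is 2*pi times the integral of the instantaneous frequency.
phase = 2*pi*f0*(k.^t - 1)/log(k);
Y = 0.9*sin(phase);

% Uncomment to listen to the sweep:
% sound(Y, Fs)
```

While listening, the perceived loudness should change across the sweep even though the amplitude is constant, which is the point the slide makes about sensitivity varying with frequency.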

Psycho-acoustics and critical bands More importantly sensitivity levels are shifted by sounds which are close in frequency. This gives rise to critical bands. These are bands or areas around sounds where the sensitivity to nearby frequencies is reduced. There can be many (20-30) critical bands in the sound spectrum at any time. The bands vary in width across the audio spectrum, but are typically Hz wide between 1000 and 4000 Hz.
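To illustrate how band width grows with frequency, the sketch below evaluates one commonly quoted approximation of critical bandwidth (attributed to Zwicker and Terhardt). This formula is an assumption brought in for illustration; it is not taken from the slides, and the test frequencies are arbitrary.

```matlab
% Centre frequencies in Hz (arbitrary illustrative choices).
f = [100 500 1000 2000 4000 8000];

% Approximate critical bandwidth in Hz (assumed model, not from the slides).
cbw = 25 + 75*(1 + 1.4*(f/1000).^2).^0.69;

for i = 1:numel(f)
    fprintf('%5d Hz: about %4.0f Hz wide\n', f(i), cbw(i));
end
```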

Psycho-acoustics and critical bands It is the exploitation of the critical bands which allows mp3 to achieve its compression. (Principles of digital Audio, Ken Pohlmann)

Masking Critical bands are exploited. Frequencies are removed if they are not audible Sounds are inaudible if masked by either louder sounds or specific frequencies

Audible Masking A psycho-acoustic model is applied Frequencies are removed if they are not audible Sounds are inaudible if masked by either louder sounds or specific frequencies

Temporal Masking A psycho-acoustic model is applied Frequencies are removed if they are not audible Sounds are inaudible if masked by either louder sounds or specific frequencies

Post Masking Around 200ms Sounds are inaudible if masked by either louder sounds or specific frequencies

Pre Masking Around 20ms Sounds are inaudible if masked by either louder sounds or specific frequencies.

Masking Demo setup
Fs=44100;    % sample rate in Hz
sig1=1000;   % tone frequencies in Hz
sig2=1020;
sig3=200;
% Five seconds of each tone at amplitude 0.9
S1=0.9*sin(2*pi*sig1*(1:(5*Fs))/Fs);
S2=0.9*sin(2*pi*sig2*(1:(5*Fs))/Fs);
S3=0.9*sin(2*pi*sig3*(1:(5*Fs))/Fs);

Masking Demo setup
Play sounds
– sound(S1, 44100)
– sound(S2, 44100)
– sound(S3, 44100)
Make mixed sounds
–S12=0.5*S1+0.5*S2;
–S13=0.5*S1+0.5*S3;
Play mixed sounds
– sound(S12, 44100)
– sound(S13, 44100)

Audible Masking Demo Sounds are inaudible if masked by either louder sounds or specific frequencies.
Make two masked sounds
–S12mask=0.9*S1+0.03*S2;
–S13mask=0.9*S1+0.03*S3;
Play them
– sound(S12mask, 44100)
– sound(S13mask, 44100)
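It can help to express the mix weights in this demo as a level difference in decibels. The sketch below uses the 0.9 and 0.03 weights and the 1000, 1020 and 200 Hz tone frequencies from the demo; the variable names are illustrative.

```matlab
maskerAmp = 0.9;    % weight of the loud tone (S1)
probeAmp  = 0.03;   % weight of the quiet tone (S2 or S3)

% Level of the quiet tone relative to the loud one, in dB.
relLevel_dB = 20*log10(probeAmp/maskerAmp);   % about -29.5 dB

% Frequency separation between the loud tone and each quiet tone.
sep12_Hz = abs(1020 - 1000);   % S2 is 20 Hz away from S1
sep13_Hz = abs(200 - 1000);    % S3 is 800 Hz away from S1

fprintf('Quiet tone level: %.1f dB relative to the masker\n', relLevel_dB);
fprintf('Separation: S2 %d Hz, S3 %d Hz\n', sep12_Hz, sep13_Hz);
```

Both quiet tones sit at the same relative level, so any difference in audibility between the two mixes comes from how close each tone is in frequency to the 1000 Hz masker.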

Post Masking Demo Make 100ms snippets of S2 and S3.
–S2sht=S2(1:round(100/1000*Fs));
–S3sht=S3(1:round(100/1000*Fs));
Play them. Add them to the end of S1 and play them.
–sound([S1 S2sht], 44100)
–sound([S1 S3sht], 44100)
Reduce the level of the “snippets” by a factor of 100 and play them:
–sound([S1 0.01*S2sht], 44100)
–sound([S1 0.01*S3sht], 44100)
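The same approach could be tried for pre-masking by placing a quiet snippet just before the loud tone instead of after it. This is a sketch only, not part of the original demo: it assumes a 20 ms snippet (the pre-masking figure quoted earlier) and reuses the 0.01 scaling from the post-masking demo. The names preLen, S2pre and preTest are hypothetical.

```matlab
Fs = 44100;                      % sample rate in Hz
n  = 1:(5*Fs);                   % five seconds of samples
S1 = 0.9*sin(2*pi*1000*n/Fs);    % loud 1000 Hz tone, as in the demo setup
S2 = 0.9*sin(2*pi*1020*n/Fs);    % 1020 Hz tone, as in the demo setup

% A 20 ms snippet of S2 (assumed length, taken from the pre-masking figure).
preLen = round(20/1000*Fs);
S2pre  = S2(1:preLen);

% Quiet snippet first, then the loud tone.
preTest = [0.01*S2pre S1];

% Uncomment to listen:
% sound(preTest, Fs)
```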

Quantisation The number of bits per channel is fixed in advance. For mp3 this ranges from 32 to 160 kbps depending on the amount of compression required Bits are allocated to the remaining frequencies in each channel
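To put the quoted bit rates in context, the sketch below compares them with the uncompressed rate of one 16-bit, 44.1 kHz channel. It takes the slide's 32 and 160 kbps figures as per-channel rates, as stated; the variable names are illustrative.

```matlab
Fs            = 44100;   % sample rate in Hz
bitsPerSample = 16;      % uncompressed bits per sample

pcmRate = Fs * bitsPerSample;   % 705,600 bps for one uncompressed channel
mp3Rate = [32000 160000];       % lowest and highest rates quoted, in bps

% Compression ratio at each rate.
ratio = pcmRate ./ mp3Rate;

% Average number of coded bits available per input sample.
codedBitsPerSample = mp3Rate / Fs;

fprintf('%6d bps: ratio %.2f to 1, %.2f bits per sample\n', ...
        [mp3Rate; ratio; codedBitsPerSample]);
```

At the lowest rate the coder has less than one bit per sample on average, which is why bits have to be allocated selectively across frequencies.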

Quantisation Noise As the quantisation steps are increased, noise level increases. This is because the error between the actual signal and the quantisation step may be regarded as a separate (unwanted noise) signal added to the actual signal.

Quantisation Noise However, if we can mask this unwanted noise we can use fewer quantisation steps (fewer bits).

Quantisation Noise Demonstration. Input a high quality wav file. Increase the quantisation step.
–Aquant=round(A(:)*(2^4-1))/(2^4-1)
–For 4 bit
Listen to the sound. Keep increasing quantisation step while listening to the quantisation noise.
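The effect heard in this demonstration can also be measured. The sketch below applies the same rounding expression at several bit depths and reports the signal-to-noise ratio. It assumes a one-second 997 Hz sine as a stand-in for the wav file, and the set of bit depths is an arbitrary choice.

```matlab
Fs = 44100;                   % sample rate in Hz
t  = (0:Fs-1)/Fs;             % one second of sample times
A  = 0.9*sin(2*pi*997*t);     % stand-in signal (assumed, replaces the wav file)

bits   = [4 8 12 16];         % bit depths to compare (illustrative)
snr_dB = zeros(size(bits));

for i = 1:numel(bits)
    q      = 2^bits(i) - 1;
    Aquant = round(A*q)/q;    % same rounding expression as in the demo
    noise  = A - Aquant;      % quantisation error treated as an added noise signal
    snr_dB(i) = 10*log10(sum(A.^2)/sum(noise.^2));
end

for i = 1:numel(bits)
    fprintf('%2d bits: SNR about %.1f dB\n', bits(i), snr_dB(i));
end
```

Each extra bit halves the step size, so the noise level should fall by roughly 6 dB per bit.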

Huffman coding. The resultant bitstream is now reduced because of coarser quantisation, but it can be further reduced by the use of Huffman coding. Due to the nature of sound, high level components at certain frequencies are less likely than low level ones and vice versa. This statistical bias can be exploited using Huffman coding.
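A small worked example shows how a biased distribution leads to shorter codes. The sketch below builds Huffman code lengths for four symbols by repeatedly merging the two least likely groups. The probabilities are made up for illustration and do not come from real audio data; this is a generic Huffman construction, not the specific code tables used by mp3.

```matlab
% Illustrative symbol probabilities (assumed): common values first.
p = [0.5 0.25 0.125 0.125];

n      = numel(p);
len    = zeros(1, n);      % code length assigned to each symbol
groups = num2cell(1:n);    % each cell lists the symbols under one tree node
w      = p;                % weight of each node

while numel(w) > 1
    % Sort nodes by weight so the two least likely come first.
    [w, order] = sort(w);
    groups     = groups(order);

    % Merging two nodes adds one bit to every symbol beneath them.
    merged      = [groups{1} groups{2}];
    len(merged) = len(merged) + 1;

    groups = [{merged} groups(3:end)];
    w      = [w(1)+w(2) w(3:end)];
end

avgLen   = sum(p .* len);      % average bits per symbol with Huffman coding
fixedLen = ceil(log2(n));      % bits per symbol with a fixed-length code

fprintf('Code lengths: %s\n', mat2str(len));
fprintf('Average %.2f bits versus %d bits fixed\n', avgLen, fixedLen);
```

With these probabilities the most likely symbol gets a 1-bit code and the average drops to 1.75 bits per symbol, compared with 2 bits for a fixed-length code.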

Further reading comp.pdf Principles of digital Audio, Ken Pohlmann