Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh.

Slides:

Advertisements

Similar presentations

Multimedia: Digitised Sound Data Section 3. Sound in Multimedia Types: Voice Overs Special Effects Musical Backdrops Sound can make multimedia presentations.

Advertisements

Speech Coding Techniques

Tamara Berg Advanced Multimedia

Introduction to MP3 and psychoacoustics Material from website by Mark S. Drew

MPEG/Audio Compression Tutorial Mike Blackstock CPSC 538a January 11, 2004.

CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.

Time-Frequency Analysis Analyzing sounds as a sequence of frames

Digital Audio Compression

Speech & Audio Coding TSBK01 Image Coding and Data Compression Lecture 11, 2003 Jörgen Ahlberg.

Speech Compression. Introduction Use of multimedia in personal computers Requirement of more disk space Also telephone system requires compression Topics.

AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.

Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.

1 Digital Audio Compression. 2 Formats  There are many different formats for storing and communicating digital audio:  CD audio  Wav  Aiff  Au 

Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.

Page 15/18/2015 CSE 40373/60373: Multimedia Systems Bluray (  MPEG-2 - enhanced for HD, also used for playback of DVDs and.

CELLULAR COMMUNICATIONS 5. Speech Coding. Low Bit-rate Voice Coding  Voice is an analogue signal  Needed to be transformed in a digital form (bits)

Speech Coding Nicola Orio Dipartimento di Ingegneria dell’Informazione IV Scuola estiva AISV, 8-12 settembre 2008.

1 CODING AND COMPRESSION PRESENTED BY: PING CHEN CECS401 UMC DATE: April,

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

Lecture 14: Spring 2007 MPEG Audio Compression

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

MPEG Audio Compression by V. Loumos. Introduction Motion Picture Experts Group (MPEG) International Standards Organization (ISO) First High Fidelity Audio.

Digital Voice Communication Link EE 413 – TEAM 2 April 21 st, 2005.

Audio and Video Compression

CSc 461/561 CSc 461/561 Multimedia Systems Part A: 1. Audio.

Fundamental of Wireless Communications ELCT 332Fall C H A P T E R 6 SAMPLING AND ANALOG-TO-DIGITAL CONVERSION.

COMP 249 :: Spring 2005 Slide: 1 Audio Coding Ketan Mayer-Patel.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

Digital Audio Multimedia Systems (Module 1 Lesson 1)

1 Audio Compression Multimedia Systems (Module 4 Lesson 4) Summary: r Simple Audio Compression: m Lossy: Prediction based r Psychoacoustic Model r MPEG.

CS :: Fall 2003 Audio Coding Ketan Mayer-Patel.

Chapter Seven: Digital Communication

GODIAN MABINDAH RUTHERFORD UNUSI RICHARD MWANGI.  Differential coding operates by making numbers small. This is a major goal in compression technology:

Chapter 6 Basics of Digital Audio

LECTURE Copyright  1998, Texas Instruments Incorporated All Rights Reserved Encoding of Waveforms Encoding of Waveforms to Compress Information.

Audio Compression Usha Sree CMSC 691M 10/12/04. Motivation Efficient Storage Streaming Interactive Multimedia Applications.

AUDIO COMPRESSION msccomputerscience.com. The process of digitizing audio signals is called PCM PCM involves sampling audio signal at minimum rate which.

Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission.

Multimedia Data Speech and Audio Dr Sandra I. Woolley Electronic, Electrical and Computer Engineering.

Page 0 of 23 MELP Vocoders Nima Moghadam SN#: Saeed Nari SN#: Supervisor Dr. Saameti April 2005 Sharif University of Technology.

Speech Coding Submitted To: Dr. Mohab Mangoud Submitted By: Nidal Ismail.

MPEG Audio coders. Motion Pictures Expert Group(MPEG) The coders associated with audio compression part of MPEG standard are called MPEG audio compressor.

Introduction to SOUND.

1 Audio Compression. 2 Digital Audio  Human auditory system is much more sensitive to quality degradation then is the human visual system  redundancy.

Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.

CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2009.

ECE 5525 Osama Saraireh Fall 2005 Dr. Veton Kepuska

VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.

Digital Multiplexing 1- Pulse Code Modulation 2- Plesiochronous Digital Hierarchy 3- Synchronous Digital Hierarchy.

MMDB-8 J. Teuhola Audio databases About digital audio: Advent of digital audio CD in Order of magnitude improvement in overall sound quality.

Digital Audio III. Sound compression (I) Compression of sound data requires different techniques from those for graphical data Requirements are less stringent.

1 Audio Coding. 2 Digitization Processing Signal encoder Signal decoder samplingquantization storage Analog signal Digital data.

Chapter 20 Speech Encoding by Parameters 20.1 Linear Predictive Coding (LPC) 20.2 Linear Predictive Vocoder 20.3 Code Excited Linear Prediction (CELP)

CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2014.

Voice Sampling. Sampling Rate Nyquist’s theorem states that a signal can be reconstructed if it is sampled at twice the maximum frequency of the signal.

1 What is Multimedia? Multimedia can have a many definitions Multimedia means that computer information can be represented through media types: – Text.

Digital Audio I. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul.

Fundamentals of Multimedia Chapter 6 Basics of Digital Audio Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.

UNIT V. Linear Predictive coding With the advent of inexpensive digital signal processing circuits, the source simply analyzing the audio waveform to.

1 Speech Compression (after first coding) By Allam Mousa Department of Telecommunication Engineering An Najah University SP_3_Compression.

Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.

Digital Communications Chapter 13. Source Coding

Multimedia: Digitised Sound Data

Chapter 13 Basic Audio Compression Techniques

1 Vocoders. 2 The Channel Vocoder (analyzer) : The channel vocoder employs a bank of bandpass filters,  Each having a bandwidth between 100 HZ and 300.

CS 4594 Data Communications

Mobile Systems Workshop 1 Narrow band speech coding for mobile phones

Govt. Polytechnic Dhangar(Fatehabad)

Presentation transcript:

Submitted By: Santosh Kumar Yadav (111432) M.E. Modular(2011) Under the Supervision of: Mrs. Shano Solanki Assistant Professor, C.S.E NITTTR, Chandigarh Presentation On Audio Compression & Psychoacoustic 1

Content History of Audio Compression Basic of Audio Compression Categorization of Audio compression Silence compression ADPCM LPC CELP Psychoacoustics Frequency Masking Critical Band Temporal Masking 2

Historyof Audio Compression First form of audio compression came out in 1939, when Dudley first introduced the Vocoders to reduce the amount of bandwidth needed to transmit speech over a telephone line. In the 1960 compression was used in telephony. Now a days various compression techniques are used in Storage devices and with various File Formats. 3

Basic of Audio Compression Compression can be accomplished using two ways: a. Take the data from a standard digital audio system and compress it using S/W. b. To encode the signal in a different system and compressed by the H/W. The sounds we hear are caused by variation in air pressure which are picked up by our ear. In an analog electronic audio system, these pressure signals are converted to a electric voltage by a microphone. 4

Voice Pattern 5

Quantization of Voice Signal 6

Signal Reconstruction 7

Categorization of Audio Compression 8 Audio Compression Simple Audio Compression: 1.Silence Compression using RLE. 2.Adaptive Differential PCM. 3.Linear Predicative Coding. 4.Code Excited Linear Prediction. Psychoacoustic: 1.Frequency Masking 2.Temporal Masking. MPEG Audio Compression.

Silence Compression using RLE It is a form of lossless compression. It is easy to implement. Silence are replaced by the code and no of its consecutive sequence. Steps: a. Determine threshold for audio data. b. If the audio level is below the threshold, will be considered as silence. c. Silence in the audio is replace by code(e.g.”0”), The higher the threshold level more will be compression and hence more will the loss of info. Silence encoding is important for human speech as it has flat pauses between the spoken words. 9

Adaptive Differential PCM Used for quantization of audio signal. Defined the scaled difference signal fn as: e n is difference between two signals, α is multiplier constant. f n is fed into the quantizer for quantization. 10

Vocoders It is Voice Coders. Used in Linear Predictive Coding. Used for filtering various frequency range by using sub band filters. Consonants like M, N can be taken as voice as it uses vocal cord. 11 Sound Voice (Pulse like Vowels) Unvoice (Noise like Consonants)

Working of Vocoder Pitch of period of voice is considered. Voiced/unvoiced bit is set for voice and reset for unvoiced. Frequency of the sound is filtered by various filters. Signal transmitted to receiver end and then decoded there. 12

Linear Predictive Coding LPC vocoders extracts salient features of speech directly from the waveform rather than transforming the signal to the frequency domain. Bit rate is small as sound is not sent but its analyzed attributes are sent. Attributes or description parameters (like gain, max and min amplitude etc). 13 Sound Signal Segments Sample (Speech Frames)

Linear Predictive Coding LPC decide whether the current segment is voiced or unvoiced. For unvoice: Noise generator is used to create sample values f(n). For voice: Pulse train generator is used to create sample values f(n). S(n) is current o/p, s(n-i) represents the previous o/p, G is gain factor, f(n) is current frame input. It is called linear because it consider previous output also and act linearly. The speech encoder works in a block-wise fashion. Adv: Simple and easy to implement. Disadv: Error factor in generated o/p is more. 14

Code Excited Linear Prediction It is more complex. There is a code book of excitation vector to which actual speech is matched and the index of the best match is sent to the receiver. This complexity increase the bitrates to bps. CELP codes has two kinds of predictions: A. STP (Short time prediction): Predict within the sample and remove redundancy within speech frames. B. LTP(Long time prediction): Removes redundancy within the segment. Adv: It nearly produce the original sound. Disadv: It is complex and requires more bandwidth. 15

Psychoacoustic Psychoacoustics modeling referred to as perceptual coding. Range of human hearing 20Hz to 20KHz. Most audible range 500Hz to 4KHz. Maximum amplitude of quietest sound human can hear is 120 dB. 16

Equal Loudness Relation 17

Frequency Masking Threshold of Hearing: 18

Frequency Masking Curve The greater the power in the masking tone the wider its influence- then broader the range of frequency it can mask. If two tones are widely separated in frequency, little masking occurs. 19

Multiple Frequency Tone Masking 20

Critical Band The critical band represents the ears resolving power for simultaneous tones or partials. 21

Bark Unit Critical band unit given by Heinrich Barkhausen. 22

Temporal Masking The louder the test tone, the shorter the amount of time required before the test tone is audible once the masking tone is removed. 23

Summary Basic of Audio Compression. Types of Audio Compression. Fundamentals of psychoacoustics. 24

FAQ’s Why linear predictive coding is called linear? What is the significance of equal-loudness curve? How RLE can be applied on audio? What is the role of noise generator and pulse generator in Vocoder? 25

Refrences Fundamental of Multimedia by Le & Drew

Queries ? 27

Thank You For Your Patience 28