Audio Henning Schulzrinne Dept. of Computer Science Columbia University Fall 2003.

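Common narrowband audio codecs

Columns: codec; rate (kb/s); delay (ms); multi-rate; embedded; VBR; bit-robust/PLC; remarks

– iLBC: bit-robust/PLC /X; quality higher than G.729A; no licensing
– Speex: multi-rate X; embedded X; VBR X; bit-robust/PLC --/X; no licensing
– AMR-NB: multi-rate X; bit-robust/PLC X/X; 3G wireless
– G.…: bit-robust/PLC X/X; TDMA wireless
– GSM-FR: rate 13; delay 20; GSM wireless (Cingular)
– GSM-EFR: rate 12.2; delay 20; bit-robust/PLC X/X; 2.5G
– G.…: bit-robust/PLC X/X; H.320 (ISDN videoconferencing)
– G.…: bit-robust/PLC X/--; H.323, videoconferences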
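The rate and delay columns can be combined into a per-frame payload size. The sketch below is only an illustration: it assumes that the 20 ms delay listed for GSM-FR and GSM-EFR equals the frame duration, and the function names are made up for this example.

```python
def frame_payload_bits(rate_kbps: float, frame_ms: float) -> float:
    """Bits produced per frame: kb/s times ms gives bits directly."""
    # (1000 bit/s) * (1/1000 s) = 1 bit, so no extra scaling is needed.
    return rate_kbps * frame_ms


def frames_per_second(frame_ms: float) -> float:
    """Number of frames (and packets, at one frame per packet) per second."""
    return 1000.0 / frame_ms


# Rates and delays taken from the table; treating delay as frame length
# is an assumption of this sketch.
codecs = {"GSM-FR": (13.0, 20.0), "GSM-EFR": (12.2, 20.0)}

for name, (rate_kbps, frame_ms) in codecs.items():
    bits = frame_payload_bits(rate_kbps, frame_ms)
    print(f"{name}: {bits:.0f} bits per frame, "
          f"{frames_per_second(frame_ms):.0f} frames/s")
```

At one frame per packet, 50 packets per second each carry only a few dozen bytes of speech, so packet headers take a large share of the bandwidth.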

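Common wideband audio codecs

Columns: codec; rate (kb/s); delay (ms); multi-rate; embedded; VBR; bit-robust/PLC; remarks

– Speex: rate 4–…; multi-rate X; embedded X; VBR X; bit-robust/PLC --/X; no licensing
– AMR-WB: rate 6.6–…; multi-rate X; bit-robust/PLC X/X; 3G wireless
– G.722: rate 48, 56, …; delay (1.5); bit-robust/PLC X/--; 2 sub-bands; now dated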

iLBC – MOS behavior with packet loss

Recent audio codecs
iLBC: optimized for high packet loss rates (frames encoded independently)
AMR-NB
– 3G wireless codec
– … kb/s
– 20 ms coding delay

Speex
Open-source patent-free speech codec
CELP (code-excited linear prediction) codec
operating modes:
– narrowband (8 kHz sampling rate): 2.15 – 24.6 kb/s, delay of 30 ms
– wideband (16 kHz sampling rate): … kb/s, delay of 34 ms
– ultra-wideband (32 kHz sampling rate)
intensity stereo encoding
variable bit rate (VBR) possible
voice activity detection (VAD)

Ogg Vorbis
Similar in application to AAC, MP3, VQF, …, but claims to be free of patents
Ogg = container file format (also for Speex, FLAC)
Vorbis = music speech codec
near-CD quality at 160 kb/s
forward-adaptive modified DCT (discrete cosine transform)
– overlapping windows
– floor: carries the frequency representation as a piecewise-linear interpolated curve on a dB amplitude scale and a linear frequency scale
– residue: subtract out the floor, then apply cascaded (multi-pass) vector quantization
– entropy (Huffman) coding
carries codec parameters in header
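The floor/residue split can be illustrated with a toy example. This is a sketch of the idea only, not the Vorbis bitstream format: the floor points, the spectrum values, and the function names are invented for illustration, and the vector quantization and Huffman stages are left out.

```python
def floor_curve_db(points, num_bins):
    """Piecewise-linear floor on a dB amplitude scale and linear bin scale.

    points: sorted (bin_index, level_dB) pairs covering bins 0..num_bins-1.
    """
    curve = []
    for k in range(num_bins):
        for (x0, y0), (x1, y1) in zip(points, points[1:]):
            if x0 <= k <= x1:
                t = (k - x0) / (x1 - x0)      # position inside the segment
                curve.append(y0 + t * (y1 - y0))
                break
        else:
            raise ValueError(f"bin {k} is outside the floor points")
    return curve


def split_floor_residue(spectrum_db, points):
    """Subtract the floor from the spectrum (both in dB) to get the residue."""
    floor = floor_curve_db(points, len(spectrum_db))
    residue = [s - f for s, f in zip(spectrum_db, floor)]
    return floor, residue


# Hypothetical data: three floor points and a nine-bin spectrum in dB.
points = [(0, -20.0), (4, -40.0), (8, -60.0)]
spectrum_db = [-18.0, -27.5, -31.0, -33.0, -41.0, -44.0, -52.5, -55.0, -61.0]

floor, residue = split_floor_residue(spectrum_db, points)
rebuilt = [f + r for f, r in zip(floor, residue)]  # decoder side: floor + residue
print(floor)
print(residue)
```

The floor needs only a handful of points to describe the coarse spectral envelope, which leaves a residue with a much smaller range for the quantizer to handle.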

Sound localization
Human ear uses 3 metrics for stereo localization:
– intensity
– time of arrival (TOA) – 7 µs
– direction filtering and spectral shaping by outer ear
For shorter wavelengths (4 – 20 kHz), the head casts an acoustical shadow, giving rise to a lower sound level at the ear farthest from the sound source
At long wavelengths (20 Hz – 1 kHz), the head is very small compared to the wavelength
– In this case localization is based on perceived Interaural Time Differences (ITD)
UCSC CMPE250 Fall 2002
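A rough feel for the size of these time differences comes from a simple path-length model, ITD = d · sin(azimuth) / c. The sketch below rests on assumptions that are not in the slides: an ear spacing d of 0.18 m, a speed of sound c of 343 m/s, and a reading of the 7 µs figure as the smallest detectable arrival-time difference.

```python
import math

SPEED_OF_SOUND_M_S = 343.0   # assumed: air at about 20 °C
EAR_SPACING_M = 0.18         # assumed: typical distance between the ears


def itd_seconds(azimuth_deg: float) -> float:
    """Interaural time difference for a distant source at the given azimuth.

    0 degrees is straight ahead, 90 degrees is directly to one side.
    """
    return EAR_SPACING_M * math.sin(math.radians(azimuth_deg)) / SPEED_OF_SOUND_M_S


def azimuth_for_itd_deg(itd_s: float) -> float:
    """Azimuth (from straight ahead) that produces the given ITD."""
    return math.degrees(math.asin(itd_s * SPEED_OF_SOUND_M_S / EAR_SPACING_M))


max_itd_us = itd_seconds(90.0) * 1e6
smallest_angle_deg = azimuth_for_itd_deg(7e-6)

print(f"largest ITD (source at 90 degrees): {max_itd_us:.0f} microseconds")
print(f"azimuth giving a 7 microsecond ITD: {smallest_angle_deg:.2f} degrees")
```

Under these assumptions the largest possible ITD is roughly half a millisecond, and a 7 µs difference corresponds to a source less than one degree off center.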

Audio samples
cs.html
Speex:
– both narrowband and wideband