DCSP-13 Jianfeng Feng Department of Computer Science Warwick Univ., UK

Slides:

Advertisements

Similar presentations

Analog to digital conversion

Advertisements

DCSP-12 Jianfeng Feng

DCSP-13 Jianfeng Feng

DCSP-5: Noise Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-17: Matched Filter Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-21 Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-20 Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-13 Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-22 Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-3: Fourier Transform Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-11 Jianfeng Feng

DCSP-12 Jianfeng Feng Department of Computer Science Warwick Univ., UK

DCSP-16 Jianfeng Feng Department of Computer Science Warwick Univ., UK

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.

Copyright 2003, Marchany Hiding Text in MP3 Files Randy Marchany VA Tech Computing Center Blacksburg, VA

Guerino Mazzola (Fall 2014 © ): Introduction to Music Technology IIIDigital Audio III.6 (Fr Oct 24) The MP3 algorithm with PAC.

CS335 Principles of Multimedia Systems Audio Hao Jiang Computer Science Department Boston College Oct. 11, 2007.

A Phonetician ’ s Guide to Audio Formats Chilin Shih University of Illinois at Urbana Champaign LSA 2006January 5-8, 2006.

Time-Frequency Analysis Analyzing sounds as a sequence of frames

Dale & Lewis Chapter 3 Data Representation Analog and digital information The real world is continuous and finite, data on computers are finite  need.

Chapter 4: Representation of data in computer systems: Sound OCR Computing for GCSE © Hodder Education 2011.

1 Chapter 16 Fourier Analysis with MATLAB Fourier analysis is the process of representing a function in terms of sinusoidal components. It is widely employed.

AUDIO COMPRESSION TOOLS & TECHNIQUES Gautam Bhattacharya.

DREAM PLAN IDEA IMPLEMENTATION Introduction to Image Processing Dr. Kourosh Kiani

1 Audio Compression Techniques MUMT 611, January 2005 Assignment 2 Paul Kolesnik.

1 Speech Parametrisation Compact encoding of information in speech Accentuates important info –Attempts to eliminate irrelevant information Accentuates.

Chapter 12 Fourier Transforms of Discrete Signals.

Digital Audio, Image and Video Hao Jiang Computer Science Department Sept. 6, 2007.

T.Sharon-A.Frank 1 Multimedia Image Compression 2 T.Sharon-A.Frank Coding Techniques – Hybrid.

EE2F1 Speech & Audio Technology Sept. 26, 2002 SLIDE 1 THE UNIVERSITY OF BIRMINGHAM ELECTRONIC, ELECTRICAL & COMPUTER ENGINEERING Digital Systems & Vision.

Warped Linear Prediction Concept: Warp the spectrum to emulate human perception; then perform linear prediction on the result Approaches to warp the spectrum:

 Principles of Digital Audio. Analog Audio  3 Characteristics of analog audio signals: 1. Continuous signal – single repetitive waveform 2. Infinite.

A Full Frequency Masking Vocoder for Legal Eavesdropping Conversation Recording R. F. B. Sotero Filho, H. M. de Oliveira (qPGOM), R. Campello de Souza.

Introduction to Sound Sounds are vibrations that travel though the air or some other medium A sound wave is an audible vibration that travels through.

LE 460 L Acoustics and Experimental Phonetics L-13

Digital Audio What do we mean by “digital”? How do we produce, process, and playback? Why is physics important? What are the limitations and possibilities?

Fundamentals Rawesak Tanawongsuwan

GCT731 Fall 2014 Topics in Music Technology - Music Information Retrieval Overview of MIR Systems Audio and Music Representations (Part 1) 1.

CSCI-235 Micro-Computers in Science Hardware Part II.

The Wavelet Tutorial: Part3 The Discrete Wavelet Transform

Multiresolution STFT for Analysis and Processing of Audio

Acoustic Analysis of Speech Robert A. Prosek, Ph.D. CSD 301 Robert A. Prosek, Ph.D. CSD 301.

More Meaningful Jargon Or, All You Need to Know to Speak Like a Geek Sound.

Digital Recording. Digital recording is different from analog in that it doesn’t operate in a continuous way; it breaks a continuously varying waveform.

Introduction to Digital Signals

Time Frequency Analysis

Marwan Al-Namari 1 Digital Representations. Bits and Bytes Devices can only be in one of two states 0 or 1, yes or no, on or off, … Bit: a unit of data.

Lecture#10 Spectrum Estimation

CSCI-100 Introduction to Computing Hardware Part II.

AUDIOFILES Harika Basana ), Elizabeth Chan ), Nikolai ), Frank Zhang ) 6100.

COMP135/COMP535 Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 2 Lecture 2 – Digital Representations.

Time Compression/Expansion Independent of Pitch. Listening Dies Irae from Requiem, by Michel Chion (1973)

Information Systems Design and Development Media Types Computing Science.

Lifecycle from Sound to Digital to Sound. Characteristics of Sound Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20Hz – 20KHz] Speech: [200Hz.

MP3 and AAC Trac D. Tran ECE Department The Johns Hopkins University Baltimore MD

Data Representation: Sound

Objective and Subjective Audio Assessment of MP3 Players’ Quality

III Digital Audio III.6 (Fr Oct 20) The MP3 algorithm with PAC.

Department of Computer Science Warwick Univ., UK

Department of Computer Science Warwick Univ., UK

Homework 1 (Due: 11th Oct.) (1) Which of the following applications are the proper applications of the short -time Fourier transform? Also illustrate.

III Digital Audio III.6 (Mo Oct 22) The MP3 algorithm with PAC.

EEE 244-6: Fourier Transform (FFT) and Signal Processing

COMS 161 Introduction to Computing

Audio Compression Techniques

Govt. Polytechnic Dhangar(Fatehabad)

Introduction to MATLAB

Fourier Transforms of Discrete Signals By Dr. Varsha Shah

Presentation transcript:

DCSP-13 Jianfeng Feng Department of Computer Science Warwick Univ., UK

Applications Power spectrum estimate Compression

clear all close all sampling_rate=100; %Hz omega=20; %signal frequecy Hz N=10000; %total number of samples for i=1:N x_sound(i)=cos(2*pi*omega*i/sampling_rate); %signal x(i)=cos(2*pi*omega*i/sampling_rate)+2*randn(1,1); %signal+noise axis(i)=sampling_rate*i/N; % for psd time(i)=i/sampling_rate; % for time trace end subplot(1,2,1) plot(time,x); %signal + noise, time trace xlabel('time (sec)'); ylabel('signal') subplot(1,2,2) plot(axis,abs(fft(x)).^2,'r'); % power of signal xlabel('Frequency') ylabel('Power') sound(x_sound, sampling_rate); %true signal sound

Singnal processing demo: transformation A few words on Matlab periodgram (fft) pwelch (overlapped windows)

Power spectrum for white noise Noise is a stochastic process x(t), for time t (discrete or continuous) Most noisy noise should have no memory, which impliese that E x(t)x(t+s) = 0 if s is not zero E x(t)x(t) = 1 or in another words E x(t)x(t+s) =  (s)

Therefore the psd of the white noise is flat: it has constant power for all frequencies, as confirmed in the previous matlabe example Different from all meaningful signals we encount

Spectrogram A spectrogram is an image that shows how the power spectrum of a signal varies with time.

Time Frequenc y

t=0:0.001:20; % 2 1kHz sample rate y=chirp(t,100,1,200,'q'); % 100Hz, cross 200Hz at t=1sec spectrogram(y,128,120,128,1E3); % Display the spectrogram title('Quadratic Chirp: start at 100Hz and cross 200Hz at t=1sec'); sound(y)

Compression

Sampling and reconstruction The question we consider here is under what conditions we can completely reconstruct the original signal x(t) from its discretely sampled signal x(n).

The use in MP3 is designed to greatly reduce the amount of data required to represent the audio recording and still sound like a faithful reproduction of the original uncompressed audio for most listeners.

An MP3 file could result in a file that is about 1/11 th the size of the file created from the original audio source.

The compression works by reducing accuracy of certain parts of sound that are deemed beyond the auditory resolution ability of most people.

This method is commonly referred to as perceptual coding.

The compression works by reducing accuracy of certain parts of sound that are deemed beyond the auditory resolution ability of most people. This method is commonly referred to as perceptual coding. It internally provides a representation of sound within a short-term time/frequency analysis window, by using psychoacoustic models to discard or reduce precision of components less audible to human hearing, and recording the remaining information in an efficient manner.

This technique is often presented as relatively conceptually similar to the principles used by JPEG, an image compression format.