A primer by J. D. Johnston Microsoft Corporation

Slides:

Advertisements

Similar presentations

Decibel values: sum and difference. Sound level summation in dB (1): Incoherent (energetic) sum of two different sounds: Lp 1 = 10 log (p 1 /p rif ) 2.

Advertisements

Frequency analysis.

Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements Christopher A. Shera, John J. Guinan, Jr., and Andrew J. Oxenham.

My list of 10 worst mistakes in Audio (some of which are pet peeves) J. D. (jj) Johnston Neural Audio Kirkland, Wa, USA.

Principles of Electronic Communication Systems

SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.

Why do we hear what we hear? James D. Johnston Chief Scientist DTS, Inc.

Why do we hear what we hear? James D. Johnston Chief Scientist, DTS, Inc.

Audio Compression ADPCM ATRAC (Minidisk) MPEG Audio –3 layers referred to as layers I, II, and III –The third layer is mp3.

Digital Coding of Analog Signal Prepared By: Amit Degada Teaching Assistant Electronics Engineering Department, Sardar Vallabhbhai National Institute of.

Hearing and Deafness 2. Ear as a frequency analyzer Chris Darwin.

CS 551/651: Structure of Spoken Language Lecture 11: Overview of Sound Perception, Part II John-Paul Hosom Fall 2010.

PHYSICS OF SOUND PHYSICS OF SOUND HEARING CONSERVATION PROGRAM 1 28 Jan 2013.

The frequency spectrum

Digital Representation of Audio Information Kevin D. Donohue Electrical Engineering University of Kentucky.

SIMS-201 Characteristics of Audio Signals Sampling of Audio Signals Introduction to Audio Information.

IT-101 Section 001 Lecture #8 Introduction to Information Technology.

Sound Chapter 15.

Pegasus Lectures, Inc. Volume II Companion Presentation Frank Miele Pegasus Lectures, Inc. Ultrasound Physics & Instrumentation 4 th Edition.

The Doppler Effect A source emits a sound of constant frequency. If the apparent frequency of the source is increased which of the following is true? A.

A.Diederich– International University Bremen – Sensation and Perception – Fall Frequency Analysis in the Cochlea and Auditory Nerve cont'd The Perception.

PH 105 Dr. Cecilia Vogel Lecture 10. OUTLINE  Subjective loudness  Masking  Pitch  logarithmic  critical bands  Timbre  waveforms.

SUBJECTIVE ATTRIBUTES OF SOUND Acoustics of Concert Halls and Rooms Science of Sound, Chapters 5,6,7 Loudness, Timbre.

Multimedia Data Introduction to Image Processing Dr Mike Spann Electronic, Electrical and Computer.

Sound Transmission and Echolocation Sound transmission –Sound properties –Attenuation Echolocation –Decoding information from echos.

Equalization Changing the curve. What is an EQ? An Equalizer –Is generally a frequency-specific amplifier –Is made up of filters (passive or active) –Is.

1 Live Sound Reinforcement Audio measurements. 2 Live Sound Reinforcement One of the most common terms you will come across when handling any type of.

Calibration & Curve Fitting

Random Processes and LSI Systems What happedns when a random signal is processed by an LSI system? This is illustrated below, where x(n) and y(n) are random.

1 Live Sound Reinforcement Equalizers and other signal processing equipment.

Ni.com Data Analysis: Time and Frequency Domain. ni.com Typical Data Acquisition System.

Loudness October 18, 2006 What is it?? The Process.

Lecture 17 October 29, On Wednesday - Thumping.

EE Audio Signals and Systems Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.

Art 321 Sound, Audio, Acoustics Dr. J. Parker. Sound What we hear as sound is caused by rapid changes in air pressure! It is thought of as a wave, but.

Dr. Richard Young Optronic Laboratories, Inc..  Uncertainty budgets are a growing requirement of measurements.  Multiple measurements are generally.

Dynamic Range and Dynamic Range Processors

The Care and Feeding of Loudness Models J. D. (jj) Johnston Chief Scientist Neural Audio Kirkland, Washington, USA.

COSC 1P02 Introduction to Computer Science 4.1 Cosc 1P02 Week 4 Lecture slides “Programs are meant to be read by humans and only incidentally for computers.

When to Code WHEN NOT TO CODE James D. (jj) Johnston Chief Scientist DTS, Inc.

Multimedia Data Introduction to Image Processing Dr Sandra I. Woolley Electronic, Electrical.

What should we be reading?? Johnston Johnston –Interlude - 2 piano –Interlude - 6 percussion –Chapter 7 – hearing, the ear, loudness –Appendix II – Logarithms,

David Meredith Aalborg University

Applied Psychoacoustics Lecture 3: Masking Jonas Braasch.

Chapter 6. Effect of Noise on Analog Communication Systems

SOUND PRESSURE, POWER AND LOUDNESS MUSICAL ACOUSTICS Science of Sound Chapter 6.

Psychophysics and Psychoacoustics

Encoding and Simple Manipulation

Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.

Vibrationdata 1 Unit 6a The Fourier Transform. Vibrationdata 2 Courtesy of Professor Alan M. Nathan, University of Illinois at Urbana-Champaign.

Frequency Modulation ECE 4710: Lecture #21 Overview:

Introduction to psycho-acoustics: Some basic auditory attributes For audio demonstrations, click on any loudspeaker icons you see....

Sinyal, Power, Spectrum. A graph of the intensity plotted against the frequency (showing the amount of each color) is the frequency spectrum of the light.

Pitch What is pitch? Pitch (as well as loudness) is a subjective characteristic of sound Some listeners even assign pitch differently depending upon whether.

Signal Analyzers. Introduction In the first 14 chapters we discussed measurement techniques in the time domain, that is, measurement of parameters that.

Digital Audio I. Acknowledgement Some part of this lecture note has been taken from multimedia course made by Asst.Prof.Dr. William Bares and from Paul.

SOUND PRESSURE, POWER AND LOUDNESS

The normal approximation for probability histograms.

Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.

By. Jadhav Avinash J Roll no - 2K13E11. Reference: Hewlett Packard Agilent Technology Wikipedia GwINSTEK.

The Care and Feeding of Loudness Models J. D. (jj) Johnston Neural Audio Kirkland, Washington, USA.

Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.

Noise & Sound Graeme Murphy – National Brand Manager, Industrial Equipment.

Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.

Loudness level (phon) An equal-loudness contour is a measure of sound pressure (dB SPL), over the frequency spectrum, for which a listener perceives a.

"Digital Media Primer" Yue-Ling Wong, Copyright (c)2013 by Pearson Education, Inc. All rights reserved.

EE Audio Signals and Systems

A primer by J. D. Johnston Microsoft Corporation

A primer by J. D. Johnston Microsoft Corporation

Human Hearing 096 Part 1.

Presentation transcript:

A primer by J. D. Johnston Microsoft Corporation LOUDNESS vs. INTENSITY A primer by J. D. Johnston Microsoft Corporation Copyright © 2006 by James. D. Johnston Permission granted for any educational use. All other rights reserved.

What’s the big deal. (why do I yammer on about loudness vs What’s the big deal? (why do I yammer on about loudness vs. intensity, anyhow?) Language exists to communicate. If we don’t use the same meanings to the words (leaving out philosophical issues for now), communication is difficult. Hence, a few definitions and words to help us in the discussions. This understanding will also help explain some of the reasons why different compression sounds so different.

Intensity – What is it? Intensity is the EXTERNAL MEASURED LEVEL OF A SOUND. There are many measures of intensity. For now, we will use the common psychoacoustic (not mechanical engineering) definition of SPL, or “sound pressure level”. There are assumptions built into SPL. Let’s keep it simple for now and forget them. SPL, usually measured in dB, is a measure of acoustic energy in the atmosphere.

Ok, now how about electronics? In electronics, it’s still the same, except that we do not know the meaning of 0dB in acoustic terms. Therefore: dBm, dBv, and other defined reference levels provide a way to understand what the intensity of the analog of a signal is. There are assumptions (constant impedance, for instance) built into these level measures as well. Again, let us keep it simple for now.

What is Loudness? Loudness is the INTERNAL, SUBJECTIVE EXPERIENCE OF HOW LOUD A SIGNAL IS. The term loudness dates back at least to Fletcher, if not beyond. Loudness is not intensity Loudness and intensity can be mostly related by a complex calculation, however: Every listener is a bit different Hearing injuries affect loudness in many ways Intensity does NOT equal loudness. The relationship is complex, and could constitute an entire tutorial in and of itself. In the worst cases, intensity is a very poor substitute for loudness, and vice versa.

When do we use Intensity? Intensity is an OBJECTIVE measure. It measures the actual fluctuations in air pressure in the atmosphere that constitute sound. We use intensity when that is what we need to know. We do NOT use intensity when we want to know how loud a sound is to a listener.

When do we use Loudness When we are trying to describe the experience that the listener has. When we are trying to estimate psychoacoustic parameters. When we want to know why somebody is shouting “turn that **** thing down!” and the level (intensity) isn’t that high. When we want to know why somebody is shouting “TURN UP THE SOUND” when the intensity is already excessive. When we want to match loudness, NOT level, across audio selections, either full-bandwidth or from remotes/phones.

What can we say about Loudness vs What can we say about Loudness vs. Intensity in the course of 10 minutes? When the spectrum (frequency content) of a signal is unchanged, loudness is approximately (sometimes very approximately for some special signals) proportional to the 1/3.5 power of the signal power, or the 1/1.75 power of amplitude.

REMEMBER – All of these relationships are approximate! What else? The ear is understood to have “critical bands” or “ERB’s”. These relate to a mechanical filtering system in the ear that does frequency analysis. Signals in the same “band” convert intensity to loudness approximately with the power law relationship given in the last slide. Signals in different bands ADD linearly in loudness. REMEMBER – All of these relationships are approximate!

Yes, and? This means that two signals at frequencies reasonably removed from each other, each of which has ½ the energy of the original, will have 2*(1/2)^(1/3.5) or 1.64 of the loudness of one of the signals presented at energy 1. If I double the energy of a signal without changing the spectrum, the loudness will increase by a factor of about 1.21 If I double the energy of the signal by adding as much energy in at a frequency where energy was not present, the loudness doubles.

What did I just say, anyhow? If we double the energy of a sine wave (to use a simple example), its loudness rises by approximately 2^(1/3.5). If we add another, second sine wave at a frequency removed (in terms of critical bands) from the first, the loudness doubles. The difference in loudness ratio for these two signals with the same energy is 1.21 vs. 2.

And that can be worse. If we take the same amount of energy, and spread it out from a tone across 1, 2, 3, 4, critical bandwidths, the loudness will change (increase) as the energy is spread out, as long as the signal spectrum is somewhat above the absolute threshold of hearing.

A demonstration The two files I am about to play have the same energy. In one case the energy is all in one sine wave In the other case, it’s in 3 sine waves an octave apart (each) (press ESC to cancel the playback)

A graphical example Relative Loudness Number of bands In this slide, the vertical axis is the relative loudness, with the single-band loudness set to ‘1’ for simplicity. The curve shows the relative loudness when the same amount of ENERGY is split over ‘n’ bands, from 1 to 25. The numbers for over 15 bands are probably an overestimate, but that is signal dependent.

Two things to recall about that graph The range of loudness corresponding to a given signal energy can vary by about a factor of 10. The graph is approximate. It is ONLY approximate. Effects due to different listeners will change it Effects regarding absolute threshold will change it It’s only an approximation in any case. The power law is quite approximate to start with. None the less, the point holds that loudness is not very strongly correlated to intensity.

Loudness vs. dB dB SPL is a measure of the INTENSITY of a signal. dB does not measure loudness. A doubling of loudness is about 10dB or so. These numbers were established by a set of experiments done by Fletcher, after Bell’s original experiments. The work was confirmed by many other experimenters. The level change to double loudness depends on both level and frequency The design of the experiment is a full tutorial topic in itself.

Some other demos Another example of narrow vs. broadband signals. Narrowband noise (white noise, 1 critical bw wide) Broadband noise (white noise) Two music clips with the same energy N.B. since these are not stationary, the comparison is much harder N.B. these clips not presented due to DRM.

So what’s this “loudness” button? There is a “loudness” button on my receiver/amplifier/whatever. What is this? Because of the well-known reduced sensitivity of the ear at low frequencies, some manufacturers choose to add a bass boost, sometimes variable, and call it a “loudness” control. It’s linear, time-invariant. The ear is neither. It really is much more difficult than a fixed frequency shaping. The best you can say is that it makes the sound louder. It can not fully compensate for changes in sensation level without being signal dependent and time varying.

Why the Loudness Button can’t do what it is supposed to do. A real “loudness restorer” that worked for different intensities of presentation levels would have to be time-varying and signal dependent. It’s not that easy.

Moving forward We should be careful to use the terms “loudness” and “intensity” properly, and to bear in mind how the two are loosely related. When we use nonlinear processing such as level compression that spreads frequency content, we must realize that the loudness may rise faster than expected.

So, does that mean that there is such a thing as “loudness enhancement”? Well, leaving aside the metaphysical question of exactly what an enhancement is, yes. If a nonlinearity spreads the spectrum of a signal, it is likely to become louder.

There are some common examples. LP distortion grows with level. That means that as level grows, the signal bandwidth (including the distortion) increases. This means that an increase in intensity is over-represented by the increase in loudness. This can create an illusion of “more dynamic range”. It can also be very annoying. Tape distortion grows with level. It behaves different than LP’s, but to the same result, at usual saturation levels.

What about “Make it LOUD Sorts of things? Oh, yes, they work. It is a bit of an art to make a signal have a peak to RMS ratio similar to that of a sine wave. They spread the spectrum very broadly. You can create your own opinion about how they sound.

So, how would one attempt to measure loudness? Pick a time window (this can vary depending on application and need) Do a frequency analysis in this time window. Group together the energies in each 1/3 ERB of the signal (or so). Spread the data per understanding of cochlear filters. Generate an ERB by ERB power spectrum at 1/3 ERB intervals. Compress that spectrum (using somwhere between 1/3 and ¼ power depending on the signal) Normalize each ERB (i.e. if the spread is 1/3 ERB centers, multiply each by 1/3) Sum up those partial loudnesses to get total loudness for the given block

Could you be a bit more specific? Unfortunately, no. At least not in a 1-hour presentation.

Questions?