Presentation is loading. Please wait.

Presentation is loading. Please wait.

Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open.

Similar presentations


Presentation on theme: "Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open."— Presentation transcript:

1 Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open source speech codecs beneath 5000 bit/s ● work in progress ● samples & source: rowetel.com/codec2.html

2 Why Open Source? ● Ham radio is an experimental service ● we need to be able to experiment, understand, and modify ● open source means no license fees, e.g. include in SDR systems for free ● freedom to run on your PC or an embedded CPU

3 Proprietary Codecs ● come in hardware or licensed software form ● difficult to distribute ● they cannot be modified ● understanding how they work is discouraged ● modification may actually be illegal under the license

4 Codec 2 Author - David Rowe ● Adelaide, South Australia ● VK5DGR, first licensed over 30 years ago at age 13 ● 20 years experience in speech coding ● Built some of the first real time speech codecs in the late 1980's on early DSP chips ● Now work full time on open software/open hardware for developing world communications ● http://rowetel.com

5 Digital Voice Radio System A/D codec2 enc FEC enc mod HF/VHF radio D/A codec2 dec FEC dec demod mic spk r

6 Patents and Codecs ● The authors of proprietary/patented codecs borrowed heavily from the public domain ● Perhaps 5% of the algorithms they use are original and patented ● 95% of the algorithms in these codecs are public domain algorithms ● To build an equivalent codec, we simply need alternatives for the 5% that is patented

7 Speech Coding ● Take speech samples (e.g. 16 bit samples at 8 kHz sampling rate) ● Convert to 2400 bit/s ● What can we throw away? ● Retain intelligible speech ● Retain natural speech ● Use a model of speech, send model parameters

8 Model Parameter ● example of a model parameter is pitch ● for humans in the range 50 to 500 Hz ● can be quantised to 7 bits ● updated every 20 ms ● so 7/0.02 = 350 bit/s to represent pitch

9 Sinusoidal Speech Coding Pitch Period 35 samples or 4.4ms at 8kHz sample rate (230 Hz) Time (samples) Amplitude (16 bit samples)

10 Sinusoidal Speech Coding Amplitud e (dB) Frequency (Hz) Pitch 230Hz or 4.3ms Harmonics of 230Hz

11 Sinusoidal Speech Model Amplitude 1 Phase 1 Frequency 1 Amplitude 2 Phase 2 Frequency 2 Amplitude L Phase L Frequency L

12 Amplitude Modelling ● Adjacent amplitudes have similar values ● This leads to coding efficiencies ● We use LPC to represent amplitudes ● digital filter, updated regularly ● fixed number of parameters (e.g. 10) ● LPC envelope approximates amplitudes ● Sampled at the decoder to recover amplitudes

13 Amplitude Modelling

14 Encoder Block Diagram LPC Analysis MBE Voicing est FFT 16 bit, 8kHz samples LPC to LSP Quant Energy Quant Pitch est Pitch Quant LPC Correcti on 2550 bit/s quantised model parameters

15 Bit Allocation ● Alpha V0.1 codec, subject to rapid change ● 51 bits per 20ms frame, or 2550 bit/s

16 Decoder Block Diagram Inverse FFT 16 bit, 8kHz samples Recover Harm Amps LSP to LPC Overlap Add Post Filter FFT LSPs Energ y LPC Correctio n Phase Synthesi s Voicin g

17 Samples ● male ● speaker dependence (open source development) ● background noise ● try samples on rowetel.com/codec2.html with headphones or speakers

18 Prior Art Summary ● Sinusoidal Coding, Mcaulay & Quatieri, 1984 ● Linear Predictive Coding, Makhoul, 1975 ● Line Spectrum Pairs, Itakura, 1975 ● MBE Voicing, Griffin & Lim, 1988 ● Overlap Add, Tribolet & Crochiere, 1979 ● NLP Pitch Estimation, Rowe, 1999 ● LPC Amplitude Recovery (algorithm used here), Rowe, 1991, 1999, 2009 ● Post Filter, Rowe, 2009

19 Further Work ● Better phase model and voicing estimator ● Toll quality at 2000 bit/s ● Lower bit rate, 2400, 1200 bit/s ● Better background noise performance ● FEC and non-redundant error correction ● Integration with modem and test over radio channels ● Fixed point and DSP chip implementation

20 Brainstorms ● what can we do with Codec 2. HF rather than VHF? ● how can we get people using it? ● work with others to integrate with modem and FEC code ● create a digital voice application that can run on a laptop ● novel combinations of codec, FEC, modulation ● PSK31 low bit rate voice mode ● Better than SSB on HF?


Download ppt "Codec 2 ● open source speech codec ● low bit rate (2400 bit/s and below) ● applications include digital speech for HF and VHF radio ● fills gap in open."

Similar presentations


Ads by Google