U of I CS 414 - Digital Audio Representation - D2770109

Home> Schools> University of Illinois> Computer Science (CS) > CS 414> Digital Audio Representation

DOC PREVIEW

U of I CS 414 - Digital Audio Representation

School name University of Illinois

Course Cs 414- Multimedia Systems

Pages 30

This preview shows page 1-2-14-15-29-30 out of 30 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 30 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio RepresentationAdministrativeKey QuestionsCharacteristics of SoundDigital Representation of AudioSampling (in time)Nyquist TheoremNyquist LimitLimited SamplingSlide 10Sampling RangesSlide 12QuantizationSlide 14Pulse Code ModulationPCM Quantization and DigitizationLinear Quantization (PCM)Non-uniform QuantizationPerceptual Quantization (u-Law)Differential Pulse Code Modulation (DPCM)Differential Quantization (DPCM)Adaptive Differential Pulse Code Modulation (ADPCM)Signal-to-Noise RatioSignal To Noise RatioQuantization ErrorCompute Signal to Noise RatioExampleData RatesComparison and Sampling/Coding TechniquesSummaryCS 414 - Spring 2011CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio RepresentationKlara NahrstedtSpring 2011CS 414 - Spring 2011Administrative Form Groups for MPs Deadline January 24 (today!!) to email TAKey QuestionsHow can a continuous wave form be converted into discrete samples?How can discrete samples be converted back into a continuous form?CS 414 - Spring 2011Characteristics of SoundAmplitudeWavelength (w)Frequency ( )TimbreHearing: [20Hz – 20KHz] Speech: [200Hz – 8KHz]Digital Representation of AudioMust convert wave form to digitalsamplequantizecompressSampling (in time)Measure amplitude at regular intervalsHow many times should we sample?CS 414 - Spring 2011Nyquist TheoremFor lossless digitization, the sampling rate should be at least twice the maximum frequency response. In mathematical terms:fs > 2*fmwhere fs is sampling frequency and fm is the maximum frequency in the signalCS 414 - Spring 2011Nyquist Limit max data rate = 2 H log2V bits/second, whereH = bandwidth (in Hz)V = discrete levels (bits per signal change)Shows the maximum number of bits that can be sent per second on a noiseless channel with a bandwidth of H, if V bits are sent per signalExample: what is the maximum data rate for a 3kHz channel that transmits data using 2 levels (binary) ? Solution: H = 3kHz; V = 2; max. data rate = 2x3,000xln2=6,000bits/second CS 414 - Spring 2011Limited SamplingBut what if one cannot sample fast enough?CS 414 - Spring 2011Limited SamplingReduce signal frequency to half of maximum sampling frequencylow-pass filter removes higher-frequenciese.g., if max sampling frequency is 22kHz, must low-pass filter a signal down to 11kHzCS 414 - Spring 2011Sampling RangesAuditory range 20Hz to 22.05 kHzmust sample up to to 44.1kHzcommon examples are 8.000 kHz, 11.025 kHz, 16.000 kHz, 22.05 kHz, and 44.1 KHzSpeech frequency [200 Hz, 8 kHz]sample up to 16 kHz but typically 4 kHz to 11 kHz is used CS 414 - Spring 2011CS 414 - Spring 2011QuantizationCS 414 - Spring 2011QuantizationTypically use8 bits = 256 levels16 bits = 65,536 levelsHow should the levels be distributed?Linearly? (PCM)Perceptually? (u-Law)Differential? (DPCM)Adaptively? (ADPCM)CS 414 - Spring 2011Pulse Code ModulationPulse modulationUse discrete time samples of analog signalsTransmission is composed of analog information sent at different timesVariation of pulse amplitude or pulse timing allowed to vary continuously over all valuesPCMAnalog signal is quantized into a number of discrete levelsCS 414 - Spring 2011PCM Quantization and DigitizationCS 414 - Spring 2011DigitizationQuantizationLinear Quantization (PCM)Divide amplitude spectrum into N units (for log2N bit quantization)Sound IntensityQuantization IndexCS 414 - Spring 2011Non-uniform QuantizationCS 414 - Spring 2011Perceptual Quantization (u-Law)Want intensity values logarithmically mapped over N quantization units Sound IntensityQuantization IndexCS 414 - Spring 2011Differential Pulse Code Modulation (DPCM)What if we look at sample differences, not the samples themselves?dt = xt-xt-1Differences tend to be smallerUse 4 bits instead of 12, maybe?CS 414 - Spring 2011Differential Quantization (DPCM)Changes between adjacent samples smallSend value, then relative changesvalue uses full bits, changes use fewer bitsE.g., 220, 218, 221, 219, 220, 221, 222, 218,.. (all values between 218 and 222)Difference sequence sent: 220, +2, -3, 2, -1, -1, -1, +4....Result: originally for encoding sequence 0..255 numbers need 8 bits; Difference coding: need only 3 bitsCS 414 - Spring 2011Adaptive Differential Pulse Code Modulation (ADPCM)Adaptive similar to DPCM, but adjusts the width of the quantization stepsEncode difference in 4 bits, but vary the mapping of bits to difference dynamicallyIf rapid change, use large differencesIf slow change, use small differencesCS 414 - Spring 2011Signal-to-Noise RatioCS 414 - Spring 2011Signal To Noise RatioMeasures strength of signal to noiseSNR (in DB)= Given sound form with amplitude in [-A, A]Signal energy = )(log1010energyNoiseenergySignalA0-A22ACS 414 - Spring 2011Quantization ErrorDifference between actual and sampled valueamplitude between [-A, A]quantization levels = Ne.g., if A = 1,N = 8, = 1/4 NA2CS 414 - Spring 2011Compute Signal to Noise RatioSignal energy = ; Noise energy = ; Noise energy = Signal to noise =Every bit increases SNR by ~ 6 decibels122NA2223 NA23log102N22ACS 414 - Spring 2011ExampleConsider a full load sinusoidal modulating signal of amplitude A, which utilizes all the representation levels providedThe average signal power is P= A2/2The total range of quantizer is 2A because modulating signal swings between –A and A. Therefore, if it is N=16 (4-bit quantizer), Δ = 2A/24 = A/8The quantization noise is Δ2/12 = A2/768The S/N ratio is (A2/2)/(A2/768) = 384; SNR (in dB) 25.8 dBCS 414 - Spring 2011Data Rates Data rate = sample rate * quantization * channelCompare rates for CD vs. mono audio8000 samples/second * 8 bits/sample * 1 channel= 8 kBytes / second44,100 samples/second * 16 bits/sample * 2 channel = 176 kBytes / second ~= 10MB / minuteCS 414 - Spring 2011Comparison and Sampling/Coding TechniquesCS 414 - Spring 2011Summary CS 414 - Spring

View Full Document