Audio Compression by Using Wavelet
ISSN (e): 2250-3021, ISSN (p): 2278-8719
PP 32-36
Abstract: Audio compression is one of the basic technologies of the modern telecommunication age.
Compression is the technique of converting a high-rate input data stream into one of smaller size. Audio coding
represents audio in digital form with as few bits as possible while maintaining quality, and the reduction in bit
rate conserves bandwidth. Audio coding is used in applications such as digital broadcasting, high-quality
audio for satellite transmission, internet audio, and music databases, where the bit rate of high-quality audio
signals is reduced without compromising the quality of the signal.
This paper presents the design and implementation of audio compression using the discrete wavelet
transform (DWT). The performance of the audio encoding methods is measured using compression ratio,
signal-to-noise ratio (SNR), and peak signal-to-noise ratio (PSNR).
I. Introduction
The revolution in computing has led to a growing demand for high-quality audio data, but the data
rates associated with uncompressed audio signals are massive. The product of the sampling rate and the number
of bits per sample is known as the bit rate. The difference between the bit rate and the information rate of the
signal is known as redundancy. Audio compression techniques work to reduce this redundancy without
affecting the quality of the audio signal. They have found applications in areas such as multimedia signal
coding, high-fidelity audio for radio broadcasting, audio transmission for HDTV, and audio data transmission
and sharing over the internet.
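The bit rate and redundancy definitions above can be made concrete with a short calculation. The sketch below is illustrative only: the `channels` factor, the CD and telephone parameters, and the 16 kbit/s information rate are assumptions chosen for the example, not figures from this paper.

```python
def bit_rate(sample_rate_hz, bits_per_sample, channels=1):
    """Uncompressed PCM bit rate in bits per second.

    The channels factor is an assumption added for multi-channel audio;
    the text defines bit rate as sampling rate times number of bits.
    """
    return sample_rate_hz * bits_per_sample * channels


def redundancy(bit_rate_bps, information_rate_bps):
    """Redundancy as the difference between bit rate and information rate."""
    return bit_rate_bps - information_rate_bps


# Illustrative parameters (not taken from the paper).
cd_stereo = bit_rate(44_100, 16, channels=2)
telephone = bit_rate(8_000, 8)

print(cd_stereo)  # 1411200
print(telephone)  # 64000
# Hypothetical information rate of 16 kbit/s for the telephone signal.
print(redundancy(telephone, 16_000))  # 48000
```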
Sheetal D. Gunjal and Rajeshee D. Raut (2015), in "Traditional Psychoacoustic model and daubechies
wavelets for enhanced speech coder performance", described the dependencies among compression ratio, SNR,
and decomposition level, showing that the increase in compression ratio is limited by the SNR value [1].
Othman O. Khalifa, Sering Habib Harding and Aisha-Hassan A. Hashim described a method in which the audio
signal is compressed using wavelets and the reconstructed signals are compared using Signal to Noise Ratio
(SNR), Peak Signal to Noise Ratio (PSNR), Normalized Root Mean Square Error (NRMSE), and compression
ratio for different wavelet decomposition levels [6]. P. Srinivasan and L. H. Jamieson, in "High Quality Audio
Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modelling", described a
wavelet packet-based compression scheme suitable for high-quality audio transfer over the internet or for
storage [5].
Two basic types of technique are used to reduce the coded bit rate. In the first type, some form
of digital encoding is applied on the basis of statistical redundancy. This is lossless audio coding, in which the
original audio signal remains unharmed and can be fully recovered. In the second type, signal processing is
used to separate the unwanted components of the signal from the information-bearing components. The
recovered audio signal is not identical to the original signal, so this technique is lossy.
A) Transform Technique:
The discrete wavelet transform (DWT) is used for audio signal compression. The wavelet transform is well
suited to audio compression: the DWT uses a multiresolution technique to analyze different frequencies at
different resolutions.
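The multiresolution idea can be sketched with a multi-level DWT. To keep the example dependency-free, the code below swaps in the Haar wavelet (the simplest Daubechies wavelet) rather than the longer Daubechies filters evaluated in this paper; the function names `haar_step`, `haar_dwt` and `haar_idwt` and the sample values are hypothetical, introduced only for illustration.

```python
import math


def haar_step(signal):
    """One analysis level: approximation (low-pass) and detail (high-pass) halves."""
    s = 1 / math.sqrt(2)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_dwt(signal, levels):
    """Return [approx_L, detail_L, ..., detail_1] for an L-level decomposition."""
    if len(signal) % (2 ** levels) != 0:
        raise ValueError("signal length must be divisible by 2**levels")
    approx = list(signal)
    details = []
    for _ in range(levels):
        # Each level re-analyzes only the low-frequency (approximation) band.
        approx, detail = haar_step(approx)
        details.insert(0, detail)
    return [approx] + details


def haar_idwt(coeffs):
    """Invert haar_dwt by merging approximation and detail bands level by level."""
    s = 1 / math.sqrt(2)
    approx = list(coeffs[0])
    for detail in coeffs[1:]:
        rebuilt = []
        for a, d in zip(approx, detail):
            rebuilt.append((a + d) * s)
            rebuilt.append((a - d) * s)
        approx = rebuilt
    return approx


samples = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
coeffs = haar_dwt(samples, levels=2)
restored = haar_idwt(coeffs)

print([len(band) for band in coeffs])  # [2, 2, 4]
print(max(abs(a - b) for a, b in zip(samples, restored)) < 1e-9)  # True
```

Each additional level halves the length of the approximation band, which is why deeper decompositions concentrate the signal into fewer coefficients.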
B) Quantization and coding of transform coefficients:
If the amount of information conveyed by each coefficient is different, it makes sense to assign differing
numbers of bits to the different coefficients. Quantization converts a discrete-time, continuous-amplitude
signal into a discrete-time, discrete-amplitude signal by rounding each sample to the nearest quantization
level. Each discrete-time, discrete-amplitude value is then represented by a finite number of digits using a
coder.
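Rounding to the nearest quantization level can be sketched as a uniform quantizer. This is a minimal illustration, not the quantizer used in the paper: the step size of 0.25 and the coefficient values are assumptions, and a coder that assigns different bit counts per coefficient could use a different step per subband.

```python
import math


def quantize(values, step):
    """Map each amplitude to the integer index of its nearest quantization level."""
    # floor(x + 0.5) rounds halves upward, avoiding Python's round-half-to-even.
    return [math.floor(v / step + 0.5) for v in values]


def dequantize(indices, step):
    """Recover the quantization level that each index stands for."""
    return [i * step for i in indices]


step = 0.25  # assumed step size; a smaller step costs more bits but less error
coefficients = [0.12, -0.47, 0.90, 0.03]

indices = quantize(coefficients, step)
levels = dequantize(indices, step)

print(indices)  # [0, -2, 4, 0]
print(levels)   # [0.0, -0.5, 1.0, 0.0]
```

The reconstruction error of a uniform quantizer is bounded by half the step size, which is the source of the loss in a lossy coder.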
C) Encoding:
Encoding techniques reduce the number of coefficients by removing redundant data. The original signal can
be reconstructed from the compressed audio signal by decoding, followed by de-quantization and then the
inverse transform.
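The text does not name a specific coder, so the sketch below uses run-length coding as a stand-in to show how redundancy in quantized coefficients (typically long runs of zeros) can be removed and then restored by decoding. The function names and sample data are hypothetical.

```python
def rle_encode(symbols):
    """Collapse consecutive repeats into (value, run_length) pairs."""
    runs = []
    for s in symbols:
        if runs and runs[-1][0] == s:
            runs[-1][1] += 1
        else:
            runs.append([s, 1])
    return [(value, count) for value, count in runs]


def rle_decode(runs):
    """Expand (value, run_length) pairs back into the original sequence."""
    out = []
    for value, count in runs:
        out.extend([value] * count)
    return out


# Quantized coefficients often contain long zero runs after small values round to 0.
quantized = [5, 0, 0, 0, 0, -2, 0, 0, 0, 0, 0, 1]
encoded = rle_encode(quantized)

print(encoded)  # [(5, 1), (0, 4), (-2, 1), (0, 5), (1, 1)]
print(rle_decode(encoded) == quantized)  # True
```

This stage is lossless on its own; in the full chain, decoding is followed by de-quantization and the inverse transform.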
III. Result
3.1 Compression Ratio:
The compression ratio (CR) is defined as the ratio of the length of original signal to the length of the
compressed signal.
$$\text{Compression Ratio} = \frac{\text{Length of original signal}}{\text{Length of compressed signal}}$$
For the signal-to-noise ratio (SNR), $\sigma_x^2$ is the mean square of the speech signal and $\sigma_e^2$ is
the mean square difference between the original and reconstructed signals.
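The performance measures can be sketched in a few lines. The SNR expression itself does not survive in this text, so the code assumes the standard form 10·log10(σx²/σe²), which matches the two quantities defined above. The PSNR form used here (squared peak amplitude of the original over σe²) is likewise an assumption, since the paper's PSNR definition is not given; the sample values are illustrative.

```python
import math


def mean_square(values):
    return sum(v * v for v in values) / len(values)


def compression_ratio(original_length, compressed_length):
    """Length of the original signal divided by length of the compressed signal."""
    return original_length / compressed_length


def snr_db(original, reconstructed):
    """Assumed standard form: 10*log10(sigma_x^2 / sigma_e^2)."""
    sigma_x2 = mean_square(original)
    sigma_e2 = mean_square([x - r for x, r in zip(original, reconstructed)])
    if sigma_e2 == 0:
        return math.inf  # perfect reconstruction
    return 10 * math.log10(sigma_x2 / sigma_e2)


def psnr_db(original, reconstructed):
    """Assumed form: 10*log10(peak^2 / sigma_e^2), peak = max |original sample|."""
    peak = max(abs(x) for x in original)
    sigma_e2 = mean_square([x - r for x, r in zip(original, reconstructed)])
    if sigma_e2 == 0:
        return math.inf
    return 10 * math.log10(peak * peak / sigma_e2)


original = [1.0, -1.0, 1.0, -1.0]
reconstructed = [0.9, -1.1, 1.1, -0.9]

print(compression_ratio(16_000, 4_000))           # 4.0
print(round(snr_db(original, reconstructed), 2))  # 20.0
print(round(psnr_db(original, reconstructed), 2)) # 20.0
```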
[Figure: PSNR for Daubechies wavelets DB02, DB04, DB06, DB08 and DB10; vertical axis from 0 to 20.]
[Figure: Compression ratio (C.R.) at decomposition levels 1 to 5 for Daubechies wavelets DB02, DB04, DB06, DB08 and DB10; vertical axis from 0 to 60.]
IV. Conclusion
Selecting the Daubechies wavelet family with the DWT yielded a comparable improvement in the
performance parameters, with good-quality reconstruction of the speech signal. The compression factor (CF)
improves at the cost of SNR as the decomposition level increases. At levels 3, 4 and 5 the variation in CF and
SNR is much more consistent. Level 3 can be selected for good performance with a moderate number of filter
banks. Adaptive filter banks can be used in combination with a model for more effective coding. In addition,
with wavelets the compression ratio can be varied easily, whereas most other compression techniques have
fixed compression ratios.
The discrete wavelet transform performs very well in the compression of audio signals. For real-time
audio processing, however, its performance is not as good. For real-time audio coding it is therefore
recommended to use a wavelet with a small number of vanishing moments at decomposition level 5 or less.
References
[1]. Sheetal D. Gunjal, Rajeshee D. Raut, 2015. Traditional Psychoacoustic model and daubechies wavelets for enhanced speech
coder performance: International Journal of Technology (2015) 2: 190-197 ISSN 2086-9614
[2]. Sheetal D. Gunjal, Rajeshee D. Raut, 2012. Advance Source Coding Techniques for Audio/Speech Signal: A Survey,
International Journal for Computer Technology and Applications, Volume 3(4), pp. 1335-1342
[3]. M. V. Patil, Apporva Gupta, Ankita Verma and Shikhar Salil, “Audio and Speech Compression Using DCT and DWT
Techniques”, IJIRSET International Journal of Innovative Research in Science, Engineering and Technology, Vol. 2, Issue 5,
May 2013.
[4]. Harmanpreet Kaur and Ramanpreet Kaur, “Speech compression and decompression using DCT and DWT”, International Journal
Computer Technology & Applications (IJCTA), Vol. 3 (4), pp. 1501-1503, July-August 2012
[5]. P. Srinivasan and L. H. Jamieson. “High Quality Audio Compression Using an Adaptive
Wavelet Packet Decomposition and Psychoacoustic Modelling”, IEEE Transactions on Signal Processing, Vol. 46, No. 4, April
1998
[6]. Othman O. Khalifa, Sering Habib Harding and Aisha-Hassan A. Hashim, “Compression using Wavelet Transform”, Signal
Processing: An International Journal, Volume (2):
[7]. Khalid Sayood. "Introduction to Data Compression." 4th Edition. Elsevier
[8]. S.G. Mallat. "A Wavelet Tour of Signal Processing." 2nd Edition. Academic Press, 1999. ISBN 0-12-466606-X