You are on page 1of 8

Speech Processing Using Multirate Dsp

M.SATISH T,VENKATESWARA SWAIN


¾ ECE, GVPCOE ¾ ECE, GVPCOE
MADHURAWADA MADHURAWADA
VISAKHAPATNAM-41. VISAKHAPATNAM-41.
satish.smartguy@gmail.com

1. Abstract
In conventional speech processing be formed from FIR or IIR filters. The
applications, speech signal is encoded aim of the paper is to design a QMF
using fixed number of bits over the filter and then pass a speech signal
entire speech signal band. During the through it. The low pass filtered signal is
process, the bandwidth requirement for decimated and encoded with more
speech transmission is relatively high number of bits and high pass filtered
which is of concern. The QMF signal is also decimated and encoded
(Quadrature Mirror Filter) banks are the with less number of bits. These two bit
fundamental building blocks for spectral streams are multiplexed and transmitted.
splitting. The MF structure allows In receiver side the received signal is de-
spectral decomposition into contiguous multiplexed and decoded. The signal is
overlapping sub bands in such a way that passed through the interpolators and then
aliasing incurred in the initial “analysis” through the synthesis filter so as to
stage is eliminated during signal reconstruct the speech signal .The
reconstruction by the “synthesis” stage. reconstructed signal is compared with
The technique is developed to design the the original speech signal.
so-called perfect reconstruction QMF 2. Introduction
bank, which allows complete elimination In practice we often encounter signals
of amplitude and phase distortion of the where most of the energy content, which
reconstructed signal. A QMF bank can is important to us, is present in a
particular frequency band. One of the sub band is encoded separately that is
best examples for this type of signals is a more number of bits are allocated to sub
speech signal. In speech signals most of bands containing more information and
the energy is present in the lower less number of bits are allocated to sub
frequency bands. Coding the complete bands containing less information. In
signal that is by allocating same number this paper we are implementing a
of bits for the entire signal is not an subband coding system using QMF
efficient way of coding the signal for (Quadrature Mirror Filter) Banks. There
either transmission or storage. Signal are other coding techniques by which we
coding is the act of transforming the can code the signal and convert it into a
signal at hand to a more compact form, compact form, which can be later stored
which can then be transmitted with or transmitted. But coding speech signals
considerably smaller memory. The using QMF Banks is a very popular
motivation behind this is the fact that technique by which we can achieve very
access to the unlimited amount of efficient results. Practically we can
bandwidth, which is not possible. implement a subband coding system
Therefore there is a need to code and effectively and efficiently using the so-
compress speech signals. By taking called perfect reconstruction QMF
advantage of the fact that most of the Banks. In QMF Banks the filters are
energy is present in a particular designed in such a way that the aliasing
frequency band we can split the signal that occurs in the analysis section is
into various bands depending on the completely eliminated in the synthesis
information content and then code the section. Thus by eliminating aliasing we
subband signals separately. One of the are able to reconstruct the signal with a
most popular techniques using this high accuracy.
concept for coding signals is SubBand 3.Multi rate signal processing system
coding. SubBand Coding is a method The basic theory of multirate digital
used for coding of signals where the signal processing is introduced in this
signal is initially split into a number of section along with the two Sampling rate
sub bands depending on the information alteration devices namely Upsampler
content. After splitting the signal each and Downsampler . In many practical
applications where the signal of a given the basic sampling rate alteration devices
sampling rate needs to be converted into are invariably employed together with
an equivalent signal with a different low pass digital filters. An up-sampler is
sampling rate. For example, in digital a device, which increases the sampling
audio, three different sampling rates are rate by an integer factor. The up-sampler
presently employed: 32 kHz in broad is also called as a sampling rate
casting, 44.1 kHz in digital compact disk expander or simply expander. The block
and 48 kHz in digital audio tape (DAT) diagram of up-sampler is x[n] xu [n]
and other applications. Thus conversion
of sampling rates of audio signals
between these three different rates is
often necessary in many situations. The Figure 3.1 Block diagram of up-sampler
Discrete–time systems with unequal Down sampler:
sampling rates at various parts of the
system are called Multirate systems.
Unlike in single rate systems the
Figure 3.2 Block diagram of down-
sampling rates at the input and at the
sampler
output and all the internal nodes are the
A down-sampler is a device which
same. To achieve different sampling
reduces the sampling rate by an integer
rates at different stages, multirate digital
factor. The down-sampler is also called
signal processing systems employ the
as sampling rate compressor. The block
downsampler and the up-sampler, the
diagram of down-sampler is x[n] y[n]
basic sampling rate alteration devices in
addition to the conventional elements
4. Subband coding
such as the adder, the multiplier and the
delay. Many multirate systems employ a
bank of filters with either a common
input or a summed output. The two basic
components in sampling rate alteration
are the up-sampler and the down-
sampler. For sampling rate alterations,
Sub Band Coding (SBC) is a frequency number of samples coded and
domain coding technique in which the transmitted does not exceed the number
input signal is decomposed into a of samples in the original signal since
number of sub bands so that each of this number is necessary and sufficient
these frequency bands can be encoded for the recovery of the original signal.
separately. This technique was originally Under this constraint and in the absence
proposed by Crochiere, Webber and of the channel coders, the overall system
Flanagan as a means to reduce the effect response indicates the quality of the
of quantizing noise due to coding and system. Ideally, the filtering part of the
therefore to improve the quality of system must be reversible, i.e. the
speech coding systems. Encoding in sub overall system response must be a pure
bands offers several advantages that can delay so that the input signal can be
be effectively used to achieve noise perfectly reconstructed at the receiver.
reduction. However, in general reversibility cannot
In the sub band-coding system the input be achieved and sub band-coding
signal, after being sampled at its Nyquist systems suffer from three different types
rate, is divided into channels by first of distortion, interband aliasing,
being passed through a bank of low pass amplitude distortion and phase
and high pass filters. The output of each distortion. Clearly the quality of the
filter is decimated to a rate determined reconstructed signal can be no better
by the number of sub bands and then than the quality of the system response.
each of these channel outputs are On top of that, the quality of the
encoded separately. At the receiver the reconstructed signal degrades further, if
signals, after being decoded, are coders are introduced to the channels.
interpolated back to the original Over the past several years, a number of
sampling rate by a bank of interpolation sub band coding systems have been
filters and then are summed to introduced in an attempt to minimize or
reconstruct the input signal. It is remove the three types of distortion
important that in subband coding mentioned above as well as to minimize
systems the individual channel signals the overall number of computations
be decimated in such a way that the needed for the implementation of these
systems. The original sub band coding have no frequency selectivity. Johnston
system which was presented by though, by using an iterative approach,
Crochiere, Webber and Flanagan, used designed a number of linear phase FIR
finite impulse response (FIR) filters and filters, which produce minimum
the overall response of the system amplitude distortion in the over all
suffered from aliasing and amplitude system response
distortion as well as distortion due to
coding. In a later work presented by
Crochiere, infinite impulse response
(IIR) elliptic filters were used. These
Block diagram of 2-channel QMF
filters introduce, to some degree, phase
bank
distortion as well. Croisier, Esteban and
5. Two channel qmf bank
Galand in their work managed to remove
In many applications, a discrete-time
the interband aliasing by introducing the
signal x[n] is first split into a number of
concept of quadrature mirror filters
sub band signals by means of an analysis
(QMF) to realize a two-band splitting
filter bank; the sub band signals are the
analysis/reconstruction system. The
sub band signals are then processed and
input signal could be divided into more
finally combined by a synthesis filter
sub bands by using this two band
bank resulting in an output signal y[n].If
splitting system in a tree-structure. It
the sub band signals are band limited to
was also shown that if equal length,
frequency ranges much smaller than that
linear phase, finite impulse response
of the original input signal, they can be
(FIR) quadrature mirror filters (QMF)
downsampled before processing.
are used; phase distortion is also
Because of the lower sampling rate, the
eliminated leaving only the amplitude
processing of the down-sampled signals
distortion.
can be carried out efficiently. After
Analysis Section
processing, these signals are upsampled
The amplitude distortion cannot be
before being combined by the synthesis
removed by using linear phase FIR-
bank into a higher-rate signal. The
QMF sub band splitting, except for the
combined structure employed is called a
trivial case in which the resulting filters
Quadrature-mirror filter (QMF) bank. If
the down-sampling and up-sampling encoded by exploiting the special
factors are equal to or greater than the spectral properties of the signal, such as
number of bands of the filter bank, then energy levels and perceptual importance.
the output y[n] can be made retain some The coded sub-band signals are
or all of the characteristics of the input combined into one sequence by
multiplexing and either stored for later
retrieval or transmitted. At the receiving
end, the coded sub-band signals are first
recovered by demultiplexing and
decoders are used to produce

Figure 5.2 Frequency Response


approximations of the original down-

Characteristics of QMF Bank sampled signals. The decoded signals are


x[n] by properly choosing the filters in then up-sampled by a factor of 2 and
the structure. The two channel passed through the synthesis filter bank
Quadrature Mirror Filter (QMF) bank is composed of the low pass and high pass
multirate digital filter structure that filters whose frequency responses are
employs two down- samplers in the F0(z) and F1(z) whose outputs are then
“signal analysis” section and two added yielding y[n]. It follows from the
upsamplers in the “signal synthesis” figure that the sampling rates of the
section. The input signal x[n] is first input signal x[n] and output signal y[n]
passed through a two-band analysis filter are the same. The analysis and the
bank containing the low pass and high synthesis filters in the QMF bank are
pass filters with frequency responses chosen so as to ensure that the
H0(z) and H1(z) .Their corresponding reconstructed output y[n] is a reasonable
impulse responses are h0(n) and h1(n) replica of the input x[n]. Moreover, they
respectively, with a cutoff frequency at are also designed to provide good
π/2, as shown in the fig. The frequency frequency selectivity in order to ensure
response characteristics of QMF bank that the sum of the power of the sub-
The sub-band signals v0 (n) and v1 (n) are band signals is reasonably close to the
then down-sampled by a factor of 2.Each input signal power. In practice, various
down-sampled subband signal is errors are generated in this scheme. In
addition to the coding error and errors characteristic, which results in perfect
caused by transmission through the reconstruction of the input speech signal.
channel, the QMF bank itself introduces 6.1 Input speech signal and its
several errors due to sampling rate specifications:
alterations and imperfect filters. We The speech signal on which sub-band
ignore the coding and channel errors, coding is to be performed is given as an
and investigate only the errors generated input to the QMF bank, which was
by the down-samplers and up-samplers discussed in the previous chapters. For
in the filter bank and their effects on the this we recorded a speech signal using
performance of the system. the tool sound recorder i.e available in
In this chapter we efficiently designed an the windows with the following
FIR filter for implementing the specifications
quadrature mirror filter bank with
minimum possible error.
Results and Conclusion
We have successfully implemented the
sub-band coding system by designing an
optimum four channel QMF bank. The
frequency response characteristics of
LPF and HPF used in QMF bank are as
given in fig 6.1: From the above
Figure 6 .1 Frequency Response of
characteristics it is seen that, the
QMF Bank
response of the QMF filter is almost
approaching the ideal all-pass filter
Format: PCM QMF bank for sub-band coding of input
Attributes: 8 kHz, 8 bit, Mono speech signal. The result shows that the
The recorded speech signal is of two output is a perfect reconstruction of the
seconds duration with a length of 21600 input speech signal.
samples. The speech signal is sampled
with a sampling frequency of 8 kHz and References
coded with 8 bits per sample. The input 1. ”Digital Signal Processing (Principles,
speech waveforms and output Algorithms and Applications) ”by John
reconstructed speech waveforms are G.Proakis and Dimitris G.Manolakis.
shown below: 2. Digital Signal Processing, A.
From these waveforms we can observe Oppenheim & R. Schafer, (Prentice-
that there is a delay of 31 samples Hall, 1975, ISBN 0-13-214635- 5).
between the input and output speech, 3. P.P. Vaidyanathan. Multirate Systems
which is equal to N-1 (where N=32 is and Signal rocessing. Prentice-Hall,
the length of the filter). The data rate Englewood Cliffs, NJ, 1993.
reduction depends upon the number of 4. S.K. Mitra. Digital Signal Processing
bits allocated for low-pass and high-pass – A Computer-Based Approach. Mc
sections. Graw-Hill, New York, 2 edition, 2001.
6.2 Conclusion 5. [Chi, et al.] Chi, T., Gao, Y., Guyton,
The intention of this work is to design M., Ru, P., and Shamma S.A. “Spectro-
and implement a SUBBAND CODING Temporal Modulation Transfer
system. We have successfully designed Functions and Speech Intelligibility”.
an optimum low pass filter for four 6. [Sinha and tewfik, 1993] Sinha, D.
channel QMF Bank to minimize the and Tewfik, A. (Dec. 1993) “Low bit
amplitude distortion. From this Low pass rate transparent audio compression using
filter we have designed a High pass adapted wavelets,” IEEE Transactions
Filter. Using these filters we have on Signal Processing 4(12) pp. 3463-
successfully simulated a two channel 3479.

You might also like