DSP Processors (Gowri Modified)

DSP Processors
UNIT-V
Multiplier
Figure 4.1(a) The 4x4 binary multiplication
Figure 4.1(b) The structure of a 4x4 Braun multiplier
MAC using DSP

11
12
3
11
24
R2
44
2
3
Clr
;Clear Accumulator A
Rep
; Rep N times the next instruction
MAC
*(R0)+, *(R1)+, A
; Fetch the two memory locations pointed by R0 and

R1, multiply them together and add the result to A, the
final result is stored back in A
Mov
A, *R2
; Move result to memory
Figure 4.4 A MAC unit
Innumerical analysis, one or moreguard digitscan be used to

reduce the amount ofroundoferror.
For example, suppose that the final result of a long, multi-step
calculation can be safely rounded of toNdecimal places. That is to
say, the roundof error introduced by this final roundof makes a
negligible contribution to the overall uncertainty.
However, it is quite likely that it isnotsafe to round of the
intermediate steps in the calculation to the same number of digits.
Be aware that roundof errors can accumulate. IfMdecimal places
are used in the intermediate calculation, we say there
areMNguard digits.
Guard digits are also used in floating point operations in most
computer systems.
Figure 4.5 A MAC unit with accumulator guard

bits
Von Neumann architecture
Figure 4.8(a) The bus structure of von Neumann

architecture(The bus structure of von Neumann
architecture (Related matter from ramesh babu 5th
edition page no 11.8) matter from ramesh babu 4th
also
there Neumann modelandPrinceton
Thevon Neumann architecture, alsoedition
known as
thevon
architecture, is acomputer architecturebased on that described in 1945 by the

mathematician and physicistJohn von Neumannand others.
The design of a von Neumann architecture is simpler than the more modern
Harvard architecturewhich is also a stored-program system but has one dedicated set of
address and data buses for reading data from and writing data to memory, and another set of
Harvard architecture
Figure 4.8(b) The bus structure of Harvard architecture((Related

matter from ramesh babu 5th edition page no 11.9)
TheHarvard architectureis
acomputer
physically
separate
matter from
ramesharchitecturewith
babu 4th edition
also there
storageand signal pathways for instructions and data. The term originated from the
Harvard Mark Irelay-based computer, which stored instructions onpunched tape(24
bits wide) and data in electro-mechanical counters. These early machines had data
storage entirely contained within thecentral processing unit, and provided no access
to the instruction storage as data. Programs needed to be loaded by an operator; the
processor could notinitializeitself.
Contrast with von Neumann architectures

Under purevon Neumann architecturetheCPUcan be either reading an
instruction or reading/writing data from/to the memory. Both cannot occur at the
same time since the instructions and data use the same bus system. In a
computer using the Harvard architecture, the CPU can both read an instruction
and perform a data memory access at the same time,even without a cache.
A Harvard architecture computer can thus be faster for a given circuit
complexity because instruction fetches and data access do not contend for a
single memory pathway.
Also, a Harvard architecture machine has distinct code and data address spaces:
instruction address zero is not the same as data address zero. Instruction
address zero might identify a twenty-four bit value, while data address zero
might indicate an eight-bit byte that is not part of that twenty-four bit value
Amodified Harvard architecturemachine is very

much like a Harvard architecture machine, but it
relaxes the strict separation between instruction
and data while still letting the CPU concurrently
access two (or more) memory buses. The most
common modification includes separate instruction
and datacachesbacked by a common address
space. While the CPU executes from cache, it acts
as a pure Harvard machine. When accessing
backing memory, it acts like a von Neumann
machine (where code can be moved around like
data, which is a powerful technique). This
modification is widespread in modern processors,
such as theARM architectureandx86processors.
It is sometimes loosely called a Harvard
architecture, overlooking the fact that it is actually
"modified.
Figure 4.8(c) The bus structure for the architecture with one program
memory and two data memories
GPP(General Purpose Processor) Data Path

Only
Memory Data
Bus
Memory
Register 1
A GPP unlike a Single Purpose

Processor can accomplish various
tasks via programs written in an
Instruction Set that the
microprocessor can recognize.
Most processors are built from a
controller, dataptath and memory
ALU
Same memory for program and

data
11
Register 2
Digital Signal Processors Data Path Only

Program Memory Data
Bus
Data Memory Data
Bus
Program
Memory
Data
Memory
A DSP Chip is a
microprocessor specially
designed for DSP
applications
Harvard architecture allows
multiple memory reads
Architecture optimized to
provide rapid processing of
discrete time signals, e.g.
Multiply and Accumulate
(MAC) in one cycle
12
Multiplexe
r
Multiplexe
r
ALU
Accumulator
Memory structures
13
PIPELINING :
Pipelining is a technique which follows two or more operations to
overlap during execution.
In pipelining, a task is broken down into a number of distinct
subtasks which are then overlapped during execution. It is used
extensively in digital signal processors to increase speed.
An instruction can be broken down into three steps. Each step in
the instruction can be regarded as a stage in a pipeline and so can
be overlapped. By overlapping the instructions , a new instruction is
started at the start of each clock cycle.
The figure shows the timing diagram for a three-stage pipeline ,
drawn to highlight the instruction steps. Typically , each step in the
pipeline takes one machine cycle. Thus, during a given cycle up to
three diferent instructions may be active at the same time ,
although each will be at the diferent stage of completion.
The key to an instruction pipeline is that three parts of the
instruction (that is fetch, decode and execute) are independent and
so the execution of multiple instructions can be overlapped.
It is seen that at the ith cycle , the processor could be
simultaneously fetching the ith instruction , decoding the (i-1)th
Figure 4.13 Pipelining for speeding up the execution of an instruction
Types of DSP
Low End Fixed Point
TMS320C2XX, ADSP21XX, DSP56XXX
High End Fixed Point

TMS320C55XX, DSP16XXX,
ADSP215XX, DSP56800
Floating Point
TMS320C3X, C67XX, ADSP210XX, DSP96000,
DSP32XX
17
Figure 3.1(a) Fixed-point format to represent signed integers
Figure 3.1(b) Fixed-point format to represent signed fractions
Figure 3.2 IEEE 754 format for floating-point numbers
Figure 3.3 (a) An A/D converter with b bits for signal representation,
(b) quantization model for the A/D converter
Figure 3.3 (c) quantization error in truncation A/D converter
Figure 3.3 (d) quantization error in rounding A/D converter, (e)

probability density function for truncation error, (f) probability
density function for rounding error
Figure 3.4 An example showing the D/A converter error due to the
zero-order hold at its output: (a) DSP output, (b) D/A output,
Figure 3.4 An example showing the D/A converter error due to the
zero-order hold at its output: (c) the convolving pulse that generates
(b) from (a), (d) frequency contents of the convolving pulse in (c)
Fixed Point Vs Floating

Point
Fixed Point/Floating Point
fixed point processor are :
cheaper
smaller
less power consuming
Harder to program
Watch for errors: truncation, overflow, rounding
Limited dynamic range

Used in 95% of consumer products
floating point processors

have larger accuracy
are much easier to program
can access larger memory
It is harder to create an efficient program in C on a fixed

point processors than on floating point processors
26
Fixed Point Vs Floating

Point
Floating Point
Fixed Point
Applications
Applications
Modems
Portable Products
Digital Subscriber Line (DSL)
2G, 2.5G and 3G Cell Phones
Wireless Basestations
Digital Audio Players
Central Office Switches
Digital Still Cameras
Private Branch Exchange (PBX)
Electronic Books
Digital Imaging
Voice Recognition
3D Graphics
GPS Receivers
Speech Recognition
Headsets
Voice over IP
Biometrics
Fingerprint Recognition
27

DSP Processors (Gowri Modified)

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

DSP Processors (Gowri Modified)

Uploaded by

Copyright:

Available Formats

DSP Processors

Figure 4.1(a) The 4x4 binary multiplication

Figure 4.1(b) The structure of a 4x4 Braun multiplier

MAC using DSP

; Rep N times the next instruction

; Fetch the two memory locations pointed by R0 and

; Move result to memory

Figure 4.4 A MAC unit

Innumerical analysis, one or moreguard digitscan be used to

Figure 4.5 A MAC unit with accumulator guard

Von Neumann architecture

Figure 4.8(a) The bus structure of von Neumann

architecture, is acomputer architecturebased on that described in 1945 by the

Figure 4.8(b) The bus structure of Harvard architecture((Related

Contrast with von Neumann architectures

Amodified Harvard architecturemachine is very

GPP(General Purpose Processor) Data Path

A GPP unlike a Single Purpose

Same memory for program and

Digital Signal Processors Data Path Only

Figure 4.13 Pipelining for speeding up the execution of an instruction

High End Fixed Point

Figure 3.1(a) Fixed-point format to represent signed integers

Figure 3.1(b) Fixed-point format to represent signed fractions

Figure 3.2 IEEE 754 format for floating-point numbers

Figure 3.3 (c) quantization error in truncation A/D converter

Figure 3.3 (d) quantization error in rounding A/D converter, (e)

Fixed Point Vs Floating

Limited dynamic range

floating point processors

It is harder to create an efficient program in C on a fixed

Fixed Point Vs Floating

Digital Subscriber Line (DSL)

2G, 2.5G and 3G Cell Phones

Digital Audio Players

Central Office Switches

Digital Still Cameras

Private Branch Exchange (PBX)

You might also like