Professional Documents
Culture Documents
You will also discover that is possible to design complex digital systems with little knowledge of circuits
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-2
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-3
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-4
Whats required?
ICs with dramatically higher performance, optimized for applications
and at a size and power to deliver mobility cost to address mass consumer markets
February 6, 2013 http://csg.csail.mit.edu/6.375 L01-5
Source: http://www.intel.com/technology/silicon/mooreslaw/index.htm
April 2012
Samsung Exynos Quad: - quad-core A9 - 1GB DDR2 (low power) - Multimedia processor - ...
but our mind set is that hardware design is: New design Difficult, risky flows and tools Increases time-to-market can change this Inflexible, brittle, error prone, mind set...
Difficult to deal with changing standards,
http://csg.csail.mit.edu/6.375 February 6, 2013 L01-8
64-core Tilera
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-9
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-10
Intellectual Property
Multiple instantiations of a block for different performance and application requirements Packaging of IP so that the blocks can be assembled easily to build a large system (black box model)
Architectural exploration to understand cost, power and performance tradeoffs Full system simulations for validation and verification
February 6, 2013 http://csg.csail.mit.edu/6.375 L01-11
Hardware design today is like programming was in the fifties, i.e., before the invention of high-level languages
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-12
An IBM 650 Instruction: 60 1234 1009 Can you program a computer without knowing , for example, Load the contents locationit1234 how many of registers has? into the distribution; put it also into the upper accumulator; 1950s set lower accumulator to zero; and then go to reaction location 1009 for the next instruction.
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-14
Smaller, simpler, clearer, more correct code not just simulation, synthesis as well
February 6, 2013 http://csg.csail.mit.edu/6.375 L01-15
Scrambler
FEC Encoder
Interleaver
Mapper
IFFT
CP Insertion
D/A
MAC
RX Controller
DeInterleaver
DeMapper
Channel Estimater
FFT
S/P
Synchronizer
A/D
85% reusable code between WiFi and WiMAX Different throughput requirements From WiFi to WiMAX in 4 weeks WUSB: Turbo x15+x14+1 Different algorithms
http://csg.csail.mit.edu/6.375 L01-16
February 6, 2013
First simulate Second run on FPGAs We wont explore the chip design path
C Bluesim
Cycle Accurate
gates
Power estimation tool
Place & Route
FPGA
L01-17
Tapeout http://csg.csail.mit.edu/6.375
Hand-drawn transistors (+ some standard cells) High volume, best possible performance: used for most advanced microprocessors High volume, moderate performance: Graphics chips, network chips, cell-phone chips Prototyping Low volume, low-moderate performance applications
Standard-Cell-Based ASICs
http://www.intel.com/intel/intelis/museum/exhibit/hist_micro/hof/hof_main.htm
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-19
22nm technology
1.4 Billion transistors 3.4 GHz clock frequency Power > 17 Watts (under clocked) Could fit over 1200 486 processors on same size die.
February 6, 2013 http://csg.csail.mit.edu/6.375
L01-20
10
Transistors (M) Relative staffing on back-end Relative staffing on front-end 9x growth in back-end staff 5x growth in front-end staff
Front-end is designing the logic (RTL) Back-end is fitting all the gates and wires on the chip; meeting timing specifications; wiring up power, ground, and clock
February 6, 2013 http://csg.csail.mit.edu/6.375 L01-21
59% chip design (architecture, logic & I/O design, product & test engineering) 30% software and applications development 11% prototyping (masks, wafers, boards)
If we sell 100,000 units, NRE costs add $30M/100K = $300 per chip!
Hand-crafted IBM-Sony-Toshiba Cell microprocessor achieves 4GHz in 90nm, but at the development cost of >$400M
11
Each cell in array contains a programmable logic function Array has programmable interconnect between logic functions Overhead of programmability makes arrays expensive and slow as compared to ASICs However, much cheaper than an ASIC for small volumes because NRE costs do not include chip development costs (only include programming)
February 6, 2013 http://csg.csail.mit.edu/6.375 L01-23
Dramatically reduce the cost of errors Little physical design work Remove the reticle costs from each design
Switching power around ~12X worse Performance up 3-4X worse Still requires Area 20-40X greater tremendous design effort at RTL level
http://csg.csail.mit.edu/6.375 L01-24
February 6, 2013
12
A multicore can be emulated on one FPGA but the programming model is RTL and not too many people design hardware
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-25
6.375 Philosophy
Effective abstractions to reduce design effort
High-level design language rather than logic gates Control specified with Guarded Atomic Actions rather than with finite state machines Guarded module interfaces to systematically build larger modules by the composition of smaller modules Decoupled units rather than tightly coupled state machines Architecture choice has largest impact on solution quality
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-26
13
Bluespec System Verilog Reference manual Bluespec System Verilog Users guide How to use all the tools for developing BSV programs
February 6, 2013
http://csg.csail.mit.edu/6.375
L01-28
14