CS184a: Computer Architecture Structure and Organization - PowerPoint PPT Presentation

About This Presentation

Title:

CS184a: Computer Architecture Structure and Organization

Description:

Chip: 7mm side, 70nm sq. (45nm process) 105 squares across chip ... http://www.cs.caltech.edu/~andre/courses/CS294S97/notes/day14/day14.html. How far in GHz ... – PowerPoint PPT presentation

Number of Views:43

Avg rating:3.0/5.0

Slides: 49

Provided by: andre57

Learn more at: http://courses.cms.caltech.edu

Category:

more less

Transcript and Presenter's Notes

Title: CS184a: Computer Architecture Structure and Organization

1
CS184aComputer Architecture(Structure and
Organization)

Day 13 February 4, 2005
Interconnect 1 Requirements

2
Last Time

Saw various compute blocks
To exploit structure in typical designs we need
programmable interconnect
All reasonable, scalable structures
small to moderate sized logic blocks
connected via programmable interconnect
been saying delay across programmable
interconnect is a big factor

3
Today

Interconnect Design Space
Dominance of Interconnect
Interconnect Delay
Simple things
and why they dont work

4
Dominant Area
5
Dominant Time
6
Dominant Time
7
Dominant Power
XC4003A data from Eric Kusse (UCB MS 1997)
8
For Spatial Architectures

Interconnect dominant
area
power
time
so need to understand in order to optimize
architectures

9
Interconnect

Problem
Thousands of independent (bit) operators
producing results
true of FPGAs today
true for LIW, multi-uP, etc. in future
Each taking as inputs the results of other (bit)
processing elements
Interconnect is late bound
dont know until after fabrication

10
Design Issues

Flexibility -- route anything
(w/in reason?)
Area -- wires, switches
Delay -- switches in path, stubs, wire length
Power -- switch, wire capacitance
Routability -- computational difficulty finding
routes

11
Delay
12
Wiring Delay

Delay on wire of length Lseg
Tseg Tgate 0.4 RC
C Lseg ?Csq
R Lseg ?Rsq
Tseg Tgate 0.4 Csq ? Rsq ? Lseg2

13
Wire Numbers

Rsq 0.17 W/sq.
from ITRSInterconnect
Conductor effective resistance
A/R (aspect ratio)
Csq 7 ? 10-18F/sq.
Rsq? Csq ? 10-18 s
Tgate 30 ps
Chip 7mm side, 70nm sq. (45nm process)
105 squares across chip

14
Wiring Delay

Wire Delay
Tseg Tgate 0.4 Csq ? Rsq ? Lseg2
Tseg 30ps 0.4 10-18 s ? 1010
Tseg 30ps 4ns ? 4ns

15
Buffer Wire

Buffer every Lseg
Tcross (Lcross/Lseg) Tseg
Tcross (Lcross/Lseg) (Tgate 0.4 Csq ? Rsq ?
Lseg2)
(Lcross) (Tgate/Lseg 0.4 Csq ? Rsq ?
Lseg)

16
Opt. Buffer Wire

Tcross (Lcross) (Tgate/Lseg 0.4 Csq ? Rsq ?
Lseg)
Minimize
Take d(Tcross)/d(Lseg) 0
0 (Lcross) (-Tgate/Lseg2 0.4 Csq ? Rsq)
Tgate 0.4 Csq ? Rsq Lseg2

17
Optimization Point

Optimized
Tcross (Lcross/Lseg) (Tgate 0.4 Csq ? Rsq ?
Lseg2)
Tgate 0.4 Csq ? Rsq Lseg2
Says equalize gate and wire delay

18
Optimal Segment Length

Tgate 0.4 Csq ? Rsq Lseg2
Lseg Sqrt(Tgate /0.4 Csq ? Rsq)
Lseg Sqrt(30 10-12 s/0.4 10-18 s)
Lseg ? Sqrt(108 ) ?104 sq.

19
Buffered Delay

Chip 7mm side, 70nm sq. (45nm process)
105 squares across chip
Lseg ? 104 sq.
10 segments
Each of delay 2 Tgate
Tcross 20?30ps 600ps
Compare 4ns

20
Unbuffered Switch

R600W (width 20)
About 3600 squares? 0.17 W/sq.
C5?10-16F
About 100 squares?
Not lumped 2x worse
Together contribute roughly 1200 squares
Maybe 8 per rebuffer?
assumes large switch and no wire

21
Buffered Switch

Pay Tgate at each switch
Slows down relative to
Optimally buffered wire
Unbuffered switch
when placed too often

22
Stub Capacitance

Every untaken switch touching line
C2.5?10-16F
About 50 squares
and lumped so 2?

23
Delay through Switching
0.6 mm CMOS
http//www.cs.caltech.edu/andre/courses/CS294S97/
notes/day14/day14.html
24
First Attempts
25
(1) Shared Bus

Familiar case
Use single interconnect resource
Reuse in Time
Consequence?

26
Shared Bus

Consider operation yAx2 Bx C
3 mpys
2 adds
5 values need to be routed from producer to
consumer
Performance lower bound if have design w/
m multipliers
u madd units
a adders
i simultaneous interconnection busses

27
Resource Bounded Scheduling

Scheduling in general NP-hard
(find optimum)
can approximate in O(E) time

28
Lower Bound Critical Path

ASAP schedule ignoring resource constraints
(look at length of remaining critical path)
Certainly cannot finish any faster than that

29
Lower Bound Resource Capacity

Sum up all capacity required per resource
Divide by total resource (for type)
Lower bound on remaining schedule time
(best can do is pack all use densely)

30
Example
Critical Path
Resource Bound (2 resources)
Resource Bound (4 resources)
31
Example 2
RB 8/24 LB 5 best delay 6
32
Shared Bus

Consider operation yAx2 Bx C
3 mpys
2 adds
5 values need to be routed from producer to
consumer
Performance lower bound if have design w/
m multipliers
u madd units
a adders
i simultaneous interconnection busses

33
Viewpoint

Interconnect is a resource
Bottleneck for design can be in availability of
any resource
Lower Bound on Delay
Logical Resource / Physical Resources
May be worse
Dependencies (critical path bound)
ability to use resource

34
Shared Bus

Area ()
kn switches
O(n)

Flexibility ()
routes everything (given enough time)
can be trick to schedule use optimally
Delay (Power) (--)
wire length O(kn)
parasitic stubs knn
series switch 1
O(kn)
sequentialize I/B

35
Term Bisection Bandwidth

Partition design into two equal size halves
Minimize wires (nets) with ends in both halves
Number of wires crossing is bisection bandwidth

36
(2) Crossbar

Avoid bottleneck
Every output gets its own interconnect channel

37
Crossbar
38
Crossbar
39
Crossbar

Flexibility ()
routes everything (guaranteed)
Delay (Power) (-)
wire length O(kn)
parasitic stubs knn
series switch 1
O(kn)

Area (-)
Bisection bandwidth n
kn2 switches
O(n2)

40
Crossbar

Better than exponential
Too expensive
Switch Area kn22.5Kl2
Switch Area/LUT kn 2.5Kl2
n1024, k4 ? 10M l2
What can we do?

41
Avoiding Crossbar Costs

Typical architecture trick
exploit expected problem structure
We have freedom in operator placement
Designs have spatial locality
?place connected components close together
dont need full interconnect?

42
Exploit Locality

Wires expensive
Local interconnect cheap
1D versions
What does this do to
Switches?
Delay?
(quantify on hmwrk)

43
Exploit Locality

Wires expensive
Local interconnect cheap
Use 2D to make more things closer
Mesh?

44
Mesh Analysis

Can we place everything close?

45
Mesh Closeness

Try placing everything close

46
Mesh Analysis

Flexibility - ?
Ok w/ large w
Delay (Power)
Series switches
1--?n
Wire length
w--w?n
Stubs
O(w)--O(w?n)

Area
Bisection BW -- w?n
Switches -- O(nw)
O(w2n)
larger on homework

47
Mesh

Plausible
but Whats w
and how does it grow?

48
Big IdeasMSB Ideas

Interconnect Dominant
power, delay, area
Can be bottleneck for designs
Cant afford full crossbar
Need to exploit locality
Cant have everything close

Write a Comment

User Comments (0)

About PowerShow.com

Recommended Relevance Latest Highest Rated Most Viewed

Sort by:

Related More from user

CrystalGraphics Presentations

Introducing-PowerShowcom PowerPoint PPT Presentation

Introducing-PowerShowcom - Introducing-PowerShowcom (Without Music)

CrystalGraphics 3D Character Slides for PowerPoint PowerPoint PPT Presentation

CrystalGraphics 3D Character Slides for PowerPoint - CrystalGraphics 3D Character Slides for PowerPoint

Chart and Diagram Slides for PowerPoint PowerPoint PPT Presentation

Chart and Diagram Slides for PowerPoint - Beautifully designed chart and diagram s for PowerPoint with visually stunning graphics and animation effects. Our new CrystalGraphics Chart and Diagram Slides for PowerPoint is a collection of over 1000 impressively designed data-driven chart and editable diagram s guaranteed to impress any audience. They are all artistically enhanced with visually stunning color, shadow and lighting effects. Many of them are also animated. And they’re ready for you to use in your PowerPoint presentations the moment you need them. – PowerPoint PPT presentation

Related Presentations

CS184a: Computer Architecture (Structures and Organization) PowerPoint PPT Presentation

CS184a: Computer Architecture (Structures and Organization) - return end of class in basket. or later to Cynthia (256 JRG) Caltech CS184a ... `Science is the belief in the ignorance of experts.'' -- Richard Feynman ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - and why they don't work. Characterizing Interconnect ... Resuming... Caltech CS184a Fall2000 -- DeHon. 15. Rent's Rule. Typically consider. 0.5 P 0.75 ' ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - (2) Crossbar. Avoid bottleneck. Every output gets its own interconnect channel ... Can't afford full crossbar. Need to exploit locality. Can't have everything close ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - Coming Attractions. Administrivia. Big Ideas. MSB. MSB-1. Caltech CS184a Fall2000 -- DeHon ... Coming Attractions: Three Talks by Tom Knight. Thursday 4pm (102 Steele) ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structure and Organization) PowerPoint PPT Presentation

CS184a: Computer Architecture (Structure and Organization) - Lower Upper Bound: 22M functions realizable by M-LUT. Say Need n 4-LUTs to cover; compute n: ... Upper Bound: (M-k)/log2(k- log2(k)) 1. Caltech CS184 ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - minimum area (one study, see paper) K=10, N=12, M=3. A(PLA 10,12,3) ... Questions about homework. Caltech CS184a Fall2000 -- DeHon. 29. Big Ideas [MSB Ideas] ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - Just starting to look at balancing interconnect and logic. Caltech CS184a Fall2000 -- DeHon ... Better results if 'reassociate' rather than keeping original subtrees. ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - CS184a: Computer Architecture (Structures and Organization) Day20: November 29, 2000 Review Today Review content and themes N.B. EOT Feedback Questionnaire return end ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 5th Edition PowerPoint PPT Presentation

William Stallings Computer Organization and Architecture 5th Edition - William Stallings Computer Organization and Architecture 5th Edition Chapter 11 CPU Structure and Function CPU Topics Processor Organization ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition PowerPoint PPT Presentation

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 11 Instruction Sets: Addressing Modes and Formats Addressing Modes Immediate Direct ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 8th Edition PowerPoint PPT Presentation

William Stallings Computer Organization and Architecture 8th Edition - William Stallings Computer Organization and Architecture 8th Edition Chapter 12 Processor Structure and Function CPU Structure CPU must: Fetch instructions Interpret ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structures and Organization) - CS184a: Computer Architecture (Structures and Organization) Day1: September 25, 2000 Introduction and Overview Today Matter Computes Architecture Matters This Course ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 10 Instruction Sets: Characteristics and Functions What is an instruction set? | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 1 Introduction Architecture & Organization 1 Architecture is those attributes visible to ... | PowerPoint PPT presentation | free to view

CS184a: Computer Architecture (Structure and Organization) - ... wired-OR Wired-or Connect series of inputs to wire Any of the inputs can drive the wire high Wired-or Implementation with ... of Technology Other titles: Times ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 8th Edition - William Stallings Computer Organization and Architecture 8th Edition Chapter 14 Instruction Level Parallelism and Superscalar Processors What is Superscalar? | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 18 Parallel Processing Multiple Processor Organization Single instruction, single data ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 8th Edition - William Stallings Computer Organization and Architecture 8th Edition Chapter 8 Operating System Support Objectives and Functions Convenience Making the computer ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 2 Computer Evolution and Performance A brief history of computer The first Generation ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 1 Introduction Architecture & Organization 1 Architecture is those attributes of a system ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 8th Edition - William Stallings Computer Organization and Architecture 8th Edition Chapter 7 Input/Output | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 7th Edition PowerPoint PPT Presentation

William Stallings Computer Organization and Architecture 7th Edition - William Stallings Computer Organization and Architecture 7th Edition Chapter 1 Introduction | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 7th Edition - William Stallings Computer Organization and Architecture 7th Edition Chapter 10 Instruction Sets: Characteristics and Functions | PowerPoint PPT presentation | free to view

Introduction to Computer Systems and Performance PowerPoint PPT Presentation

Introduction to Computer Systems and Performance - Chapter 1 Introduction to Computer Systems and Performance CS.216 Computer Architecture and Organization | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 6th Edition - William Stallings Computer Organization and Architecture 6th Edition Chapter 11 ... Organization and Architecture 6th Edition Addressing Modes Immediate ... | PowerPoint PPT presentation | free to view

William Stallings Computer Organization and Architecture 7th Edition - William Stallings Computer Organization and Architecture 7th Edition Chapter 5 Internal Memory ... | PowerPoint PPT presentation | free to view