Title: Logical Protocol to Physical Design
1. Logical Protocol to Physical Design
- CS 258, Spring 99
- David E. Culler
- Computer Science Division
- U.C. Berkeley
2. Lock Performance on SGI Challenge
- [Figure: measured lock performance; loop body with lock delay (c) and unlock delay (d)]
3. Barriers
- Single flag has problems on repeated use
- flag tells only that everyone has reached the barrier, not that they have left it
- use two barriers
- two flags
- sense reversal
- Barrier complexity is linear on a bus, regardless of algorithm
- tree-based algorithms reduce contention
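The sense-reversal fix above can be sketched in simulation. This is a minimal Python sketch (the class name `SenseBarrier` and the helper `run_phases` are illustrative, not from the slides): each thread flips a private sense flag per episode and spins until the shared release flag matches it, so the same barrier object is safe to reuse.

```python
import threading

class SenseBarrier:
    """Reusable spin barrier using sense reversal: one shared release
    flag plus one private sense flag per thread."""
    def __init__(self, n):
        self.n = n
        self.count = n                  # threads still to arrive this episode
        self.sense = False              # shared release flag, flipped each episode
        self.lock = threading.Lock()
        self.local = threading.local()  # holds each thread's private sense

    def wait(self):
        # Flip this thread's private sense: it now waits for the *new* value,
        # so stragglers from the previous episode cannot be released early.
        my_sense = not getattr(self.local, "sense", False)
        self.local.sense = my_sense
        with self.lock:
            self.count -= 1
            if self.count == 0:         # last arrival: reset and release everyone
                self.count = self.n
                self.sense = my_sense
                return
        while self.sense != my_sense:   # spin (would spin in cache on real hardware)
            pass

def run_phases(n_threads, n_phases):
    """Drive n_threads through n_phases barrier episodes; returns
    (per-phase arrival counts, phases where a thread was released early)."""
    barrier = SenseBarrier(n_threads)
    counts = [0] * n_phases
    errors = []
    lock = threading.Lock()
    def worker():
        for phase in range(n_phases):
            with lock:
                counts[phase] += 1
            barrier.wait()
            if counts[phase] != n_threads:  # everyone must arrive before release
                errors.append(phase)
    threads = [threading.Thread(target=worker) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counts, errors
```

Reuse across multiple episodes is exactly the case that breaks the single-flag barrier; here repeated `wait()` calls work without re-initialization.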
4. Bag of Tricks for Spatial Locality
- Assign tasks to reduce spatial interleaving of accesses from procs
- Contiguous rather than interleaved assignment of array elements
- Structure data to reduce spatial interleaving of accesses
- Higher-dimensional arrays to keep partitions contiguous
- Reduce false sharing and fragmentation as well as conflict misses
5. Logical Protocol Algorithm
- Set of States
- Events causing state transitions
- Actions on Transition
6. Reality
- Protocol defines logical FSM for each block
- Cache controller FSM
- multiple states per miss
- Bus controller FSM
- other controllers get the bus
- multiple bus transactions
- write-back
- Multi-Level Caches
- Split-Transaction Buses
7Typical Bus Protocol
BG
BReq
BR
BGnt
Addr
BG
OK
OK
Addr
Data
others may get bus
OK
Data
OK
- Bus state machine
- Assert request for bus
- Wait for bus grant
- Drive address and command lines
- Wait for command to be accepted by relevant
device - Transfer data
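The five steps above form a small FSM per bus master. A minimal Python sketch (state and signal names follow the slide's BReq/BGnt/OK lines; the class itself is illustrative):

```python
from enum import Enum

class S(Enum):
    IDLE = 0; REQ = 1; ADDR = 2; WAIT_ACK = 3; DATA = 4

class BusMaster:
    """One bus-master FSM: request -> grant -> address -> accept -> data."""
    def __init__(self):
        self.state = S.IDLE

    def step(self, start=False, gnt=False, ack=False):
        if self.state is S.IDLE and start:
            self.state = S.REQ          # assert bus request (BReq)
        elif self.state is S.REQ and gnt:
            self.state = S.ADDR         # grant (BGnt) received: drive address/command
        elif self.state is S.ADDR:
            self.state = S.WAIT_ACK     # wait for device to accept command (OK)
        elif self.state is S.WAIT_ACK and ack:
            self.state = S.DATA         # command accepted: transfer data
        elif self.state is S.DATA:
            self.state = S.IDLE         # transaction done; others may get the bus
        return self.state
```

Note the FSM simply holds its state in REQ and WAIT_ACK until the external signal arrives, matching the "wait for" steps in the list.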
8. Correctness Issues
- Fulfill conditions for coherence and consistency
- write propagation and atomicity
- Deadlock: all system activity ceases
- cycle of resource dependences
- Livelock: no processor makes forward progress, although transactions are performed at the hardware level
- e.g. simultaneous writes in invalidation-based protocol
- each requests ownership, invalidating the other, but loses it before winning arbitration for the bus
- Starvation: one or more processors make no forward progress while others do
- e.g. interleaved memory system with NACK on bank busy
- often not completely eliminated (not likely, not catastrophic)
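The "cycle of resource dependences" definition of deadlock can be checked mechanically on a wait-for graph. A sketch (the graph representation and the P1/P2 example edges are invented for illustration):

```python
def has_deadlock(waits_for):
    """Detect a cycle in a wait-for graph {agent: [agents it waits on]}.
    A cycle of resource dependences means deadlock."""
    WHITE, GRAY, BLACK = 0, 1, 2
    color = {a: WHITE for a in waits_for}
    def dfs(a):
        color[a] = GRAY                       # on the current DFS path
        for b in waits_for.get(a, []):
            if color.get(b, WHITE) == GRAY:   # back edge -> cycle -> deadlock
                return True
            if color.get(b, WHITE) == WHITE and dfs(b):
                return True
        color[a] = BLACK                      # fully explored, no cycle through a
        return False
    return any(color[a] == WHITE and dfs(a) for a in waits_for)
```

For example, if P1 holds the bus while waiting on a resource P2 holds, and P2 is waiting for the bus, the two-edge cycle is a deadlock.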
9. Preliminary Design Issues
- Design of cache controller and tags
- Both processor and bus need to look up
- How and when to present snoop results on bus
- Dealing with write-backs
- Overall set of actions for memory operation not atomic
- Can introduce race conditions
- atomic operations
- New issues: deadlock, livelock, starvation, serialization, etc.
10. Contention for Cache Tags
- Cache controller must monitor bus and processor
- Can view as two controllers: bus-side and processor-side
- With single-level cache: dual tags (not data) or dual-ported tag RAM
- must reconcile when updated, but usually only looked up
- Respond to bus transactions
11. Reporting Snoop Results: How?
- Collective response from caches must appear on bus
- Example: in MESI protocol, need to know
- Is block dirty, i.e. should memory respond or not?
- Is block shared, i.e. transition to E or S state on read miss?
- Three wired-OR signals
- Shared: asserted if any cache has a copy
- Dirty: asserted if some cache has a dirty copy
- needn't know which, since it will do what's necessary
- Snoop-valid: asserted when OK to check other two signals
- actually inhibit until OK to check
- Illinois MESI requires priority scheme for cache-to-cache transfers
- Which cache should supply data when in shared state?
- Commercial implementations allow memory to provide data
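The wired-OR combining and the resulting MESI decisions can be sketched directly (function names are illustrative; the logic follows the Shared/Dirty rules above):

```python
def combine_snoop(copies):
    """copies: per-cache states ('M','E','S','I') for the block.
    Returns the wired-OR (shared, dirty) pair seen on the bus."""
    shared = any(s in ("M", "E", "S") for s in copies)  # any cache has a copy
    dirty = any(s == "M" for s in copies)               # some cache has a dirty copy
    return shared, dirty

def read_miss_outcome(copies):
    """Requester's fill state and data supplier on a BusRd."""
    shared, dirty = combine_snoop(copies)
    fill = "S" if shared else "E"               # E only if no other cache has it
    supplier = "cache" if dirty else "memory"   # memory is stale if some cache is M
    return fill, supplier
```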
12. Reporting Snoop Results: When?
- Memory needs to know what, if anything, to do
- Fixed number of clocks from address appearing on bus
- Dual tags required to reduce contention with processor
- Still must be conservative (update both on write: E → M)
- Pentium Pro, HP servers, Sun Enterprise
- Variable delay
- Memory assumes cache will supply data till all say "sorry"
- Less conservative, more flexible, more complex
- Memory can fetch data and hold just in case (SGI Challenge)
- Immediately: bit-per-block in memory
- Extra hardware complexity in commodity main memory system
13. Writebacks
- To allow processor to continue quickly, want to service miss first and then process the write-back caused by the miss asynchronously
- Need write-back buffer
- Must handle bus transactions relevant to buffered block
- snoop the WB buffer
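A snooped write-back buffer can be sketched as follows (the class and its memory interface are invented for the sketch): on a bus request for a buffered block, the buffer supplies the data and cancels the pending memory write-back.

```python
class WriteBackBuffer:
    """Holds dirty blocks evicted by a miss until the bus is free;
    must be snooped just like the cache tags."""
    def __init__(self):
        self.pending = {}               # addr -> data awaiting write-back

    def evict(self, addr, data):
        self.pending[addr] = data       # service the miss first, write back later

    def snoop(self, addr):
        """Bus transaction for addr: if the block is buffered, supply it
        and cancel the write-back (requester/memory now has the latest data)."""
        if addr in self.pending:
            return self.pending.pop(addr)
        return None

    def drain_one(self, memory):
        """When the bus is idle, retire one buffered block to memory."""
        if self.pending:
            addr, data = self.pending.popitem()
            memory[addr] = data
```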
14. Basic design
15. Non-Atomic State Transitions
- Memory operation involves many actions by many entities, incl. bus
- Look up cache tags, bus arbitration, actions by other controllers, ...
- Even if bus is atomic, overall set of actions is not
- Can have race conditions among components of different operations
- Suppose P1 and P2 attempt to write cached block A simultaneously
- Each decides to issue BusUpgr to allow S → M
- Issues
- Must handle requests for other blocks while waiting to acquire bus
- Must handle requests for this block A
- e.g. if P2 wins, P1 must invalidate its copy and modify its request to BusRdX
16. Handling Non-atomicity: Transient States
- Two types of states
- Stable (e.g. MESI)
- Transient or Intermediate
- Increases complexity
- e.g. don't use BusUpgr; rather, use other mechanisms to avoid data transfer
17. Serialization
- Processor-cache handshake must preserve serialization of bus order
- e.g. on write to block in S state, mustn't write data in block until ownership is acquired
- other transactions that get the bus before this one may seem to appear later
18. Write Completion for SC?
- Needn't wait for invalidation to actually happen
- Just wait till it gets the bus
- Commit versus complete
- Don't know when invalidation is actually inserted in destination process's local order, only that it's before the next transaction and in the same order for all procs
- Local write hits become visible not before next bus transaction
- Same argument will extend to more complex systems
- What matters is not when written data gets on the bus (write-back), but when subsequent reads are guaranteed to see it
- Write atomicity: if a read returns the value of a write W, W has already gone to the bus and therefore completed if it needed to
19. Deadlock, Livelock
- Request-reply protocols can lead to protocol-level fetch deadlock
- In addition to buffer deadlock discussed earlier
- When attempting to issue requests, must service incoming transactions
- cache controller awaiting bus grant must snoop and even flush blocks
- else may not respond to the request that will release the bus
- [Figure: controller with a pending request must still provide snoop service]
20. Livelock, Starvation
- Many processors try to write the same line. Each one
- Obtains exclusive ownership via bus transaction (assume not in cache)
- Realizes block is in cache and tries to write it
- Livelock: I obtain ownership, but you steal it before I can write, etc.
- Solution: don't let exclusive ownership be taken away before write is done
- Starvation: solve by using fair arbitration on bus and FIFO buffers
21. Implementing Atomic Operations
- In cache or memory?
- cacheable
- better latency and bandwidth on self-reacquisition
- allows spinning in cache without generating traffic while waiting
- at-memory
- lower transfer time
- used to be implemented with locked read-write pair of bus transactions
- not viable with modern, pipelined buses
- usually traffic and latency considerations dominate, so use cacheable
- what is the implementation strategy?
22. Use Cache Exclusivity for Atomicity
- get exclusive ownership, then read-modify-write
- error: conflicting bus transactions (read or ReadEx)
- can actually buffer such a request if the R-W is committed
23. Implementing LL-SC
- Lock flag and lock address register at each processor
- LL reads block, sets lock flag, puts block address in register
- Incoming invalidations checked against address: if match, reset flag
- Also if block is replaced and at context switches
- SC checks lock flag as indicator of intervening conflicting write
- If reset, fail; if not, succeed
- Livelock considerations
- Don't allow replacement of lock variable between LL and SC
- split or set-assoc. cache, and don't allow memory accesses between LL, SC
- (also don't allow reordering of accesses across LL or SC)
- Don't allow failing SC to generate invalidations (not an ordinary write)
- Performance: both LL and SC can miss in cache
- Prefetch block in exclusive state at LL
- But exclusive request reintroduces livelock possibility: use backoff
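The lock-flag/lock-address mechanism above can be sketched as a small simulation (the class `LLSCUnit` and its method names are illustrative): an invalidation matching the lock address clears the flag, and SC succeeds only if the flag survived.

```python
class LLSCUnit:
    """Per-processor link state: LL sets it, a matching invalidation
    (or replacement / context switch) clears it, SC checks it."""
    def __init__(self, memory):
        self.mem = memory
        self.lock_flag = False
        self.lock_addr = None

    def ll(self, addr):
        """Load-linked: read the block and arm the link register."""
        self.lock_flag = True
        self.lock_addr = addr
        return self.mem[addr]

    def snoop_invalidate(self, addr):
        """Incoming invalidation checked against the lock address."""
        if self.lock_flag and addr == self.lock_addr:
            self.lock_flag = False      # intervening conflicting write detected

    def clear(self):
        """Block replaced, or context switch: conservatively drop the link."""
        self.lock_flag = False

    def sc(self, addr, value):
        """Store-conditional: succeeds only if the link is intact.
        A failing SC performs no write and generates no invalidation."""
        if self.lock_flag and addr == self.lock_addr:
            self.mem[addr] = value
            self.lock_flag = False
            return True
        return False

def atomic_increment(unit, addr):
    """Classic LL/SC retry loop; returns the value before the increment."""
    while True:
        old = unit.ll(addr)
        if unit.sc(addr, old + 1):
            return old
```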
24. Multilevel Cache Hierarchies
- [Figure: processor chip with L1 and L2 caches; snoop at which level?]
- Independent snoop hardware for each level?
- processor pins for shared bus
- contention for processor cache access?
- Snoop only at L2 and propagate relevant transactions
- Inclusion property
- (1) contents of L1 are a subset of L2
- (2) any block in modified state in L1 is in modified state in L2
- 1 → all transactions relevant to L1 are relevant to L2
- 2 → on BusRd, L2 can wave off memory access and inform L1
25. Maintaining Inclusion
- The two caches (L1, L2) may choose to replace different blocks
- Differences in reference history
- set-associative first-level cache with LRU replacement
- example: blocks m1, m2, m3 fall in same set of L1 cache...
- Split higher-level caches
- instruction, data blocks go in different caches at L1, but may collide in L2
- what if L2 is set-associative?
- Differences in block size
- But a common case works automatically
- L1 direct-mapped, fewer sets than in L2, and block size same
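The common case can be checked by simulation. A sketch under stated assumptions: L1 direct-mapped with s1 sets, L2 also direct-mapped (an extra simplifying assumption not stated on the slide) with s2 sets where s1 divides s2, same block size, and both caches filled on every reference. An L2 victim then always maps to the same L1 set as the incoming block, which overwrites that L1 set anyway, so inclusion holds with no explicit effort.

```python
import random

def inclusion_holds(s1, s2, n_blocks, n_refs, seed=0):
    """Simulate a direct-mapped L1 (s1 sets) and direct-mapped L2 (s2 sets,
    s1 dividing s2), same block size, both filled on every reference.
    Returns True iff L1 stayed a subset of L2 after every reference."""
    assert s2 % s1 == 0
    l1 = [None] * s1                    # l1[set] = resident block address, or None
    l2 = [None] * s2
    rng = random.Random(seed)
    for _ in range(n_refs):
        b = rng.randrange(n_blocks)     # referenced block address
        l2[b % s2] = b                  # L2 victim maps to the same L1 set as b...
        l1[b % s1] = b                  # ...and that L1 set is overwritten by b anyway
        for x in l1:                    # inclusion check: every L1 block is in L2
            if x is not None and l2[x % s2] != x:
                return False
    return True
```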
26. Preserving Inclusion Explicitly
- Propagate lower-level (L2) replacements to higher level (L1)
- Invalidate or flush (if dirty) messages
- Propagate bus transactions from L2 to L1
- Propagate all L2 transactions?
- use inclusion bits?
- Propagate modified state from L1 to L2 on writes?
- if L1 is write-through, just invalidate
- if L1 is write-back
- add extra state to L2 (dirty-but-stale)
- request flush from L1 on BusRd
27. Contention for Cache Tags
- L2 filter reduces contention on L1 tags
28. Correctness
- Issues altered?
- Not really, if all propagation occurs correctly and is waited for
- Writes commit when they reach the bus, acknowledged immediately
- But performance problems, so want to not wait for propagation
- same issues as split-transaction buses