Operating System Issues in Multi-Processor Systems - PowerPoint PPT Presentation

About This Presentation
Title:

Operating System Issues in Multi-Processor Systems

Description:

Operating System Issues in Multi-Processor Systems John Sung Hardware Engineer Compaq Computer Corporation Outline Multi-Processor Hardware Issues Snoopy Bus System ... – PowerPoint PPT presentation

Number of Views:257
Avg rating:3.0/5.0
Slides: 23
Provided by: John515
Category:

less

Transcript and Presenter's Notes

Title: Operating System Issues in Multi-Processor Systems


1
Operating System Issues in Multi-Processor
Systems
  • John Sung
  • Hardware Engineer
  • Compaq Computer Corporation

2
Outline
  • Multi-Processor Hardware Issues
  • Snoopy Bus System Architecture
  • AMD Athlons Snoopy Protocol
  • ccNUMA System Architecture
  • AMD Athlons LDT System Bus
  • SGI Origions ccNUMA System Architecture
  • Alpha 21364 System Architecture
  • ccNUMA and CPU Scheduling
  • Conclusion

3
Multi-Processor Hardware Issues
  • Bandwidth/Latency
  • Processor to Processor
  • Processor to Memory
  • Processor to I/O
  • Scalability
  • Increase performance as you increase CPU/Memory
  • Coherency/Synchronization
  • Give software coherent view of memory
  • Provide synchronization primitives

4
Snoopy Bus System Architecture
5
Snoopy Bus System Architecture
  • A bus Connects Processors,Memory,and I/O
  • Scales upto 16 processors
  • Limited by bus bandwidth
  • Cache Coherency Protocol
  • Snoops the bus for memory traffic
  • Each set has to listen for addresses in its
    cache
  • Does the right thing to give software coherent
    view of memory

6
Snoopy Bus System Architecture
CPU Core
CPU Core
CPU Core
Cache
Cache
Cache
Bus
Memory I/O
Memory I/O
Memory I/O
7
ccNUMA System Architecture
8
ccNUMA System Architecture
  • Cache-Coherent Non-Uniform Memory Access
  • Memory is distributed and attached to processors
  • Some network connects each processor/memory sets
  • Each processor owns part of the memory space
  • Cache coherency protocol
  • Gives software coherent view of memory
  • Protocol primitives for synchronization
  • Directory to keep track of who has a copy of
    memory

9
ccNUMA System Architecture
CPU Core
CPU Core
Cache
Cache
Memory Directory I/O
Network Router
Network Router
Memory Directory I/O
Network Fabric
10
SGI Origin System Architecture
11
SGI CrayLinkTM
  • Node 2 CPU and their cache
  • Module Memory Directory HUB
  • 2 Modules per Router
  • System Modules Routers CrayLinkTM Network

12
SGI CrayLinkTM
13
Processor System Network
14
Bisectional Bandwidth
15
ccNUMA and CPU Scheduling Issues
16
OSs Questions
  • Single CPU System
  • What to schedule next?
  • ccNUMA System
  • What to schedule next?
  • Which cpu to schedule it to?
  • Where should the process information be located
    at?
  • 1 or many instances of OS?

17
OSs Choices for a Process
  • Single CPU System
  • Process has1 choice
  • Process information has 1 choice
  • ccNUMA System with N CPUs and M Memory
  • Process has N choices
  • Process information M choices per virtual page
  • Distance between process and its information

18
Context Switch Penalty
  • Single CPU System
  • Saving/Restoring process state (PCB)
  • Scheduling routine
  • ccNUMA System
  • Saving/Restoring process state (PCB)
  • Scheduling routine
  • Moving processs information

19
Some Common Sense
  • Replicate parts of the OS across processors
  • System calls will happen often
  • Minimize process movement
  • Cost of moving a process to another CPU is high
  • Less than swaping to disk, most of the time
  • Higher than simple context switching
  • But if you have to move a process
  • Minimize the amount of information to move
  • Opportunity for a cache????

20
Conclusion
  • Hardware
  • Bandwidth and Latency for performance
  • Cache Coherency for correctness
  • Operating System
  • ccNUMA adds complexity in CPU scheduling
  • HW performance Lower Context Switch Penalty gt
    flexibility in scheduling choices for a process

21
References
  • Alpha
  • http//www.digital.com/alphaoem/present/ev7forum98
    .ppt
  • http//www.compaq.com/InnovateForum99/presentation
    /session31/
  • http//www.digital.com/alphaoem/
  • AMD
  • http//www.amd.com/products/cpg/mpf/speech/slides9
    9.ppt
  • SGI
  • http//www-europe.sgi.com/origin/numa_tech.html
  • BenchMarks
  • http//www.spec.org/
  • http//www.tpc.org/

22
Abbreviation Index
  • AMD - Advanced Micro Devices
  • SGI - Silicon Graphics Inc.
  • ECC - Error Correction Code
  • SECDED - Single Error Correct Double Error Detect
  • API - Alpha Processor Inc
  • AGP - Accelerated Graphics Port
  • DDR DRAM - Double Data Rate Dynamic RAM
  • LTD - Lightning Data Transport
  • PCI - Peripheral Component Interconnect
  • CMOS - Complementary Metal Oxide Semiconductor
  • CAS - Column Address Strobe
  • TPC-C -Transaction Processing Performance Council
    Benchmark
  • ccNUMA - Cache-Coherent Non-Uniform Memory Access
  • SMP - Symmetric Multi-Processing
Write a Comment
User Comments (0)
About PowerShow.com