Indirect Adaptive Routing on Large Scale Interconnection Networks - PowerPoint PPT Presentation

About This Presentation
Title:

Indirect Adaptive Routing on Large Scale Interconnection Networks

Description:

Korean Advanced Institute of Science and Technology. Overview. Indirect adaptive routing (IAR) ... Each global channel track the number of incoming MIN packets ... – PowerPoint PPT presentation

Number of Views:50
Avg rating:3.0/5.0
Slides: 28
Provided by: cvaSta
Learn more at: http://cva.stanford.edu
Category:

less

Transcript and Presenter's Notes

Title: Indirect Adaptive Routing on Large Scale Interconnection Networks


1
Indirect Adaptive Routing on Large Scale
Interconnection Networks
Nan Jiang, William J. Dally Computer System
Laboratory Stanford University
John Kim Korean Advanced Institute of Science
and Technology
2
Overview
  • Indirect adaptive routing (IAR)
  • Allow adaptive routing decision to be based on
    local and remote congestion information
  • Main contributions
  • Three new IAR algorithms for large scale networks
  • Steady state and transient performance
    evaluations
  • Impact of network configurations
  • Cost of implementation

3
Presentation Outline
  • Background
  • The dragonfly network
  • Adaptive routing
  • Indirect adaptive routing algorithms
  • Performance results
  • Implementation considerations

4
The Dragonfly Network
  • High Radix Network
  • High radix routers
  • Small network diameter
  • Each router
  • Three types of channels
  • Directly connected to a few other groups
  • Each group
  • Organized by a local network
  • Large number of global channels (GC)
  • Large network with a global diameter of one

5
Routing on the Dragonfly
  • Minimal Routing (MIN)
  • Source local network
  • Global network
  • Destination local network
  • Some Adversarial traffic congests the global
    channels
  • Each group i sends all packets to group i1
  • Oblivious solution Valiants Algorithm (VAL)
  • Poor performance on benign traffic


Group
1
Group
0
Group
2
p
0



Router
0
Router
1
Router
2
p
1

6
Adaptive Routing
  • Choose between the MIN path and a VAL path at the
    packet source Singh'05
  • Decision metric path delay
  • Delay product of path distance and path queue
    depth
  • Measuring path queue length is unrealistic
  • Use local queues length to approximate path
  • Require stiff backpressure

MIN
VAL
GC
GC
q
2
q
3
Source
Router
7
Adaptive Routing Worst Case Traffic
450

400
350
300
Packet Latency (Simulation cycles)
250
200
Valiants
150
Minimal
Adaptive
100

0
0.1
0.2
0.3
0.4
0.5
Throughput (Flit Injection Rate)
8
Indirect Adaptive Routing
  • Improve routing decision through remote
    congestion information
  • Previous method
  • Credit round trip Kim et. al ISCA08
  • Three new methods
  • Reservation
  • Piggyback
  • Progressive

9
Credit Round Trip (CRT)
  • Delay the return of local credits to the
    congested router
  • Creates the illusion of stiffer backpressure
  • Drawbacks
  • Remote congestion is still inferred through local
    queues
  • Information not up to date

MIN
VAL
GC
GC
Source
Router
Kim et. al ISCA08
10
Reservation (RES)
  • Each global channel track the number of incoming
    MIN packets
  • Injected packets creates a reservation flit
  • Routing decision based on the reservation outcome
  • Drawbacks
  • Reservation flit flooding
  • Reservation delay

MIN
VAL
GC
GC
Congestion
Source
Router
11
Piggyback (PB)
  • Local congestion broadcast
  • Piggybacking on each packet
  • Send on idle channels
  • Congestion data compression
  • Drawbacks
  • Consumes extra bandwidth
  • Congestion information not up to date
  • (broadcast delay)

MIN
VAL
GC
GC
Congestion
Source
Router
12
Progressive (PAR)
  • MIN routing decisions at the source are not final
  • VAL decisions are final
  • Switch to VAL when encountering congestion
  • Draw backs
  • Need an additional virtual channel to avoid
    deadlock
  • Add extra hops

MIN
VAL
GC
GC
Congestion
Source
Router
13
Experimental Setup
  • Fully connected local and global networks
  • 33 groups
  • 1,056 nodes
  • 10 cycle local channel latency
  • 100 cycle global channel latency
  • 10-flit packets

14
Steady State Traffic Uniform Random
300

Piggyback
280
Credit Round Trip
Progressive
260
Reservation
Minimal
240
220
Packet Latency (Simulation cycles)
200
180
160
140
120
100

0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Throughput (Flit Injection Rate)
15
Steady State Traffic Worst Case
450

Piggyback
Credit Round Trip
400
Progressive
Reservation
Valiants
350
300
Packet Latency (Simulation cycles)
250
200
150
100

0
0.1
0.2
0.3
0.4
0.5
Throughput (Flit Injection Rate)
16
Transient Traffic Uniform Random to Worst Case
Average Packet Latency per Cycle - UR to WC
500

400
Packet Latency
300
200
100

0
20
40
60
80
100
Cycles After Transition
Packets Routing Non-minimally per Cycle - UR to
WC
100

of Packets Routing Nonminimally
50
0

0
20
40
60
80
100
Cycles After Transition
17
Network Configuration Considerations
  • Packet size
  • RES requires long packets to amortize reservation
    flit cost
  • Routing decision is done on per packet basis
  • Channel latency
  • Affects information delay (CRT, PB)
  • Affects packet delay (PAR, RES)
  • Network size
  • Affects information bandwidth overhead (RES, PB)
  • Global diameter greater than one
  • Need to exchange congestion information on the
    global network

18
Cost Considerations
  • Credit round trip
  • Credit delay tracker for every local channel
  • Reservation
  • Reservation counter for every global channel
  • Additional buffering at the injection port to
    store packets waiting for reservation
  • Piggyback
  • Global channel lookup table for every router
  • Increase in packet size
  • Progressive
  • Extra virtual channel for deadlock avoidance

19
Conclusion
  • Three new indirect adaptive routing algorithms
    for large scale networks
  • Performance and design evaluation of the
    algorithms
  • Best Algorithm?
  • Piggyback performed the best under steady state
    traffic
  • Progressive responded fastest to transient
    changes
  • Network configurations will affect some algorithm
    performance
  • Cost of implementation

20
Thank You!
  • Questions?

21
Adaptive Routing Uniform Traffic
22
Transient Traffic Worst Case to Uniform Random
23
Transient Traffic Worst Case 1 to Worst Case 10
24
1000 Random Permutation Traffic
25
Effect of Packet size on RES Worst Case Traffic
26
Large local network Uniform Random
400

350
300
250
200
Packet Latency - Simulation cycles
150
PB
100
CRT
MIN
50
PAR
RES
0

0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Throughput - Flit Injection Rate
27
Large local network Worst Case
Write a Comment
User Comments (0)
About PowerShow.com