Flattened Butterfly Topology for On-Chip Networks - PowerPoint PPT Presentation

About This Presentation
Title:

Flattened Butterfly Topology for On-Chip Networks

Description:

Flattened Butterfly Topology for On-Chip Networks John Kim, James Balfour, and William J. Dally Presented by Jun Pang Motivation & Goal Most on-chip networks (2D mesh ... – PowerPoint PPT presentation

Number of Views:232
Avg rating:3.0/5.0
Slides: 10
Provided by: JunP2
Category:

less

Transcript and Presenter's Notes

Title: Flattened Butterfly Topology for On-Chip Networks


1
Flattened Butterfly Topology for On-Chip Networks
  • John Kim, James Balfour, and William J. Dally
  • Presented by Jun Pang

2
Motivation Goal
  • Most on-chip networks (2D mesh) low-radix
  • Pros simple short wires
  • Cons long network diameter energy inefficiency
    (many hops)
  • High-radix networks
  • Intermediate routers reduced a lot
  • Small latency lower power
  • Goal how does on-chip network use high-radix
    routers to reduce latency energy

3
On-chip network
  • Plentiful bandwidth due to inexpensive wires
    while buffers are expensive
  • lower cost from smaller distance
  • By reducing number of channels buffers
  • Concentration several terminal nodes share
    resources (routers)
  • Latency
  • Reduce hop count at the expense of TS?to get an
    overall reduced latency

4
On-chip Flattened Butterfly
Fig. 3a
  • Topology
  • Radix10(concentration factor4 3d1 3d2)
  • 2 hops
  • Longer wires-gt deeper buffers
  • Non-minimal global adaptive routing (UGAL)
  • Load balance performance path diversity
  • Routing minimally or non-minimally
  • Non-minimal minimal Direction-ordered routing
    (prevent deadlock)
  • Only 2 VCs

5
Bypass Channels Microarchitecture
  • Goal reduce distance traveled by packets to
    reduce latency and energy
  • Two types of muxes
  • Input muxes bypass inputs or direct inputs
  • Output muxes direct outputs or bypass inputs
  • Yield arbiter to guarantee global fairness
  • If primary input is idle, non-primary input is
    chosen
  • Control packet prevent starvation
  • Combination of minimal and non-minimal routing

6
Bypass Channels (continue)
  • Switch architecture
  • Minimal simplified crossbar switch
  • Non-minimal more complexity
  • Non-minimal with bypass channels less complexity
  • Flow control routing
  • Buffers for non-primary inputs
  • Separate buffers for destination of control
    packets
  • Modify UGAL to support bypass channels

7
Evaluation
  • Throughput up to 50 throughput increase
    compared to concentrated mesh
  • Power about 38 power reduction compared to mesh
  • Latency about 28 latency reduction compared to
    mesh

8
Scalability
  • Lower channel increasing factor than hypercube
  • Three ways to scale
  • Concentrate factor
  • Dimension of the flattened butterfly
  • Hybrid approach
  • Future technology helps long wires
  • Increasing VCs will slightly reduce latency

9
Conclusion Concerns
  • Flattened-butterfly
  • interesting idea
  • Maximum distance between nodes2
  • Non-minimal routing to balance load
  • Bypassing channel to reduce latency
  • Lower latency and power, high throughput compared
    to mesh
  • Concerns
  • High channel count? (bigger than mesh torus)
  • Low channel utilization? (due to high channel)
  • Control complexity? (arbitration, control
    packets)
  • Bypass channel good idea? (How about just use
    non-minimal or minimal?)
Write a Comment
User Comments (0)
About PowerShow.com