Title: ThreeDimensional Layout of OnChip TreeBased Networks
1Three-Dimensional Layout of On-Chip Tree-Based
Networks
- Hiroki Matsutani (Keio Univ, Japan)
- Michihiro Koibuchi (NII, Japan)
- D. Frank Hsu (Fordham Univ, USA)
- Hideharu Amano (Keio Univ, Japan)
2Outline
- Introduction
- Network-on-Chip (NoC)
- 2-D vs. 3-D
- Fat Tree
- 2-D layout
- 3-D layout
- Fat H-Tree
- 2-D layout
- 3-D layout
- Evaluations
- Area, Wire length, Energy
Matsutani, IPDPS07
3Network-on-Chip (NoC)
- Tile architectures
- MIT RAW
- Texas U. TRIPS
- Intel 80-tile NoC
- Various topologies
- Mesh, Torus
- Fat Trees
- Fat H-Tree (FHT)
Tile (core router)
Taylor, Micro02
Buger, Computer04
Vangal, ISSCC07
16-core Tile architecture
Packet switched network on a chip
We proposed FHT as an alternative to Fat Trees
Matsutani, IPDPS07
42D Topologies Mesh Torus
- 2-D Torus
- 2x bandwidth of mesh
RAW Taylor, IEEE Micro02
Router
Core
52D Topologies Fat Tree
p of upward links q of downward links c
of core ports
Rank-1
Rank-2
In this talk, we focus on 3-D layout scheme of
tree-based topologies
Fat Tree (2,4,2)
Fat Tree (2,4,1)
Router
Core
62D NoC vs. 3D NoC
- 2D NoCs
- Long wires (esp. trees)
- Wire delay
- Packets consume power at links according to their
wire length
- 3D NoCs
- Several small wafers or dices are stacked
- Vertical link
- Micro bump
- Through-wafer via
- Very short (10-50um)
Long horizontal wires in 2D NoCs can be replaced
by very short vertical links in 3D NoCs
Ezaki, ISSCC04
Burns, ISSCC01
Next slides show the 3D layout scheme of Fat Tree
and FHT
7Outline
- Introduction
- Network-on-Chip (NoC)
- 2-D vs. 3-D
- Fat Tree
- 2-D layout
- 3-D layout
- Fat H-Tree
- 2-D layout
- 3-D layout
- Evaluations
- Area, Wire length, Energy
Matsutani, IPDPS07
8Fat Tree 2-D layout
p of upward links q of downward links c
of core ports
Fat Tree (2,4,2)
Fat Tree (2,4,1)
Router
Core
We preliminarily show the 3D layout scheme of Fat
Trees
9Fat Tree 3-D layout (4-split)
transformation
Top-rank routers are distributed to each layer
Dividing into 4 layers
Original 2-D layout
10Fat Tree 3-D layout (4-split)
transformation
Top-rank links are replaced with vertical
interconnects (10-50um)
Layer-0
This 3-D layout is evaluated in terms of area,
wire, energy
3-D layout (4-stacked)
Original 2-D layout
11Outline
- Introduction
- Network-on-Chip (NoC)
- 2-D vs. 3-D
- Fat Tree
- 2-D layout
- 3-D layout
- Fat H-Tree
- 2-D layout
- 3-D layout
- Evaluations
- Area, Wire length, Power
Matsutani, IPDPS07
12Fat H-Tree Structure
- Fat H-Tree
- Red Tree (H-Tree)
- Black Tree (H-Tree)
Matsutani, IPDPS07
Combining two H-Trees (red black)
By shifting the location of black tree, the
connection pattern of trees is different from the
original Fat Trees
Router
Core
Router
Core
13Fat H-Tree Structure
- Fat H-Tree
- Red Tree (H-Tree)
- Black Tree (H-Tree)
Matsutani, IPDPS07
Combining two H-Trees (red black)
Fat H-Tree is formed on red black trees
Router
Core
Router
Core
14Fat H-Tree Structure
- Fat H-Tree
- Red Tree (H-Tree)
- Black Tree (H-Tree)
Matsutani, IPDPS07
Combining two H-Trees (red black)
Fat H-Tree is formed on red black trees
Router
Core
Router
Core
15Fat H-Tree Structure
- Fat H-Tree
- Red Tree (H-Tree)
- Black Tree (H-Tree)
Matsutani, IPDPS07
Combining two H-Trees (red black)
Fat H-Tree is formed on red black trees
Router
Core
Router
Core
16Fat H-Tree Structure
- Fat H-Tree
- Red Tree (H-Tree)
- Black Tree (H-Tree)
Matsutani, IPDPS07
Combining two H-Trees (red black)
Each core is connected to both red black trees
Rank-2 or upper routers are omitted in this figure
Router
Core
Router
Core
17Fat H-Tree 2-D layout on VLSI
- Fat H-Tree
- Torus structure
- ? Folded as well as the folded layout of 2-D Torus
Matsutani, IPDPS07
(Long feedback links across the chip)
Topologically equivalent
Fat H-Trees 2-D layout
The next slides propose the 3D layout scheme of
Fat H-Tree
Router
Core
18Fat H-Tree 3-D layout (overview)
- Fat H-Tree
- (Problem) Fat H-Tree has a torus structure
- Folding so as to keep the torus structure
consisting of red black trees
(step 1) fold it horizontally (step 2) fold it
vertically
Until the of folded pieces meets the of
layers the 3-D IC has
E.g., four layers ? fold twice
19Fat H-Tree 3-D layout (overview)
- Fat H-Tree
- (Problem) Fat H-Tree has a torus structure
- Folding so as to keep the torus structure
consisting of red black trees
(step 1) fold it horizontally (step 2) fold it
vertically
Until the of folded pieces meets the of
layers the 3-D IC has
E.g., four layers ? fold twice
20Fat H-Tree 3-D layout (overview)
- Fat H-Tree
- (Problem) Fat H-Tree has a torus structure
- Folding so as to keep the torus structure
consisting of red black trees
(step 1) fold it horizontally (step 2) fold it
vertically
Until the of folded pieces meets the of
layers the 3-D IC has
E.g., four layers ? fold twice
Here we show the 3D layouts of red black trees
separately
21Fat H-Tree 3-D (Red tree 4-split)
transformation
Original 2-D layout
22Fat H-Tree 3-D (Red tree 4-split)
transformation
Top-rank links are replaced with vertical
interconnects (10-50um)
Layer-0
3-D layout (4-stacked)
Original 2-D layout
23Fat H-Tree 3-D (Black tree4-split)
transformation
Original 2-D layout
24Fat H-Tree 3-D (Black tree4-split)
transformation
The periphery cores are connected to different
layers
Original 2-D layout
25Fat H-Tree 3-D (Black tree4-split)
transformation
Top-rank links are replaced with vertical
interconnects (10-50um)
Layer-0
The periphery cores are connected to different
layers
3-D layout (4-stacked)
Original 2-D layout
26Fat H-Tree 3-D layout (4-split)
Layer-0
Layer-0
Layer-0
Red tree (3-D)
Black tree (3-D)
Fat H-Tree (3-D)
The 3-D layout of Fat H-Tree can be formed
by superimposing 3-D layouts of red black
trees
27Outline
- Introduction
- Network-on-Chip (NoC)
- 2-D vs. 3-D
- Fat Tree
- 2-D layout
- 3-D layout
- Fat H-Tree
- 2-D layout
- 3-D layout
- Evaluations
- Area, Wire length, Energy
Matsutani, IPDPS07
28Evaluations 2-D vs. 3-D
- 3-D layout
- 16-core x 4-layer
- Vertical interconnects
L mm
L/2 mm
29Network logic area of routers
FT1 Fat tree(2,4,1) FT2 Fat tree(2,4,2) FHT
Fat H-Tree
- 3-D mesh/torus node degree 7
- Fat H-Tree node degree 5
- Fat Tree (2,4,2) node degree 6
of routers their ports in trees are less than
mesh/torus
30Network logic area 2-D vs. 3-D
- Wormhole router
- 1-flit 64-bit
- 3-stage pipeline
- Network interface
- FIFO buffer
- Packet forwarding (Fat H-Tree only)
- Inter-wafer via
- 1-10um square
- 100um per layer per 1-bit signal
- Network logic area
- Routers, NIs
- Inter-wafer vias
Arbiter
FIFO
Davis, DToC05
5x5 XBAR
FIFO
2
Typical wormhole router
Matsutani, ASPDAC08
Inter-wafer via area is calculated according to
of vertical links
Synthesized with a 90nm CMOS
31Network logic area Overhead of 3D
Synthesis result of 64-core (16-core x 4)
3D layout of trees ? area overheat is modest (at
most 7.8)
FT1 Fat Tree(2,4,1) FT2 Fat Tree(2,4,2)
FHT Fat H-Tree
32Total wire length of all links
- Total unit-length of links
- Core router
- Router router
How many unit-links is required ?
1-unit distance between neighboring cores
1-unit link
1-unit link
33Total wire length of all links
1-unit
FT1 Fat Tree(2,4,1) FT2 Fat Tree(2,4,2)
FHT Fat H-Tree
34Total wire length of all links
1-unit
1-unit
4-stacked
Wire length of trees is reduced by 25-50 (close
to torus)
FT1 Fat Tree(2,4,1) FT2 Fat Tree(2,4,2)
FHT Fat H-Tree
35Energy NoCs energy model
- Ave. flit energy
- Send 1-flit to dest.
- How much energyJ ?
- Parameters
- 8mm square chip
- 64-core (16-core x 4)
- 90nm CMOS
- Switching energy
- 1-bit switching _at_ Router
- Gate-level sim
- 0.183 pJ / hop
- Link energy
- 1-bit transfer _at_ Link
- 0.150 pJ / mm
- Via energy
- 4.34 fF / via
8mm
Davis, DToC05
36Energy Reduction by going 3D
2-D layout
Frequent use of longest links
Short hop count ? less energy
FT1 Fat tree(2,4,1) FT2 Fat tree(2,4,2) FHT
Fat H-Tree
37Energy Reduction by going 3D
2-D layout
3-D layout
Moving distance of packets is reduced
FT1 Fat tree(2,4,1) FT2 Fat tree(2,4,2) FHT
Fat H-Tree
The 3D layout of trees reduces the energy by
30.8-42.9
38Summary 3-D layout of trees
- Drawbacks of on-chip tree-based topologies
- Long links around the root of tree
- Wire delay problem
- Repeater insertion ? additional energy
consumption - 3-D layout schemes of Fat Trees Fat H-Tree
- Wire length is reduced by 25-50
- Area overhead is at most 7.8
- Flit transmission energy is reduced by
30.8-42.9
In addition, energy-hungry repeater buffers can
be removed
Need to consider negative impacts of 3-D
(cost,heat,yield)
39Thank you for your attention
40Backup slides
41Energy Reduction by going 3D
2-D layout (w/o repeaters)
2-D layout (with repeaters)
()
Energy is increased
FT1 Fat tree(2,4,1) FT2 Fat tree(2,4,2) FHT
Fat H-Tree
() Repeater insertion model N. Weste et.al,
CMOS VLSI Design (3rd ed), 2005.