CART, UTCS - PowerPoint PPT Presentation

About This Presentation

Title:

CART, UTCS

Slides: 22

Provided by: karthikeya9

Transcript and Presenter's Notes

Title: CART, UTCS

1
A Design Space Exploration of Grid Processor Architectures

Karu Sankaralingam, Ramadass Nagarajan, Doug Burger, and Stephen W. Keckler
Computer Architecture and Technology Laboratory
Department of Computer Sciences
The University of Texas at Austin

2
Technology and Architecture Trends

Good news
Lots of transistors, faster transistors
Bad news
Pipeline depth near optimal
Pipelining limits will slow clock rate improvements by half
Performance must come from more ILP
IPC has only doubled in one decade, despite considerable effort
Global wire delays are growing
At 35nm, less than 1% of die is 1-cycle-reachable
Goals for future architectures
Scalability with process technology improvements
Fast clock and high ILP

3
A New Approach

ALU chaining

Execution model eliminates
Majority of register reads
Associative issue windows
Rename tables
Global bypass
Partitions I-Cache and register file banks around ALUs
Statically map and dynamically issue

4
Outline

Grid Processor Architecture (GPA)
Block Compilation
Program Execution
Evaluation
Conclusions and Future Work

5
Grid Processor

[Figure: grid processor node. Labels: Inst, OP1, OP2, ALU, Router]

6
Block Compilation (1 of 3)

Intermediate Code
I1) add r1, r2, r3
I2) sub r7, r2, r1
I3) ld r4, (r1)
I4) add r5, r4, r4
I5) beqz r5, 0xdeac

Data flow graph
move r2, I1,I2
move r3, I1

[Figure: data flow graph over I1 to I5. Labels: Inputs, Temporaries, Outputs, r7]
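
The split into inputs, temporaries, and outputs can be sketched in a few lines. This is only an illustration, not the authors' tool: the tuple encoding of the block and the function name analyze are hypothetical, instructions are read as destination-first, and a register is treated as an output whenever it is written but never read inside the block (a real compiler would use liveness information instead).

```python
# Hypothetical encoding of the slide's block: (label, opcode, dest, sources).
BLOCK = [
    ("I1", "add",  "r1", ["r2", "r3"]),
    ("I2", "sub",  "r7", ["r2", "r1"]),
    ("I3", "ld",   "r4", ["r1"]),
    ("I4", "add",  "r5", ["r4", "r4"]),
    ("I5", "beqz", None, ["r5"]),
]


def analyze(block):
    """Derive register moves, temporaries, outputs, and data flow edges."""
    producer = {}     # register -> label of the instruction that defines it
    input_moves = {}  # block input register -> labels that consume it
    edges = []        # (producer label, consumer label, register)
    consumed = set()  # registers defined and then read inside the block
    for label, _opcode, dest, sources in block:
        for reg in dict.fromkeys(sources):  # drop duplicates, keep order
            if reg in producer:
                edges.append((producer[reg], label, reg))
                consumed.add(reg)
            else:
                input_moves.setdefault(reg, []).append(label)
        if dest is not None:
            producer[dest] = label
    temporaries = [r for r in producer if r in consumed]
    # Assumption: a value never read inside the block is live-out.
    outputs = [r for r in producer if r not in consumed]
    return input_moves, temporaries, outputs, edges


input_moves, temporaries, outputs, edges = analyze(BLOCK)
print(input_moves)   # r2 feeds I1 and I2, r3 feeds I1
print(temporaries)   # r1, r4, r5
print(outputs)       # r7
```

Under these assumptions the result agrees with the slide: r2 is moved to I1 and I2, r3 is moved to I1, and r7 is the only output.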

7
Block Compilation (2 of 3)

Mapping
move r2, (1,3), (2,2)
move r3, (1,3)

[Figure: the scheduler maps data flow graph nodes I1 to I5 (with register moves "move r2, I1,I2" and "move r3, I1") onto grid locations. Labels: (1,1), r7]
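
The slides do not say how the scheduler picks grid locations, so the following is a greedy stand-in rather than the actual algorithm. It assumes instructions arrive in topological order and places each one on the free node with the smallest total Manhattan distance to its producers. The names place, ORDER, and EDGES are hypothetical, coordinates are 0-based, and the result is not expected to reproduce the (1,3) and (2,2) locations shown on the slide.

```python
from itertools import product

# Data flow edges of the example block (producer, consumer).
EDGES = [("I1", "I2"), ("I1", "I3"), ("I3", "I4"), ("I4", "I5")]
ORDER = ["I1", "I2", "I3", "I4", "I5"]  # assumed topological order


def place(order, edges, rows=8, cols=8):
    """Greedily assign each instruction a free (row, col) grid node."""
    producers = {}
    for src, dst in edges:
        producers.setdefault(dst, []).append(src)
    free = list(product(range(rows), range(cols)))  # row-major order
    placement = {}
    for inst in order:
        sources = [placement[p] for p in producers.get(inst, [])]

        def cost(node):
            # Total Manhattan distance from this node to all producers.
            return sum(abs(node[0] - r) + abs(node[1] - c) for r, c in sources)

        best = min(free, key=cost)  # ties resolved in row-major order
        free.remove(best)
        placement[inst] = best
    return placement


placement = place(ORDER, EDGES)
for inst in ORDER:
    print(inst, placement[inst])
```

For this small block every consumer ends up one hop from its producer. A real scheduler would also have to weigh critical path length, contention, and distance to register and cache banks.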

8
Block Compilation (3 of 3)

Code generation
GPA code
I1) (1,3) add (1,-1), (1,0)

[Figure: callouts on the GPA instruction. Labels: Instruction location, Opcode, Targets]
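
One plausible reading of the GPA instruction is that (1,3) is the instruction location, add is the opcode, and (1,-1) and (1,0) are targets given as offsets relative to that location. The sketch below works under that assumption; resolve_targets and the dictionary layout are hypothetical names, not part of any GPA tool.

```python
def resolve_targets(location, targets):
    """Turn relative target offsets into absolute grid locations."""
    row, col = location
    return [(row + d_row, col + d_col) for d_row, d_col in targets]


# I1) (1,3) add (1,-1), (1,0), read as location, opcode, relative targets.
I1 = {"location": (1, 3), "opcode": "add", "targets": [(1, -1), (1, 0)]}

absolute_targets = resolve_targets(I1["location"], I1["targets"])
print(absolute_targets)  # [(2, 2), (2, 3)]
```

The first resolved target, (2,2), matches the location that receives r2 alongside (1,3) in "move r2, (1,3), (2,2)", which is consistent with I2 sitting at (2,2) and supports the relative-offset reading.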

9
Block Atomic Execution Model

A block of instructions is an atomic unit of fetch/schedule/execute/commit
Blocks expose critical path
Operand chains hidden from large structures
Instructions specify consumers as explicit targets
Blocks allow simple internal control flow
Single point of entry
If-conversion using predication
Predicated hyperblocks
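
To make "instructions specify consumers as explicit targets" concrete, here is a minimal dataflow-style interpreter for the example block. It is a sketch under stated assumptions: an instruction fires once all its operand slots are filled, each slot is written exactly once, operations are destination-first (so sub computes r2 - r1), and the names run_block, BLOCK, and INPUT_MOVES, along with the register and memory values, are invented for illustration.

```python
from collections import deque

# name -> (opcode, operand count, targets as (consumer name, operand slot))
BLOCK = {
    "I1": ("add", 2, [("I2", 1), ("I3", 0)]),
    "I2": ("sub", 2, []),                      # result is the block output r7
    "I3": ("ld", 1, [("I4", 0), ("I4", 1)]),
    "I4": ("add", 2, [("I5", 0)]),
    "I5": ("beqz", 1, []),                     # result is the branch decision
}
# Block inputs are delivered to consumers like the slide's "move" lines.
INPUT_MOVES = {"r2": [("I1", 0), ("I2", 0)], "r3": [("I1", 1)]}


def run_block(block, input_moves, regs, memory):
    """Fire each instruction when its operands arrive; forward to targets."""
    operands = {name: {} for name in block}
    fired, results = [], {}
    queue = deque()
    for reg, targets in input_moves.items():
        for target in targets:
            queue.append((target, regs[reg]))
    while queue:
        (name, slot), value = queue.popleft()
        operands[name][slot] = value
        opcode, arity, targets = block[name]
        if len(operands[name]) < arity:
            continue  # still waiting for an operand
        args = [operands[name][i] for i in range(arity)]
        if opcode == "add":
            result = args[0] + args[1]
        elif opcode == "sub":
            result = args[0] - args[1]
        elif opcode == "ld":
            result = memory[args[0]]
        else:  # beqz: True means the branch is taken
            result = args[0] == 0
        fired.append(name)
        results[name] = result
        for target in targets:
            queue.append((target, result))
    return fired, results


fired, results = run_block(BLOCK, INPUT_MOVES, {"r2": 5, "r3": 3}, {8: 0})
print(fired)          # ['I1', 'I2', 'I3', 'I4', 'I5']
print(results["I2"])  # -3, the value written to r7
print(results["I5"])  # True
```

No register file lookup or associative issue window appears inside the loop: values travel directly from producer to consumer, which is the property the slide highlights.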

10
Block Execution

[Figure: block execution on the grid. Labels: DCache bank 0, DCache bank 2, DCache bank 3, sub, Block termination Logic]

11
Block Execution

[Figure: same labels as slide 10]

12
Instruction Buffers - Frames

What if?
Blocks exceed grid size
Overlap fetch and map

13
Execution Opportunities

Serialized block fetch/map and execute
Overlapped instruction distribution and execution
Overlapped fetch/map
Next-block predictor
Block level squash on mis-prediction
Overlapped execution of blocks
Next-block predictor
Block level squash on mis-prediction
Block stitching using input/output register masks
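
The slides only name block stitching with input/output register masks, so the semantics below are an assumed interpretation: each block carries a bitmask of the registers it reads and a bitmask of the registers it writes, and when two blocks overlap in execution, any input of the younger block that the older block produces must be forwarded from it, while the rest can be read from the register file. The helper names mask, stitch, and registers_in are hypothetical.

```python
def mask(*registers):
    """Build a bitmask with one bit set per register number."""
    value = 0
    for reg in registers:
        value |= 1 << reg
    return value


def stitch(older_outputs, younger_inputs):
    """Split the younger block's inputs by where they must come from."""
    forwarded = younger_inputs & older_outputs       # wait for the older block
    from_regfile = younger_inputs & ~older_outputs   # safe to read right away
    return forwarded, from_regfile


def registers_in(value):
    """List the register numbers whose bits are set in the mask."""
    return [i for i in range(value.bit_length()) if (value >> i) & 1]


# Older block: the example block, which reads r2, r3 and writes r7.
older_outputs = mask(7)
# Younger block (invented for illustration): reads r2 and r7.
younger_inputs = mask(2, 7)

forwarded, from_regfile = stitch(older_outputs, younger_inputs)
print(registers_in(forwarded))     # [7]
print(registers_in(from_regfile))  # [2]
```

With masks, the dependence check between in-flight blocks reduces to a pair of bitwise operations, which is one reason such a scheme could stay cheap in hardware.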

14
Evaluation

3 SPECInt2000, 3 SPECFP2000, 3 Mediabench benchmarks
adpcm, dct, mpeg2encode
gcc, mcf, parser
ammp, art, equake
Compiled using the Trimaran toolset
Hyperblocks parsed and scheduled using custom tools
Event driven configurable timing simulator used for performance estimates

15
GPA Evaluation Parameters
GPA
8x8 grid
¼ cycle router, ¼ cycle wire delay
32 slots at every node

Superscalar
5 stage pipeline, 8-wide
0 cycle router and wire delay!
512 entry instruction window

Alpha 21264 functional unit latencies
L1 3 cycles, L2 13 cycles, Main memory 62 cycles
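
A quick worked calculation puts these parameters in perspective. The slot count follows directly from the numbers above. The communication delay function is only a sketch: it assumes the router and wire delays are charged once per hop and that operands travel along a Manhattan path, neither of which the slide states. The names total_slots and hop_delay are hypothetical.

```python
from fractions import Fraction

GRID_ROWS, GRID_COLS = 8, 8
SLOTS_PER_NODE = 32
ROUTER_DELAY = Fraction(1, 4)  # cycles, as listed for the GPA
WIRE_DELAY = Fraction(1, 4)    # cycles, as listed for the GPA
SUPERSCALAR_WINDOW = 512


def total_slots():
    """Instruction slots available across the whole grid."""
    return GRID_ROWS * GRID_COLS * SLOTS_PER_NODE


def hop_delay(src, dst):
    """Operand delay in cycles, assuming per-hop cost and Manhattan routing."""
    hops = abs(src[0] - dst[0]) + abs(src[1] - dst[1])
    return hops * (ROUTER_DELAY + WIRE_DELAY)


print(total_slots())                        # 2048
print(total_slots() // SUPERSCALAR_WINDOW)  # 4
print(hop_delay((1, 3), (2, 2)))            # 1
```

Under these assumptions the grid holds four times as many instructions as the superscalar window, and sending a result from (1,3) to (2,2) costs one cycle, compared with zero for the idealized superscalar baseline.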

16
GPA Performance Comparison

[Figure: performance comparison chart. Groups: SPECINT, SPECFP, Mediabench, Mean]

17
Sensitivity to Communication Delay

18
Conclusions

Technology trends
Enforce partitioning
Wire delays become first order constraint
GPA
Distributed execution engine with few central structures
Technology scalable, fast clock rate and high ILP
Challenges
Block control mechanisms
Distributed memory interface design
Optimizing predication mechanisms

19
Future Work

Alternate execution models
SMT support
Use frames to run different threads
Stream based execution
Loop re-use and data partitioning in caches
Scientific vector-based execution
Use rows as vector execution units
Vector loads read from caches
Hardware prototype

20
Related Work

Dataflow
Static dataflow architecture [Dennis and Misunas 1975]
Tagged-Token Dataflow [Arvind 1990]
Hybrid dataflow execution [Culler et al. 1991]
RAW architecture [Waingold et al. 1997]
Multiscalar Processors [Sohi et al. 1995]
Trace Processors [Vajapeyam 1997]
Clustered Speculative Multithreaded Processors [Marcuello and González 1999]
Levo [Uht et al. 2001]

21
Questions
