Parallel Database System

About This Presentation

Title:

Parallel Database System

Description:

Parallel Database System. David Dewitt & Jim Cray. presented by Ming Hao. Why parallel database ... relation as input and output a new relation. 3. Indicate the ... – PowerPoint PPT presentation

Number of Views:23

Avg rating:3.0/5.0

Slides: 17

Provided by: donat164

Learn more at: http://www.cs.cornell.edu

Category:

more less

Transcript and Presenter's Notes

Title: Parallel Database System

1
Parallel Database System

David Dewitt Jim Cray

presented by Ming Hao
2
Why parallel database

dominance of Relational data model
1. Large uniform data record
2. Query can be decomposed into a bunch of
relational operators. Each operator
takes a
relation as input and output a new
relation
3. Indicate the built-in parallelism

3
1. pipelined parallelism streaming output of
one operator into the input of another
operator 2. partitioned parallelism
partitioned data and execution 3.
Inter-query parallelism OLTP
4
Hardware support available

High speed network
message passing based client-server operating
system
cheap and powerful PC/Workstation

5
Hardware architecture
1. Shared memory a. can not scale up to
lots of disks and processors network
bandwidth b. interference between
processors private cache does not solve
the problem
6
Hardware architecture
2. Shared disks a. same scale problem as
sharedM b. interference when updating data
7
Hardware architecture
3. Shared nothing a. linear scale up and
speedup b. less interference
c. exploiting commodity processors and
memory
8
Parallelism metrics