Title: Parallel and Distributed Programming Models and Languages
1. Parallel and Distributed Programming Models and Languages
- 15-740/18-740 Computer Architecture
- In-Class Discussion
- Dong Zhou
- Kun Li
- Mike Ralph
2. Why distributed computations?
- Buzzword: "Big Data"
- Take sorting as an example
- Amount of data that can be sorted in 60 seconds
- One computer can read 60 MB/sec from one disk, i.e. only ~3.6 GB in 60 seconds
- 2012 world record
- Flat Datacenter Storage by Ed Nightingale et al.
- 1470 GB
- 256 heterogeneous nodes, 1033 disks
- Google indexes 100 billion web pages
3. Solution: use many nodes
- Grid computing
- Hundreds of supercomputers connected by high-speed networks
- Cluster computing
- Thousands or tens of thousands of PCs connected by high-speed LANs
- 1000 nodes potentially give 1000x speedup
4. Distributed computations are difficult to program
- Sending data to/from nodes
- Coordinating among nodes
- Recovering from node failure
- Optimizing for locality
- Debugging
5. MapReduce
- A programming model for large-scale computations
- Process large amounts of input, produce output
- No side-effects or persistent state
- MapReduce is implemented as a runtime library
- Automatic parallelization
- Load balancing
- Locality optimization
- Handling of machine failures
6. MapReduce design
- Input data is partitioned into M splits
- Map: extract information from each split
- Each map produces R partitions
- Shuffle and sort
- Bring the M pieces of each partition to the same reducer
- Reduce: aggregate, summarize, filter, or transform
- Output is in R result files
7. More specifically
- Programmer specifies two methods
- map(k, v) → <k', v'>
- reduce(k', <v'>) → <k'', v''>
- All v' with same k' are reduced together
- Usually also specify
- partition(k', total partitions) → partition for k'
- often a simple hash of the key
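A minimal Python sketch of such a partition function, assuming string keys (crc32 is chosen here only because it is deterministic across runs; real implementations use their own hash):

```python
import zlib

def partition(key: str, total_partitions: int) -> int:
    # Hash the intermediate key k' and take it modulo R, so every
    # occurrence of the same key lands in the same reduce partition.
    return zlib.crc32(key.encode("utf-8")) % total_partitions
```

Because the mapping depends only on the key, all values for a given k' are guaranteed to reach the same reducer.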
8. Runtime
9. MapReduce is widely applicable
- Distributed grep
- Distributed clustering
- Web link graph reversal
- Detecting approx. duplicate web pages
10. Dryad
- Similar goals as MapReduce
- Focus on throughput, not latency
- Automatic management of scheduling, distribution, and fault tolerance
- Computations expressed as a graph
- Vertices are computations
- Edges are communication channels
- Each vertex has several input and output edges
11. Why use a dataflow graph?
- Many programs can be represented as a distributed dataflow graph
- The programmer may not have to know this
- SQL-like queries (LINQ)
- Dryad will run them for you
12. Runtime
- Vertices (V) run arbitrary app code
- Vertices exchange data through
- files, TCP pipes, etc.
- Vertices report status to the Job Manager (JM)
- Daemon process (D)
- executes vertices
- Job Manager (JM) consults the name server (NS)
- to discover available machines
- JM maintains the job graph and schedules vertices
13. Job Directed Acyclic Graph
[Figure: job DAG with inputs feeding processing vertices, connected by channels (file, pipe, shared memory), producing outputs]
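A minimal single-process sketch (not Dryad's actual runtime) of what executing such a job DAG means: vertices run arbitrary code, edges are the channels between them, and a vertex runs once all of its inputs are available.

```python
from collections import defaultdict, deque

def run_dag(vertices, edges, source_data):
    """vertices: {name: fn(list_of_input_values) -> value} (None for sources)
    edges: list of (src, dst) channels
    source_data: {name: value} for the input vertices"""
    preds, succs, indeg = defaultdict(list), defaultdict(list), defaultdict(int)
    for src, dst in edges:
        preds[dst].append(src)
        succs[src].append(dst)
        indeg[dst] += 1
    values = dict(source_data)
    ready = deque(v for v in vertices if indeg[v] == 0)  # topological order
    while ready:
        v = ready.popleft()
        if v not in values:  # source vertices already carry their data
            values[v] = vertices[v]([values[p] for p in preds[v]])
        for s in succs[v]:
            indeg[s] -= 1
            if indeg[s] == 0:
                ready.append(s)
    return values

# Tiny job: source "a" fans out to "b" and "c", which join at "d".
verts = {"a": None,
         "b": lambda xs: xs[0] * 2,
         "c": lambda xs: xs[0] + 1,
         "d": lambda xs: sum(xs)}
result = run_dag(verts, [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")],
                 {"a": 3})
# result["d"] == 10  (3*2 + 3+1)
```

In the real system the Job Manager does this scheduling across machines, and the channels carry data between processes rather than in-memory values.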
14. Advantages of DAG over MapReduce
- Big jobs are more efficient with Dryad
- MapReduce: big jobs run as > 1 MR stages
- Reducers of each stage write to replicated storage
- Output of reduce: 2 network copies, 3 disks
- Dryad: each job is represented with a DAG
- Intermediate vertices write to local files
15. Pig Latin
- High-level procedural abstraction of MapReduce
- Contains SQL-like primitives
- Example
- good_urls = FILTER urls BY pagerank > 0.2;
- groups = GROUP good_urls BY category;
- big_groups = FILTER groups BY COUNT(good_urls) > 10^6;
- output = FOREACH big_groups GENERATE category, AVG(good_urls.pagerank);
- Plus user-defined functions (UDFs)
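A rough single-process Python equivalent of that dataflow, using hypothetical (url, category, pagerank) tuples and a toy group-size threshold of 1 instead of 10^6; in Pig, each of these steps compiles into MapReduce jobs:

```python
from collections import defaultdict

urls = [("a.com", "news", 0.5), ("b.com", "news", 0.3),
        ("c.com", "sports", 0.1)]

# FILTER urls BY pagerank > 0.2
good_urls = [u for u in urls if u[2] > 0.2]

# GROUP good_urls BY category
groups = defaultdict(list)
for u in good_urls:
    groups[u[1]].append(u)

# FILTER groups BY COUNT(good_urls) > ...  (toy threshold of 1)
big_groups = {c: us for c, us in groups.items() if len(us) > 1}

# FOREACH big_groups GENERATE category, AVG(good_urls.pagerank)
output = {c: sum(u[2] for u in us) / len(us) for c, us in big_groups.items()}
# output maps "news" to the average pagerank ~0.4
```

The point of Pig Latin is that the programmer writes the four relational steps and never sees the map and reduce functions they compile to.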
16. Value
- Reduces development time
- Procedural vs. declarative
- Overhead/performance costs worthwhile?
17. Green-Marl
- High-level graph analysis language/compiler
- Uses basic data types and graph primitives
- Built-in graph functions
- BFS, RBFS, DFS
- Uses domain-specific optimizations
- Both non-architecture- and architecture-specific
- Compiler translates Green-Marl to other high-level languages (e.g. C)
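Green-Marl builds traversals such as BFS directly into the language; to make the built-in concrete, here is a plain Python sketch of what a BFS primitive computes (the hop distance of every node from a root), not Green-Marl syntax:

```python
from collections import deque

def bfs_levels(adj, root):
    """adj: {node: [neighbors]}; returns {node: hops_from_root}."""
    level = {root: 0}
    queue = deque([root])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, []):
            if v not in level:  # visit each node exactly once
                level[v] = level[u] + 1
                queue.append(v)
    return level

# bfs_levels({0: [1, 2], 1: [3], 2: [3]}, 0) == {0: 0, 1: 1, 2: 1, 3: 2}
```

Because the traversal is a language primitive rather than user code, the Green-Marl compiler is free to parallelize it and apply the domain-specific optimizations listed above.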
18. Tradeoffs
- Achieves speedup over hand-tuned parallel equivalents
- Tested only on a single workstation
- Only works with graph representations
- Difficulty representing certain data sets and computations
- Domain-specific vs. general-purpose languages
- Future work for more architectures, user-defined data structures
19. Questions and Discussion
20. Example: count word frequencies in web pages
- Input is files with one doc per record
- Map parses document into words
- key: document URL
- value: document contents
- Input record: ("doc1", "to be or not to be")
- Output of map: ("to", "1"), ("be", "1"), ("or", "1"), ("not", "1"), ("to", "1"), ("be", "1")
21. Example: count word frequencies in web pages
- Reduce computes the sum for each key
- Input of reduce (grouped by key) → output:
- key "be": values "1", "1" → "2"
- key "not": values "1" → "1"
- key "or": values "1" → "1"
- key "to": values "1", "1" → "2"
- Output of reduce saved: ("to", "2"), ("be", "2"), ("or", "1"), ("not", "1")
22. Example: Pseudo-code
- Map(String input_key, String input_value)
- // input_key: document name
- // input_value: document contents
- for each word w in input_value:
- EmitIntermediate(w, "1")
- Reduce(String key, Iterator intermediate_values)
- // key: a word, same for input and output
- // intermediate_values: a list of counts
- int result = 0
- for each v in intermediate_values:
- result += ParseInt(v)
- Emit(AsString(result))
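The pseudocode above can be made runnable; a small Python sketch that simulates the three phases (map, shuffle/group-by-key, reduce) in a single process:

```python
from collections import defaultdict

def map_fn(input_key, input_value):
    # input_key: document name; input_value: document contents
    return [(w, 1) for w in input_value.split()]

def reduce_fn(key, intermediate_values):
    # key: a word; intermediate_values: the list of counts for that word
    return sum(intermediate_values)

def mapreduce(docs):
    grouped = defaultdict(list)
    for k, v in docs.items():
        for k2, v2 in map_fn(k, v):   # map phase
            grouped[k2].append(v2)    # shuffle: group values by intermediate key
    return {k: reduce_fn(k, vs) for k, vs in grouped.items()}  # reduce phase

counts = mapreduce({"doc1": "to be or not to be"})
# counts == {"to": 2, "be": 2, "or": 1, "not": 1}
```

The real runtime runs map and reduce on many machines and writes the R result files to disk, but the data movement is exactly this group-by-key.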