1
Lecture 1: Introduction and Memory Systems
  • CS 7810 course organization:
  • 7 lectures on memory systems
  • 3 lectures on cache coherence and consistency
  • 2 lectures on transactional memory
  • 2 lectures on interconnection networks
  • 2 lectures on caches
  • 3 lectures on core design
  • 1 lecture on parallel algorithms
  • 3 lectures of student paper presentations
  • 2 lectures of student project presentations

2
Logistics
  • Reference texts:
  • Parallel Computer Architecture, Culler, Singh, Gupta (a more recent reference is Fundamentals of Parallel Computer Architecture, Yan Solihin)
  • Principles and Practices of Interconnection Networks, Dally and Towles
  • Introduction to Parallel Algorithms and Architectures, Leighton
  • Memory Systems: Cache, DRAM, Disk, Jacob et al.
  • A number of books in the Morgan & Claypool Synthesis Lectures series

3
More Logistics
  • Projects: simulation-based, creative, teams of up to 4 students; be prepared to spend time towards the middle and end of the semester; more details in a few weeks
  • Final project report due in late April (will undergo conference-style peer reviewing); also watch out for workshop deadlines for ISCA
  • One assignment on memory scheduling due in early Feb
  • Grading:
  • 50% project
  • 20% assignment
  • 10% paper presentation
  • 20% take-home final

4
DRAM Main Memory
  • Main memory is stored in DRAM cells, which have much higher storage density than SRAM
  • DRAM cells lose their state over time and must be refreshed periodically, hence the name Dynamic (see the sketch after this list)
  • DRAM access suffers from long access time and high energy overhead
  • Since the pins on a processor chip are not expected to increase much, we will hit a memory bandwidth wall
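As a rough illustration of the refresh burden (a sketch using typical DDRx numbers, which are assumptions rather than figures from this lecture):

    #include <stdio.h>

    /* Assumed, typical DDRx parameters: cells retain data for ~64 ms,
     * and covering a device takes 8192 refresh commands per retention
     * window, so the controller issues one roughly every 7.8 us. */
    int main(void) {
        double retention_ms  = 64.0;
        int refresh_commands = 8192;
        double interval_us   = retention_ms * 1000.0 / refresh_commands;
        printf("one refresh command every %.2f us\n", interval_us);
        return 0;
    }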

5
Memory Architecture
[Figure: processor with on-chip memory controller, connected by address/cmd and data buses to a DIMM; each bank in the DIMM has a row buffer]
  • DIMM: a PCB with DRAM chips on the back and front
  • Rank: a collection of DRAM chips that work together to respond to a request and keep the data bus full
  • A 64-bit data bus will need 8 x8 DRAM chips, or 4 x16 DRAM chips, or ...
  • Bank: a subset of a rank that is busy during one request
  • Row buffer: the last row (say, 8 KB) read from a bank; acts like a cache (an address-mapping sketch follows this list)
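As a concrete sketch of where these levels might sit in a physical address (the field widths below are illustrative assumptions, not a real controller's mapping):

    #include <stdio.h>

    /* Illustrative address decomposition: low bits select the byte on
     * the 64-bit data bus, then column, bank, and rank bits; whatever
     * remains is the row. Real controllers permute these fields. */
    int main(void) {
        unsigned long addr = 0x12345678;
        unsigned long byte = addr & 0x7;           /* 8B data bus        */
        unsigned long col  = (addr >> 3)  & 0x3FF; /* 10 column bits     */
        unsigned long bank = (addr >> 13) & 0x7;   /* 8 banks per rank   */
        unsigned long rank = (addr >> 16) & 0x1;   /* 2 ranks            */
        unsigned long row  = addr >> 17;           /* remaining row bits */
        printf("row=%lu rank=%lu bank=%lu col=%lu byte=%lu\n",
               row, rank, bank, col, byte);
        return 0;
    }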

6
DRAM Array Access
  • 16Mb DRAM array = 4096 x 4096 array of bits (the RAS/CAS address split is sketched after this list)
  • 12 row address bits arrive first (Row Access Strobe, RAS)
  • 4096 bits are read out into the row buffer
  • 12 column address bits arrive next (Column Access Strobe, CAS); the column decoder selects the requested bits from the row buffer
  • Eight bits are returned to the CPU, one per cycle
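A minimal sketch of that RAS/CAS split for the 4096 x 4096 array above (the example address is arbitrary):

    #include <stdio.h>

    /* 16Mb array = 4096 x 4096 bits, so a cell address has 24 bits:
     * 12 row bits arrive with RAS and read out an entire 4096-bit row;
     * 12 column bits arrive with CAS and pick bits from the row buffer. */
    int main(void) {
        unsigned cell = 0xABCDE;              /* some 24-bit cell address */
        unsigned row  = (cell >> 12) & 0xFFF; /* sent first (RAS) */
        unsigned col  = cell & 0xFFF;         /* sent next (CAS)  */
        printf("RAS row %u, CAS column %u\n", row, col);
        return 0;
    }
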
7
Salient Points I
  • DIMM, rank, bank, array → these form a hierarchy in the storage organization
  • Because of electrical constraints, only a few DIMMs can be attached to a bus
  • Ranks help increase the capacity on a DIMM
  • Multiple DRAM chips are used for every access to improve data transfer bandwidth
  • Multiple banks are provided so we can be simultaneously working on different requests

8
Salient Points II
  • To maximize density, arrays within a bank are made large → rows are wide → row buffers are wide (an 8KB read for a 64B request)
  • Each array provides a single bit to the output pin in a cycle (for high density and because there are few pins)
  • DRAM chips are described as xN, where N refers to the number of output pins; one rank may be composed of eight x8 DRAM chips (the data bus is 64 bits)
  • The memory controller schedules memory accesses to maximize row buffer hit rates and bank/rank parallelism (one such policy is sketched below)
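A standard policy with exactly this goal is first-ready, first-come-first-served (FR-FCFS); the slides do not name a policy, so this is a minimal sketch with an illustrative request struct:

    /* FR-FCFS sketch: prefer the oldest request that hits the bank's
     * currently open row (a row buffer hit); otherwise fall back to
     * the oldest request overall. Returns an index into queue[],
     * or -1 if the queue is empty. */
    typedef struct {
        unsigned long row;     /* row this request targets      */
        unsigned long arrival; /* arrival time (FCFS tie-break) */
    } Request;

    int pick_next(const Request *queue, int n, unsigned long open_row) {
        int oldest = -1, oldest_hit = -1;
        for (int i = 0; i < n; i++) {
            if (oldest < 0 || queue[i].arrival < queue[oldest].arrival)
                oldest = i;
            if (queue[i].row == open_row &&
                (oldest_hit < 0 ||
                 queue[i].arrival < queue[oldest_hit].arrival))
                oldest_hit = i;
        }
        return oldest_hit >= 0 ? oldest_hit : oldest;
    }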

9
Salient Points III
  • Banks and ranks offer memory parallelism
  • Row buffers act as a cache within DRAM; the resulting latencies are modeled in the sketch after this list
  • Row buffer hit: 20 ns access time (must only move data from the row buffer to the pins)
  • Empty row buffer access: 40 ns (must first read the arrays, then move data from the row buffer to the pins)
  • Row buffer conflict: 60 ns (must first write back the existing row, then read the new row, then move data to the pins)
  • In addition, requests must wait in the queue (tens of nanoseconds) and incur address/cmd/data transfer delays (10 ns)
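Those three cases fit a tiny per-bank state machine; a sketch using the latencies quoted above (queuing and transfer delays are left out):

    /* Per-bank row buffer model: access latency depends on whether
     * the requested row is already open in the bank's row buffer. */
    typedef struct { long open_row; int valid; } Bank;

    int access_ns(Bank *b, long row) {
        if (b->valid && b->open_row == row)
            return 20;      /* hit: just move row buffer data to pins */
        int ns = b->valid
                   ? 60     /* conflict: write back old row, read new */
                   : 40;    /* empty: read arrays, then move to pins  */
        b->open_row = row;
        b->valid = 1;
        return ns;
    }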

10
Technology Trends
  • Improvements in technology (smaller devices) → DRAM capacities double every two years, but latency does not change much
  • Power wall: 25-40% of datacenter power can be attributed to the DRAM system
  • Will soon hit a density wall; DRAM may have to be replaced by other technologies (phase change memory, STT-RAM)
  • The pins on a chip are not increasing → bandwidth limitations

11
Power Wall
  • Many contributors to memory power (see the Micron power calculator):
  • Overfetch
  • Channel
  • Buffer chips and SerDes
  • Background power (output drivers)
  • Leakage and refresh

12
Power Wall
  • Memory system contribution (see the HP power advisor)

[Figure: IBM data, from WETI 2012 talk by P. Bose]
13
Overfetch
  • Overfetch is caused by multiple factors (quantified in the sketch after this list):
  • Each array is large (fewer peripherals → more density)
  • Involving more chips per access → more data transfer pin bandwidth
  • More overfetch → more prefetch, which helps apps with locality
  • Involving more chips per access → less data lost when a chip fails → lower overhead for reliability
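To put a number on it, using the 8KB row and 64B request sizes from the earlier slides:

    #include <stdio.h>

    int main(void) {
        int row_bytes  = 8192; /* one activation reads a full 8KB row */
        int line_bytes = 64;   /* the CPU asked for one cache line    */
        printf("overfetch: %dx\n", row_bytes / line_bytes); /* 128x */
        return 0;
    }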

14
Re-Designing Arrays (Udipi et al., ISCA'10)
15
Selective Bitline Activation
  • Additional logic per array so that only the relevant bitlines are read out
  • Essentially results in finer-grain partitioning of the DRAM arrays
  • Two papers in 2010: Udipi et al., ISCA'10, and Cooper-Balis and Jacob, IEEE Micro

16
Rank Subsetting
  • Instead of using all chips in a rank to read out 64-bit words every cycle, form smaller parallel ranks
  • Increases data transfer time and reduces the size of the row buffer (see the sketch after this list)
  • But: lower energy per row read, and compatible with modern DRAM chips
  • Increases the number of banks and hence promotes parallelism (reduces queuing delays)
  • Initial ideas proposed in Mini-Rank (MICRO 2008) and MC-DIMM (CAL 2008 and SC 2009)
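A back-of-the-envelope sketch of the tradeoff, assuming x8 chips and a 64B cache line (illustrative numbers, not taken from the papers):

    #include <stdio.h>

    /* With fewer chips per subset, each chip supplies more of the 64B
     * line, so the transfer takes more bus beats; in exchange, only a
     * fraction of the original row is activated per access. */
    int main(void) {
        int line_bits = 64 * 8;
        for (int chips = 8; chips >= 2; chips /= 2) {
            int bus_bits = chips * 8;            /* x8 chips           */
            int beats    = line_bits / bus_bits; /* data transfer time */
            printf("%d chips: %2d-bit bus, %2d beats, 1/%d of the row\n",
                   chips, bus_bits, beats, 8 / chips);
        }
        return 0;
    }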

17
DRAM Variants: LPDRAM and RLDRAM
  • LPDDR (low power) and RLDRAM (low latency)

[Figure: data from Chatterjee et al., MICRO 2012]
18
LPDRAM
  • Low-power devices operating at lower voltages and currents
  • Efficient low-power modes, fast exit from low-power mode
  • Lower bus frequencies
  • Typically used in mobile systems (not in DIMMs)

19
Heterogeneous Memory (Chatterjee et al., MICRO 2012)
  • Implement a few DIMMs/channels with LPDRAM and a few DIMMs/channels with RLDRAM
  • Fetch critical data from RLDRAM and non-critical data from LPDRAM
  • Multiple ways to classify data as critical or not:
  • identify hot (frequently accessed) pages
  • the first word of a cache line is often critical
  • Every cache line request is broken into two requests (sketched after this list)
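A sketch of that split, with an illustrative request struct (the field names and routing are assumptions, not the paper's interface):

    #include <stdint.h>

    /* Hypothetical split of one 64B line fill into two requests:
     * the critical word goes to the low-latency RLDRAM channel,
     * the rest of the line goes to the low-power LPDRAM channel. */
    typedef struct { uint64_t addr; int bytes; int low_latency; } MemReq;

    void split_line_fill(uint64_t line_addr, uint64_t word_addr,
                         MemReq *critical, MemReq *rest) {
        critical->addr        = word_addr; /* word the load needs now   */
        critical->bytes       = 8;
        critical->low_latency = 1;         /* route to RLDRAM           */
        rest->addr            = line_addr; /* remaining 56B of the line */
        rest->bytes           = 56;
        rest->low_latency     = 0;         /* route to LPDRAM           */
    }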
