CSED421 Database Systems Lab

1
CSED421 Database Systems Lab
2
Welcome
  • Lab Class
    • Library 501, Fri 9:00-10:40
  • Teaching Assistants
    • ???, ???
    • ashworld, sanghoon@postech.ac.kr
    • IDS Lab. (279-5878)
  • Database Systems Lab Homepage
    • http://ids.postech.ac.kr/dblab2009/

3
Grading Policy
  • Midterm 25%, Final 25%
  • 4 Homework assignments 15%
  • Participation 5%
  • Lab / Project 30%
    • Lab 10%
    • Minibase project (midterm) 10%
    • Hadoop(?) project (final) 10%
  • Your lab assignment will be accepted even after
    the deadline, but a penalty will be applied.

4
Lab Schedule
  • Introduction
  • Basic DML
  • Constraints
  • Group Functions
  • Join
  • Nested Queries
  • Views
  • Triggers
  • Sequences
  • Indexes
  • User-defined Functions
  • Introduction of Minibase or other term projects
  • (This schedule is tentative and subject to
    change.)
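
To give a feel for the topics above, here is a minimal sketch that touches constraints, basic DML, a join, a group function, a view and a nested query. It assumes Python's built-in sqlite3 module purely for convenience; the slides do not say which DBMS the lab uses, and the dept/emp tables and their rows are made up for illustration. SQLite has no sequences, so that topic is not shown.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
cur = conn.cursor()

# Constraints: primary keys, NOT NULL, UNIQUE, CHECK and a foreign key.
# (Hypothetical schema, not taken from the course material.)
cur.executescript("""
CREATE TABLE dept (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE emp (
    id      INTEGER PRIMARY KEY,
    name    TEXT NOT NULL,
    salary  INTEGER CHECK (salary >= 0),
    dept_id INTEGER REFERENCES dept(id)
);
""")

# Basic DML: insert a few rows.
cur.executemany("INSERT INTO dept VALUES (?, ?)", [(1, "DB"), (2, "OS")])
cur.executemany(
    "INSERT INTO emp VALUES (?, ?, ?, ?)",
    [(1, "Kim", 300, 1), (2, "Lee", 500, 1), (3, "Park", 400, 2)],
)

# Join + group functions, wrapped in a view.
cur.execute("""
CREATE VIEW dept_salary AS
SELECT d.name AS dept, COUNT(*) AS n, AVG(e.salary) AS avg_salary
FROM emp e JOIN dept d ON e.dept_id = d.id
GROUP BY d.name
""")
rows = cur.execute(
    "SELECT dept, n, avg_salary FROM dept_salary ORDER BY dept"
).fetchall()

# Nested query: employees earning more than the overall average.
top = cur.execute(
    "SELECT name FROM emp WHERE salary > (SELECT AVG(salary) FROM emp)"
).fetchall()

print(rows)
print(top)
```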

5
Project I - Minibase
  • http://www.cs.wisc.edu/coral/minibase/minibase.html
  • A tiny DBMS intended for educational use.
  • It has
    • Parser
    • Optimizer
    • Buffer pool manager
    • Storage mechanisms
      • Heap files, secondary indexes based on B+ trees
    • Disk space management system
  • The goal is not just to have a functional
    DBMS, but to have a DBMS where the individual
    components can be studied and implemented by
    students.

6
Project I - Minibase
  • We are going to do the project by implementing
    one or two of these:
    • Join
    • B+ tree
    • Heap Files
    • Buffer Manager
    • Disk Space Manager
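
As a rough idea of what one of these components does, the sketch below shows a toy buffer manager with pin counts and LRU replacement. It is not the Minibase interface: the class name BufferPool, its methods, and the dictionary standing in for the disk space manager are all assumptions made for illustration.

```python
from collections import OrderedDict


class BufferPool:
    """Toy buffer manager: fixed number of frames, pin counts, LRU replacement."""

    def __init__(self, disk, capacity):
        self.disk = disk              # stand-in for the disk space manager: page_id -> bytes
        self.capacity = capacity      # number of frames in the pool
        self.frames = OrderedDict()   # page_id -> [data, pin_count, dirty], least recent first

    def pin(self, page_id):
        """Bring the page into the pool if needed and return its in-memory copy."""
        if page_id in self.frames:
            self.frames.move_to_end(page_id)   # mark as most recently used
        else:
            if len(self.frames) >= self.capacity:
                self._evict()
            self.frames[page_id] = [bytearray(self.disk[page_id]), 0, False]
        frame = self.frames[page_id]
        frame[1] += 1
        return frame[0]

    def unpin(self, page_id, dirty=False):
        """Release one pin; remember whether the caller modified the page."""
        frame = self.frames[page_id]
        if frame[1] == 0:
            raise ValueError("page is not pinned")
        frame[1] -= 1
        frame[2] = frame[2] or dirty

    def _evict(self):
        """Drop the least recently used unpinned page, writing it back if dirty."""
        victim = next((pid for pid, f in self.frames.items() if f[1] == 0), None)
        if victim is None:
            raise RuntimeError("all frames are pinned")
        data, _, dirty = self.frames.pop(victim)
        if dirty:
            self.disk[victim] = bytes(data)


# Usage: a 2-frame pool over a 3-page "disk".
disk = {1: b"aaaa", 2: b"bbbb", 3: b"cccc"}
pool = BufferPool(disk, capacity=2)
page = pool.pin(1)
page[0:1] = b"X"            # modify page 1 in memory
pool.unpin(1, dirty=True)
pool.pin(2)
pool.unpin(2)
pool.pin(3)                 # pool is full: page 1 is evicted and written back
```

A real buffer manager also has to handle page allocation, flushing on shutdown, and replacement policies other than LRU; those are left out to keep the sketch short.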

7
Project II - Hadoop
  • One of the Apache projects for reliable,
    scalable, distributed computing, including
    • Hadoop Core
      • distributed filesystem (HDFS)
      • MapReduce distributed computing metaphor
    • HBase
      • scalable, distributed database
    • Pig, ZooKeeper, Hive
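
The MapReduce metaphor mentioned above can be sketched on a single machine in a few lines. This is plain Python, not the Hadoop API: map_reduce, word_mapper and count_reducer are hypothetical names, and the two input lines are made up. It only illustrates the map, shuffle and reduce steps on a word count.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Apply mapper to every record, group emitted values by key, reduce each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)      # the "shuffle" step: group values by key
    return {key: reducer(key, values) for key, values in sorted(groups.items())}


def word_mapper(line):
    """Map step: emit (word, 1) for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def count_reducer(word, counts):
    """Reduce step: sum the counts emitted for one word."""
    return sum(counts)


lines = ["Hadoop stores data in HDFS", "MapReduce reads data from HDFS"]
word_counts = map_reduce(lines, word_mapper, count_reducer)
print(word_counts)
```

In Hadoop the mapper and reducer run on different nodes and the framework performs the grouping over the network; here everything happens in one process.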

8
Project II - Hadoop
  • A Simple Inverted Index
  • PageRank on the Wikipedia Corpus
  • Clustering the Netflix Movie Data
  • The TAs have to build a Hadoop cluster
    in advance!
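
For the first project idea, a simple inverted index can be sketched as below. This is a single-process illustration in plain Python, not Hadoop code; build_inverted_index and the two sample documents are assumptions made for the example. It follows the same map, shuffle and reduce outline that a Hadoop job would use.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """docs maps doc_id -> text; returns word -> sorted list of doc_ids containing it."""
    # Map phase: emit one (word, doc_id) pair per word occurrence.
    pairs = [
        (word, doc_id)
        for doc_id, text in docs.items()
        for word in text.lower().split()
    ]
    # Shuffle phase: group document ids by word (a set removes duplicates).
    postings = defaultdict(set)
    for word, doc_id in pairs:
        postings[word].add(doc_id)
    # Reduce phase: turn each group into a sorted posting list.
    return {word: sorted(ids) for word, ids in postings.items()}


docs = {
    "d1": "heap files and btree indexes",
    "d2": "buffer manager and heap files",
}
index = build_inverted_index(docs)
print(index["heap"])
```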