CS519 BGP Project Report - PowerPoint PPT Presentation

Slides: 21
Provided by: Kev855
1
CS519 BGP Project Report
  • Kai-Wen Chung (kc279)
  • San-Yiu Cheng (sc345)

2
How to Proceed with BGP Analysis

Collect Raw Data
Import into Database
Query Database and Analyze data
3
Collect Raw Data
  • MAE-EAST (1998.1 - 1998.11)
  • http://archive.routeviews.org/ (2003.1 - 2003.3)

4
Database Schema
  • Original Schema

5
Database Schema (cont.)
  • Record Size
  • Message 94 bytes/record
  • MsgPath 18 bytes/record
  • Record
  • Message 104,841,405 (98.1 - 98.11)
  • MsgPath 251,442,478 (98.1 - 98.11)
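
As a rough cross-check of the figures on this slide, multiplying record size by record count gives the raw data volume for the 1998 data. This is a minimal sketch; it assumes no index or page overhead, so the real on-disk size would be larger.

```python
# Record sizes and counts as given on the slide (original schema, 1998.1 - 1998.11).
tables = {
    "Message": {"bytes_per_record": 94, "records": 104_841_405},
    "MsgPath": {"bytes_per_record": 18, "records": 251_442_478},
}

def raw_bytes(table):
    """Raw payload size: record size times record count (no index/page overhead)."""
    return table["bytes_per_record"] * table["records"]

total = 0
for name, table in tables.items():
    size = raw_bytes(table)
    total += size
    print(f"{name}: {size / 1e9:.2f} GB")

print(f"Total: {total / 1e9:.2f} GB")
```

This comes to roughly 14.4 GB of raw rows for eleven months before any overhead, which fits the concern that a 20GB allocation fills up quickly.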

6
Database Schema (cont.)
  • Database space allocation 20GB
  • About 12 hours to import raw data for 1 month
    (about 10,000,000 messages, and 20,000,000 paths)
  • Data volume will soon reach the space limit
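
The import figures above imply a fairly low insert rate. A back-of-the-envelope sketch using the slide's rounded numbers:

```python
# Rounded figures from the slide: one month of raw data takes about 12 hours to import.
messages = 10_000_000
paths = 20_000_000
import_seconds = 12 * 60 * 60

message_rate = messages / import_seconds
row_rate = (messages + paths) / import_seconds

print(f"Messages per second: {message_rate:.0f}")
print(f"Rows per second (Message + MsgPath): {row_rate:.0f}")
```

That is on the order of a few hundred rows per second, so anything that shrinks the records or the per-row work should shorten the import noticeably.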

7
Our Solution
  • Allocate larger space
  • Move Database from SQLServer -> Sparrow
  • Total 70GB
  • Modify data schema to reduce record size

8
Data Schema Modification
9
Data Schema Modification
  • Record Size
  • Message 52 bytes/record
  • MsgPath 14 bytes/record
  • Size Reduction
  • Message 46.9%
  • MsgPath 22.2%
  • Faster Data Importing
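
The per-record reduction can be recomputed from the record sizes on the schema slides (94 and 18 bytes originally, 52 and 14 bytes after the modification). A small sketch of that arithmetic:

```python
def reduction_percent(old_bytes, new_bytes):
    """Percentage by which the record size shrinks."""
    return (old_bytes - new_bytes) / old_bytes * 100

# (original size, modified size) in bytes per record, from the slides.
sizes = {"Message": (94, 52), "MsgPath": (18, 14)}
for name, (old, new) in sizes.items():
    print(f"{name}: {old} -> {new} bytes, {reduction_percent(old, new):.1f}% smaller")
```

With these inputs the MsgPath figure matches the slide (22.2%), while Message comes out at about 44.7% rather than 46.9%. The slide's Message figure may be based on a slightly different baseline size.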

10
Current Status
  • Database
  • P3-500 with 128MB ram, and Windows 2000 Server
    and SQL Server 2000 installed
  • Imported Data
  • 1998.1 - 1998.11. About 21GB in DB
  • 2003.3. About 34GB in DB

11
Current Database Issue
  • SQL Server Performance
  • It could take several hours to run a query
  • Space problem
  • 70GB is only enough for 1 to 2 months of 2003
    data
  • We need a terabyte-scale database to accommodate
    all the data for 2002 and 2003
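
The space estimate can be sketched from the imported-data figures (about 21GB for 1998 and about 34GB for 2003.3). The projection assumes every month of 2002 and 2003 is as large as March 2003. That assumption is not from the slides and probably overstates 2002.

```python
# From the slides: 70GB total, about 21GB used by 1998 data, about 34GB for 2003.3.
capacity_gb = 70
used_1998_gb = 21
gb_per_month = 34

# Assumption (not from the slides): all 24 months of 2002-2003 match 2003.3 in size.
months_needed = 24

months_that_fit = (capacity_gb - used_1998_gb) / gb_per_month
projected_gb = gb_per_month * months_needed

print(f"Months of 2003-sized data that fit alongside 1998: {months_that_fit:.1f}")
print(f"Projected size for {months_needed} months: {projected_gb}GB")
```

Under that assumption the full two years would need about 816GB, which is in line with asking for a terabyte-scale database.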

12
Summary of Data
  • Total space used
  • 55GB (1998 and 03/2003)
  • Number of Messages
  • 220.5 Million (1998 and 03/2003)
  • Number of DataSet
  • 30,000 (1998 and 03/2003)

13
Summary of Data (cont.)
  • A small number of IP addresses dominate the
    routing table
  • 15 Source IP addresses occupy about 68% of the
    PeerIp field of the Messages
  • 15 Destination IP Addresses occupy about 47% of
    the NextHop field of the Messages
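
A figure like "the top 15 addresses account for N% of a field" could be obtained with a grouped count. The sketch below is illustrative only. It uses Python's sqlite3 as a stand-in for SQL Server and a hypothetical two-column Message table with toy rows, since the real schema is not shown in this transcript.

```python
import sqlite3

def top_n_share(conn, column, n):
    """Fraction of Message rows whose `column` value is among the n most frequent values."""
    if column not in ("PeerIp", "NextHop"):  # guard: column name is interpolated into SQL
        raise ValueError(f"unexpected column: {column}")
    total = conn.execute("SELECT COUNT(*) FROM Message").fetchone()[0]
    top = conn.execute(
        f"""
        SELECT COALESCE(SUM(cnt), 0) FROM (
            SELECT COUNT(*) AS cnt
            FROM Message
            GROUP BY {column}
            ORDER BY cnt DESC
            LIMIT ?
        )
        """,
        (n,),
    ).fetchone()[0]
    return top / total if total else 0.0

# Toy stand-in for the real Message table (hypothetical layout and addresses).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Message (PeerIp TEXT, NextHop TEXT)")
rows = (
    [("10.0.0.1", "10.1.0.1")] * 6
    + [("10.0.0.2", "10.1.0.2")] * 3
    + [("10.0.0.3", "10.1.0.3")] * 1
)
conn.executemany("INSERT INTO Message VALUES (?, ?)", rows)

# Share of messages held by the 2 busiest peers in the toy data.
print(top_n_share(conn, "PeerIp", 2))
```

On SQL Server the `LIMIT` clause would be written with `TOP` instead. The grouping itself is unchanged.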

14
Summary of Data (cont.)
  • Advertisement Vs. Withdrawal Messages
  • There are about 220 Million Messages
  • 31.5% of all Messages are Withdrawal Messages
  • 68.5% of all Messages are Advertisement Messages
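
To put the percentages in absolute terms, here is a short sketch using the 220.5 million total from the summary slide and integer arithmetic to avoid rounding noise:

```python
# Total from the summary slide; the split is 31.5% withdrawals, the rest advertisements.
total_messages = 220_500_000
withdrawal_permille = 315  # 31.5% expressed in tenths of a percent

withdrawals = total_messages * withdrawal_permille // 1000
advertisements = total_messages - withdrawals

print(f"Withdrawals:    {withdrawals / 1e6:.1f} million")
print(f"Advertisements: {advertisements / 1e6:.1f} million")
```

That is roughly 69.5 million withdrawals against 151.0 million advertisements.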

15
Data Analysis
16
Data Analysis (cont.)
17
Data Analysis (cont.)
18
Data Analysis (cont.)
19
Data Analysis (cont.)
20
Some Advice
  • Optimize your query
  • Some queries are going to take several hours to
    execute
  • Test on bgpbaby first
  • This is a smaller version of bgpdata (1GB)
  • Don't try to execute all your queries on the last day
  • The SQL Server database is going to be
    overwhelmed
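
One way to follow the "test on bgpbaby first" advice is to time each query on the small database before running it on the full one. The sketch below is only an illustration. It uses Python's sqlite3 with a tiny in-memory table as a hypothetical stand-in for bgpbaby, and the table layout is assumed.

```python
import sqlite3
import time

def timed_query(conn, sql, params=()):
    """Run a query and return (rows, elapsed_seconds)."""
    start = time.perf_counter()
    rows = conn.execute(sql, params).fetchall()
    return rows, time.perf_counter() - start

# Hypothetical stand-in for bgpbaby: a tiny in-memory Message table.
small = sqlite3.connect(":memory:")
small.execute("CREATE TABLE Message (PeerIp TEXT)")
small.executemany(
    "INSERT INTO Message VALUES (?)",
    [("10.0.0.1",), ("10.0.0.2",), ("10.0.0.1",)],
)

rows, elapsed = timed_query(
    small,
    "SELECT PeerIp, COUNT(*) FROM Message GROUP BY PeerIp ORDER BY PeerIp",
)
print(rows, f"{elapsed:.4f}s")
```

If run time scaled linearly with data size, a query would take on the order of 55 times longer on the full data than on the 1G test copy. Real run time rarely scales that cleanly, so treat this only as a lower bound.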