Predicting and Bypassing EndtoEnd Internet Service Degradation - PowerPoint PPT Presentation

1 / 33
About This Presentation
Title:

Predicting and Bypassing EndtoEnd Internet Service Degradation

Description:

15% different between using the last minute measurement or the 15 minutes ago measurement ... Minute. ago. Quantity Principle: Importance. Test case: Fixed ... – PowerPoint PPT presentation

Number of Views:217
Avg rating:3.0/5.0
Slides: 34
Provided by: anatbrem
Category:

less

Transcript and Presenter's Notes

Title: Predicting and Bypassing EndtoEnd Internet Service Degradation


1
Predicting and Bypassing End-to-End Internet
Service Degradation
  • Anat Bremler-Barr Edith Cohen Haim Kaplan Yishay
    Mansour
  • Tel-Aviv University ATT Labs
    Tel-Aviv University
  • Talk
  • Omer Ben-Shalom
  • Tel-Aviv University

2
Outline
  • Degradation
  • deviation from normal (minimum) RTT.
  • Predicting Degradation
  • Different Predictors
  • Performance Evaluation
  • Precision/recall methodology
  • Suggested Application Gateway selection

3
Motivating Application

AS 41
AS 123
Peering link
AS 56
Peering link
AS 12
  • Gateway selection (Intelligent Routing device)
  • Choosing peering links

4
Data and Measurements Sources
  • Aciri (CA2)
  • ATT (CA1)
  • ATT(NJ1)
  • Princeton (NJ2)
  • Base Measurements from 4 different location (AS)
    simulated 4
  • gateway
  • California (CA) ATT ACIRI
  • New Jersey (NJ) ATT Princeton

5
Data and Measurements Destinations
  • Aciri (CA2)
  • ATT(CA1)
  • ATT(NJ1)
  • Princeton (NJ2)
  • Obtaining a representative sets of web servers
    weights
  • (derived from proxy-log)

6
Data and Measurements RTT
  • Aciri (CA2)
  • ATT(CA1)
  • ATT(NJ1)
  • Princeton(NJ2)
  • Data Weekly RTT (SYN) ( End to End
    (pathserver))
  • Hourly measurements ? 35,124 servers
  • Once-a-minute weighted sample measurements ? 100
    servers

7
Degradation Definition
  • Deviation from minimum recorded RTT (propagation
    delay)
  • Discrete degradation levels 1-6.

8
Objective Avoiding degradation ?
  • Attempt to reroute through a different gateway
  • Two conditions have to hold
  • Need to be able to predict the failure from a
    gateway
  • Need to have a substitute gateway (low
    correlation between gateways)
  • Blackout (consecutive degradation) through one
    gateway

9
Blackout durations
  • Longer duration, easier to predict.
  • Majority of blackouts are short 1-3 consecutive
    points
  • However, considerable fraction occurs in longer
    durations.

Long duration blackout
10
Gateways Correlation
  • Gateways are correlated but often the correlation
    is not too strong

11
Gateways Correlation
  • Longer blackouts more likely to be shared
  • failure closer to the server
  • Majority of 2-gateways blackouts involved
    same-coast pairs

12
Building predictors
  • For a given degradation level l.
  • Prediction per IP.
  • Input Previous RTT Measurements for the
    IP-address.
  • Output probability for a failure
  • Predict failure if probability gt ?

13
Precision \ Recall Methodology
Predicted degraded
Actual degraded
14
Precision-recall curve
  • Sweep the threshold ? in 0,1 to obtain a
    precision-recall curve.
  • In other words, let P(t) the predicted failure
    probability at time t

15
What is important for prediction?
  • Recency principle
  • The more recent RTTs are more important.
  • Quantity Principle
  • The more measurements the higher the accuracy.

16
Recency Principle Importance
  • Test case Single measurement predictor
  • predict according to a measurement x-minute ago.
  • observe the change in the quality of the
    prediction.
  • ? 15 different between using the last minute
    measurement or the 15 minutes ago measurement

17
Quantity Principle Importance
  • Test case Fixed-Window-Count (FWC)
  • the prediction is the fraction of failures in
    the W most recent measurements
  • ? By quantity we can achieve better precision
    for high recall

FWC 1 FWC 5 FWC 10 FWC 50
18
Our predictors
  • Exponential Decay
  • Polynomial Decay
  • Model based Predictors
  • VW-cover Variable Window Cover algorithm
  • HMM Hidden Markov Model

19
Exponential-decay predictors
  • The weight of each measurement is exponentially
    decreasing with its age by factor ?.
  • For consecutive measurements
  • Binary variable ft represents a failure at
    time t.
  • In general,

20
Polynomial-decay predictors
  • Exact computation required to maintaining the
    complete history.
  • We approximated it.

21
The VW-Cover predictor
  • Consists of a list of pairs
  • ( a1 , b1) ( a2 , b2 ) ( an , bn )
  • Predict a failure if exist i such that there are
    at least bi failures among previous ai
    measurements

22
VW-Cover predictor Building
  • Build the predictor greedily to cover the
    failures.
  • Use a learning set of measurements
  • Pick ( a1 , b1 ) to be the pair which maximizes
    precision
  • Pick ( ai , bi ) to be the pair which maximizes
    precision among uncovered failures

23
Hidden Markov Model
  • Finite set states S (we use 3 states)
  • Output probability as(0),as(1)
  • Transition function, determines the probability
    distribution of the next state.
  • The probability for a failure
  • Where ps(t) is the probability to
  • be at state s at time t. Ps(t) is
  • updated according to the output
  • of time t-1.

24
Experimental Evaluation
25
Predictor Performance Level 3
FWC10 FWC 50 ExpDecay 0.99 ExpDecay
0.95 VW-Cover HMM
? A recall 0.5 precision close to 0.9
26
Predictor Performance Level 6
FWC10 FWC 50 ExpDecay 0.99 ExpDecay
0.95 VW-Cover HMM
  • Degradation of level-6 are harder to predict
  • recall 0.5 precision 0.4

27
Predictor Performance Conclusion
  • The best predictors in level 3 and 6 are
  • VW-cover and HMM
  • But they only slightly outperform ExpDecay0.95
    which is considerable simpler to implement

28
Gateway Selection
Level 6
Level 3
29
Gateway Selection Conclusion
  • Active gateway selection resulted in 50
    reduction in the degradation-rate with respect to
    best single gateway.
  • Static gateway selection can avoid at most 25 of
    degradations.
  • Again ExpDecay0.95 only slightly under perform
    the best predictor (VW-cover).

30
Performance of gateway selection as a function of
recency
31
Correlation between coast
  • Gateway selection on same-coast pair resulted
    only in 10 reduction.
  • Chose independent gateways

32
Controlling prediction overhead
  • Type of measurements
  • Active measurements
  • initiate probes (SYN,ping,HTTP request).
  • Scalability problem.
  • Passive measurements
  • collected on regular traffic
  • Controlling the prediction overhead
  • Using less-recent measurements
  • Active measurements only to small set of
    destinations, which cover the majority of
    traffic.
  • Cluster destinations. The measurements of one
    destination can be used to predict another.

33
  • Questions ??
  • natali_at_cs.tau.ac.il
  • edith_at_research.att.com
  • haimk_at_cs.tau.ac.il
  • mansour_at_cs.tau.ac.il
Write a Comment
User Comments (0)
About PowerShow.com