Title: A Heartbeat Mechanism and its Application in Gigascope
 1A Heartbeat Mechanism and its Application in 
Gigascope
- Johnson, Muthukrishnan, Shkapenyuk, Spatscheck 
- Presented by 
- Joseph Frate and John Russo
2Introduction
- Unblocking aggregation, join and union operators 
- Limit scope of output tuples that an input tuple 
 can affect.
- Two techniques 
- Define queries over window of input stream 
- Applicable to continuous query systems for 
 monitoring
- Timestamp mechanism 
- Applicable to data reduction applications 
3Gigascope
- A high-performance streaming database for network 
 monitoring
- Some fields of input stream behave as timestamps 
- Locality of input tuple 
- Aggregation must have timestamp as one of group 
 by fields
- Join query relates timestamp fields of both 
 inputs
- Merge operator 
- Union operator 
- Preserves timestamp property of one of the fields 
4Timestamps and Gigascope
- Effective as long as progress is made in all 
 input streams
- Join or merge operators can stall if one input 
 stream stalls, causing systems failure
5Example
- Sites monitored by Gigascope have multiple 
 gigabit connections to the Internet.
- One or more is a backup link through which 
 traffic can be diverted if primary link fails
- All links are monitored simultaneously 
- Since primary link has gigabit traffic and backup 
 link has no traffic, merge operator will quickly
 overflow.
- Presence of tuples carries timestamps, absence 
 does not
6Focus of Paper
- Authors propose use of heartbeats or punctuation 
 to unblock operators
- Heartbeats originate at source query operators 
 and propagate throughout query
- Timestamp punctuations are generated at source 
 query nodes and inferred at every other operator
- Punctuated heartbeats unblock operators that 
 would otherwise be blocked
- Focus is on multi-stream operators (joins and 
 merges)
- Significantly reduces memory load for join and 
 merge operators
7Related Work
- Heartbeat 
- Widely used for fault-tolerance in distributed 
 systems
- Remote nodes send periodic heartbeats to let 
 other nodes know that they are alive
- Heartbeats are also used in distributed DSMS 
- Stream Punctuations 
- Embed special marks in stream to indicate end of 
 subset of data
- Previous work in heartbeat mechanisms with 
 streaming data have focused on enforcing ordering
 of timestamps of tuples before query processing
8Gigascope Architecture
- Stream-only database 
- No continuous queries, thus no windows to unblock 
 queries
- Streams are labeled with timestampness, such as 
 monotone increasing. Used by query planner to
 unblock blocking operators
9Gigascope Architecture (Continued)
- Aggregation query 
- One of the group by attributes must have 
 timestampness.
- When this attribute changes, all groups and 
 aggregates are flushed to the operators output
- We can define this as an epoch. Flush occurs at 
 the end of each epoch.
- Example here time is labeled as monotone 
 increasing
- select tb, srcIP, count() from TCP 
-  group by time/60 as tb, srcOP 
- In this query, we count the packets from each 
 source IP address during 60 second epochs
10Gigascope Architecture (Continued)
- Merge Operator 
- Union of two streams A and B 
- A and B must have same schema 
- Both streams must have a timestamp field, such as 
 t, on which to merge
- If tuples on A have a larger value of t than 
 those on B, tuples on A are buffered until B
 catches up
11Gigascope Architecture (Continued)
- Join Query 
- Must contain a join predicate such as A.trB.ts 
 or A.tr/3  B.ts2
- Relates timestamp field from A with B 
- Input streams are buffered so that streams match 
 up on the timestamp.
12Two-Level Query Architecture
- Low level used for data reduction 
- High level performs complex processing 
- Controls high streaming rates 
- Data streams from NICs are placed in a ring 
 buffer.
- These are called source streams 
13Two-Level Query Architecture (continued)
- Since volumes are too large to provide copy to 
 each query, Gigascope creates a subquery
- For example 
- Query Q to be executed over source stream S 
- Gigascope creates a subquery q which directly 
 accesses S
- Q is transformed into Q0 which is executed over 
 the stream output of q
14Two-Level Query Architecture (continued)
- Low-level subqueries are called LFTAs 
- Fast, lightweight data reduction queries 
- Objective is to quickly process high volume data 
 stream in order to minimize buffer requirements
- Expensive processing is performed on output of 
 low level queries
- Smaller volume 
- Easily buffered
15Two-Level Query Architecture (continued)
- Much of subquery processing can be performed on 
 the NIC itself
- Low-level aggregation uses a fixed-size hash 
 table for maintaining groups in group by
- If a collision occurs in hash table, old group is 
 ejected as a tuple and new group replaces it in
 its slot
- Similar methodology as subaggregates and 
 superaggregates in data cube computations
- Higher level queries complete aggregation
16Two-Level Query Architecture (continued)
- Traffic shaping policies are implemented in order 
 to spread out processing load
- Aggregation operator uses a slow flush to emit 
 tuples when aggregation epoch changes
17Heartbeats to Unblock Streaming Operators
- Gigascope heartbeats are produced by low-level 
 query operators
- Propagated throughout query DAG 
- Incur same queuing delays as tuples 
- System performance monitoring 
- Aids in detecting failed nodes 
- Stream punctuation mechanism is implemented 
- Injects special temporal update tuples into 
 operator output streams
- Informs receiving operator of end of subset of 
 data
- Authors first attempted on-demand generation of 
 temporal update tuples
- Authors settled upon approach of using heartbeats 
 to carry temporal update tuples
18Schema of Temporal Update Tuple
- Identical to regular tuple 
- All attributes marked as temporal are initialized 
 with values that will not violate the ordering
 properties
- i.e. timebucket is marked as temporal increasing 
- Temporal update tuple with timebuckett is 
 received by an operator
- All future tuples will have timebucket t 
- Non-temporal attributes are ignored by receiving 
 operators
- Goal is to generate temporal attribute values 
 aggressively and set them to highest possible
 value
19Heartbeat Generation at LFTA LevelLow-level 
Streaming Operators
- Read data directly from source data streams 
 (packets sniffed from NICs)
- Use filtering, projection and aggregation to 
 reduce amount of data in a stream before passing
 it to higher-level nodes
- Two types of streaming operators selection and 
 aggregation
- Multi-query optimization through prefilters 
20Heartbeat Generation at LFTA LevelLow-level 
Streaming Operators
- Normal mode of operation is to block waiting for 
 new tuples to be posted to NICs ring buffer.
- Once a tuple is in the buffer, it is processed 
- Wait is periodically interrupted to generate a 
 punctuated heartbeat
21Heartbeats in select LFTAs
- Selection, projection and transformation is 
 performed by LFTAs on packets arriving from data
 stream source.
- If predicate is satisfied, output tuple is 
 generated according to projection list.
- A few additions 
- Modify accept_tuple to save all temporal 
 attribute values referenced in select clause
- Whenever a request is received to generate a 
 temporal update tuple, use maximum of saved value
 of temporal attributes and a value saved by
 prefilter to infer value of temporal update tuple
22Heartbeats in Aggregation LFTAs
- Group by and aggregation functionality is 
 implemented using direct-mapped hash table
- Collision results in ejected tuple being sent to 
 output stream
- High-level aggregation node completes aggregation 
 of partial results produced by LFTA
- Flushing occurs whenever incoming tuple advances 
 the epoch.
- Slow-flush is used to avoid overflow of buffers 
- Gradually emits tuples as new tuples arrive 
- Last seen temporal values in input stream are 
 saved, similar to select.
- This value is used to generate temporal update 
 tuples
- Also maintains value of last temporal attribute 
 of last tuple flushed
23Heartbeats in Aggregation LFTAs
- Whenever a request for a heartbeat is received, 
 uses following formula in order to insure that
 heartbeat does not violate temporal attribute
 ordering properties
-  if we have unflushed tuples 
-  use the value of last flushed tuple 
-  else 
-  use max of saved value of the temporal 
-  attributes and the value saved by the 
-  prefilter
24Using System Time for Temporal Values
- When a link has no traffic for a long time, 
 heartbeat is generated using inference from
 system time.
- Skew between system clock and buffering has to be 
 accounted for when setting up Gigascope system
25High Level Query Nodes (HFTA)
- Second level of query execution (two levels) 
- LFTA for low-level filtering and sub-queries 
- HFTA for complex query processing 
- Selection 
- Multiple types of aggregation 
- Stream merge 
- Inner and outer join 
- Data from multiple streams (HFTAs, LFTAs)
26HFTA Execution
- Normal mode of execution 
- Block while waiting for new tuples to arrive 
- Determines which operator to apply to that stream 
- Process the incoming tuple for that operator 
- Route output (if operator required) 
- Also regularly receives temporal update tuples 
- Operators interpret temporal update tuples 
- Operators use these tuples to unblock themselves
27HFTA Heartbeats in selection
- Mostly identical to selection LFTAs 
- Unpack field values referenced in query predicate 
- Check if predicate is satisfied 
- If satisfied, generate projection list 
- Difference can receive temporal update tuples 
 (in addition to regular data tuples)
- Temporal update tuple is received 
- Operator updates the saved values of all temporal 
 attributes referenced in query Select
- Generates new temporal update tuple 
- Rest of normal tuple processing is bypassed
28HFTA Heartbeats in aggregation and sampling
- Non-blocking operator 
- Aggregates data within a time window (epoch) 
- Epoch defined by values of temporal group-by 
 attributes
- Required to keep all groups and corresponding 
 aggregates until end of epoch
- Then flushes to output stream 
- Slow flush to avoid overflow 
- One output tuple for each input tuple 
- Change in epoch ? immediate flush
29HFTA Heartbeats in stream merge
- Union of two streams R and S in a way that 
 preserves the ordering properties of the temporal
 attributes
- R and S must have same schema 
- Both must have a temporal field to merge on 
- Stream buffers until the other catches up (if 
 difference in time fields)
30HFTA Heartbeats in join
- Relate two data streams by timestamp 
- e.g., R.tr  2  S.ts 
- INNER, LEFT, RIGHT, and FULL OUTER 
- Minimum timestamps for R and S maintained 
- Buffers input streams to ensure they match up by 
 timestamp predicate
- May include temporal attributes
31Other heartbeat applications
- Initial goal of heartbeats was to collect 
 statistics about nodes in a distributed setting
- Main focus of paper is to use heartbeats to carry 
 stream punctuation
- Found that there are other uses for heartbeats
32Fault tolerance
- Heartbeat mechanism used to detect node failures 
 in distributed systems
- Nodes must ping to indicate still alive 
- Gigascope slightly different 
- Heartbeats periodically generated by low-level 
 queries
- Propagated upward through query execution DAG 
- Constant flow of heartbeat tuples indicates if 
 low-level sub-query is responding or not
 responding
- No heartbeat over specified amount of time ? 
 query has failed and recovery is initiated
33System performance analysis
- Heartbeat message subject to same routing as 
 other data in network
- Heartbeat visit all nodes 
- Statistics gathered on every node visit 
- Full trace of all nodes 
- Can be used to identify poor-performing nodes
34Distributed query optimization
- Automated tools to utilize collected statistics 
- Re-optimize query execution plans (eliminate 
 identified bottlenecks)
- Other statistics 
- Predicate selectivities 
- Data arrival rates 
- Tuple processing costs
35Performance evaluation
- Experiments conducted on live network feed 
- Queries monitor three network interfaces 
- main1 and main2 DAG4.3GE Gigabit Ethernet 
 interfaces (see bulk of traffic)
- control 100Mbit interface 
- Approximately 100,000 packets per second 
- 400 Mbits/sec 
- Dual-processor 2.8 GHz P4, 4GB RAM, FreeBSD 4.10 
- Focus is to unblock streaming operators 
- high-rate main links 
- low-rate backup links (control)
36Evaluation Unblocking stream merge using 
heartbeats
- Effect of heartbeats on memory usage of queries 
 that use stream merge operator
- Used the following query 
-  SELECT tb, protocol, srcIP, destIP, srcPort, 
 destPort, count()
-  FROM DataProtocol 
-  GROUP BY time/10 as tb, protocol, srcIP, 
 destIP, srcPort, destPort
37Evaluation Unblocking stream merge using 
heartbeats
- Query computes the number of packets observed in 
 different flows in 10 second time buckets
- Query planner merges three streams 
38Evaluation Unblocking stream merge using 
heartbeats
- When control link has no traffic, large amount of 
 data buffered (from other streams)
- Varied heartbeat interval 
- From 1 second to 30 seconds 
- 5 second increments 
39Evaluation Unblocking stream merge using 
heartbeats
- Heartbeats successfully unblock the stream merge 
 operators
- Higher intervals ? more state maintained 
- Also results in more data being flushed when an 
 epoch advances (could cause query failure when
 system lacks traffic sharing, such as slow flush)
40Evaluation Unblocking join operators using 
heartbeats
- How effectively do heartbeats 
- Unblock join queries 
- Reduce overall query memory requirements 
41Evaluation Unblocking join operators using 
heartbeats (continued)
- Query flow1 
-  SELECT tb,protocol,srcIP,destIP, 
 srcPort,destPort,count() as cnt
-  FROM main0_and_control.DataProtocol 
-  GROUP BY time/10 as tb,protocol,srcIP, destIP, 
 srcPort, destPort
- Query flow2 
-  SELECT tb,protocol,srcIP,destIP, 
 srcPort,destPort,count() as cnt
-  FROM main1.DataProtocol 
-  GROUP BY time/10 as tb,protocol,srcIP, destIP, 
 srcPort, destPort
42Evaluation Unblocking join operators using 
heartbeats (continued)
- Query full_flow 
-  SELECT flow1.tb,flow1.protocol, flow1.srcIP, 
 flow1.destIP,
-  flow1.srcPort,flow1.destPort, 
-  flow1.cnt, flow2.cnt 
-  OUTER_JOIN FROM flow1, flow2 
-  WHERE flow1.srcIPflow2.srcIP and 
-  flow1.destIPflow2.destIP and 
-  flow1.srcPortflow2.srcPort and 
-  flow1.destPortflow2.destPort and 
-  flow1.protocolflow2.protocol and 
-  flow1.tb  flow2.tb
43Evaluation Unblocking join operators using 
heartbeats (continued) 
 44Evaluation Unblocking join operators using 
heartbeats (continued)
- Two sub-queries compute the flows aggregated in 
 10 second time-buckets
- main1control 
- main2 
- Full query results combined using full outer join 
 for final output
45Evaluation Unblocking join operators using 
heartbeats (continued)
- Varied interval of generated heartbeats 
- from 1 second to 60 seconds 
- 10 second increments 
46Evaluation Unblocking join operators using 
heartbeats (continued)
- Less memory usage with greater heartbeat interval 
 (linear growth)
- Avoid accumulating large state of blocking 
 operators
- Decrease bursty-ness of output (since less data 
 is dumped at the end of the heartbeat interval
 when the interval is low)
47Evaluation CPU overhead of heartbeat generation
- Measure CPU overhead from heartbeats 
- Measure average CPU load of a merge query 
-  SELET tb, protocol, srcIP, destIP, srcPort, 
 destPort, count()
-  FROM DataProtocol 
-  GROUP BY time/10 as tb, protocol, srcIP, destIP, 
 srcPort, destPort
- Run on two high-rate interfaces 
- Compared 1 second heartbeat interval vs. 
 identical system with heartbeats disabled
48Evaluation CPU overhead of heartbeat generation 
(continued)
- Both monitored links have moderately high load, 
 so merge operators are naturally unblocked
- Heartbeats disabled 37.3 CPU load 
- Heartbeats enabled 37.5 CPU load 
- Insignificant difference 
- Explained by variations in traffic load 
- Conclusion overhead of heartbeat mechanism is 
 immeasurably small
49Conclusion
- Heartbeats regularly generated 
- Propagated to all nodes 
- Attach temporal update tuples to unblock 
 operators
- Heartbeat mechanism can be used for other 
 applications in distributed setting
- Detecting node failure 
- Performance monitoring 
- Query optimization 
- Performance evaluation 
- Capable of working at multiple Gigabit line 
 speeds
- Significantly decrease query memory utilization