Query Processing for SQL Updates

About This Presentation

Title:

Description:

Number of Views:47

Avg rating:3.0/5.0

Slides: 17

Provided by: S814

Category:

Tags: sql | processing | query | updates | waas

Transcript and Presenter's Notes

Title: Query Processing for SQL Updates

1
Query Processing for SQL Updates

2
Issues in Efficient Processing of Updates

3
Modeling Updates

4
General form of update Plan

5
Index Maintenance

6
Index Maintenance

7
Per-Index Maintenance

8
Split Delta
9
Choosing an Update Plan

Per-index plan can be more efficient for a large
batch
Sorted order, piggy back other steps such as
referential integrity checks
But may be less efficient in other cases due to
overhead of more operators for per-index plan
spooling of delta
Wide vs stacked plans
i.e. separate deltas for each secondary index vs.
shared delta pipelined through many operators
wide seems better since it can prune unnecessary
columns
Cache effects when you have a choice of index
for selection, may be better to use one that is
going to be updated

10
Checking Single Table Constraints

11
Referential Integrity Constraints

Updates/deletes on referenced side
Inserts/updates on referencing side
Use outerjoin of delta rows with other table
ensure match/lack of match depending on updated
side being referencing/referenced
Checks performed after updating the table
But for updates to referenced table, old values
must be used
May require index on attributes of referencing
table
index on referenced table already present
normally

12
Checking Referential Integrity
13
Referential Integrity (Cont.)

When checking self-referential integrity
constraints, perform all updates done by a DML
statement, only then check
Why?
If there is an index on f.k. attrs, integrity
check can be combined with index maintenance,
since both need same sort order
place ref integrity check operator just above
index maintenance
Another optimization eliminate duplicates

14
Cascading Updates

E.g. S has f.k. referencing R, on update cascade
Update to p.k. of tuple in R cascades to S
Similar to f.k. checking, but instead of
validating, update referencing relation(s)
to do so, create delta table, and use update plan
to process delta table

15
Cascading Updates
16
Issues not addressed by paper

All optimizations are for a single transaction
What if there is a sequence of small
transactions?
Insert/update/delete to large B-tree may require
1 I/O per update to leaf page
even assuming internal pages are in memory
Solution write-optimized B-trees
collect deltas and apply as a batch
but ensure queries are correctly answered
meanwhile, even if more slowly
Several schemes including work at IITB in 1997,
and a nice survey paper by Graefe with some neat
implementation ideas