Jerry Held - PowerPoint PPT Presentation

About This Presentation
Title:

Jerry Held

Description:

Agenda The Goal Mythology and other interesting anti-facts and facts Reorganizing Tables Offline Online Entire table Shrinking Space Purging Reorganizing Indexes ... – PowerPoint PPT presentation

Number of Views:131
Avg rating:3.0/5.0
Slides: 35
Provided by: Analy8
Learn more at: http://www.nocoug.org
Category:

less

Transcript and Presenter's Notes

Title: Jerry Held


1
Reorganizations
2
Agenda
  • The Goal
  • Mythology and other interesting anti-facts and
    facts
  • Reorganizing Tables
  • Offline
  • Online
  • Entire table
  • Shrinking Space
  • Purging
  • Reorganizing Indexes
  • Offline
  • Online
  • Entire rebuild
  • Coalesce

3
  • Goal

4
The goal
  • The Goal is to not have to reorganize
  • Do not get into ground hog day mode
  • There are long term solutions for many of the
    common issues
  • The trick is understanding why something happens,
    so you can develop a corrective action
  • EG we purge old data, this leaves table 30
    empty, we want to reclaim this space
  • You could rebuild table
  • You could use partitioning to age old data out
    instead

5
  • Mythology
  • And other interesting anti-facts and facts
  • (found in that inter-web thingy)

6
Indexes
  • Oracle index nodes are not physically deleted
    when table rows are deleted, nor are the entries
    removed from the index. Rather, Oracle
    "logically" deletes the index entry and leaves
    "dead" nodes in the index tree where that may be
    re-used if another adjacent entry is required.
  • However, when large numbers of adjacent rows are
    deleted, it is highly unlikely that Oracle will
    have an opportunity to re-use the deleted leaf
    rows, and these represent wasted space in the
    index. In addition to wasting space, large
    volumes of deleted leaf nodes will make index
    fast-full scans run for longer periods.

If this were true, all indexes would grow, and
grow and grow and grow. If index space were not
reused, all indexes would always need to be
rebuilt at some point in time. It was never true.
Myth1.sql
7
Indexes
If this were true, all indexes on sequence
populated columns and most on dates/timestamps
would be unbalanced. It is physically impossible
to have an unbalanced index. It is most probable
that rebuilding an index with height of 4 will
result in an index of height.. 4
  • Hence, an Oracle index may have four levels, but
    only in those areas of the index tree where the
    massive inserts have occurred. Oracle indexes can
    support many millions of entries in three levels,
    and any Oracle index that has four or more levels
    would benefit from rebuilding.
  • Note that Oracle indexes will spawn to a
    fourth level only in areas of the index where a
    massive insert has occurred, such that 99 of the
    index has three levels, but the index is reported
    as having four levels.

8
Indexes
  • Add to this the various node splitting
    algorithms Oracle uses for non-sequential inserts
    and updates and you can easily see why clustering
    factor increases and can become out of sync with
    reality. An index rebuild coalesces nodes and
    aligns them with the underlying table. Now, in
    many cases this reduces the clustering factor,
  • However, I may have stated things unclearly, the
    goal in index rebuilding is not to reduce
    clustering factor, that is actually a desired
    by-product, Clustering factor ratios are just
    one of several indicators that can tell you an
    index needs to be investigated.

The cluster factor is a metric that describes how
sorted the table data is with respect to the
index key in the given index. Rebuilding an
index can never have any effect on the clustering
factor It requires a table reorganization
Cf.sql Myth2.sql
9
Indexes
DEL_LF_ROWS is very unreliable as a method to
detect an index that needs to be
rebuilt. Contrary to popular belief deleted
rows are in fact cleaned out An index that is
most in need of a rebuild from time to time will
not make itself known using this technique. An
index that will be perfectly fine in a couple of
minutes will be flagged erroneously
  • The second rule of thumb is that the deleted
    leaf rows should be less than 20 of the total
    number of leaf rows. An excessive number of
    deleted leaf rows indicates that a high number of
    deletes or updates have occurred to the index
    column(s). The index should be rebuilt to better
    balance the tree. The INDEX_STATS table can be
    queried to determine if there are excessive
    deleted leaf rows in relation to the total number
    of leaf rows.

Myth3.sql
10
Indexes
If the indexed data arrives randomly (last_name
for example) this is utter non-sense. The index
might end up 50 utilized and a rebuild could
make it 90 utilized for the next couple of
minutes! Think sweeper index from prior slide.
They are candidates for period coalesce or
rebuilds.
  • Indexes that undergo frequent insert, update and
    delete operations need to be rebuilt regularly to
    prevent fragmentation

11
Indexes
This is why atomic_refresh gt false on a
materialized view might be relevant. This is why
you want to consider partitioning and other
physical structures if you have two index
segments, and flip flop partitions, this wont be
an issue. Knowing that this is true is the first
step to solving the problem. Or at least
identifying when you might want to
coalesce/rebuild
  • Index space is not reused within a transaction.
    Hence DELETEINSERT will tend to increase the
    size of the index greatly

fact1.sql
12
Indexes
If you leave a few stragglers behind delete
most but not all old entries then the left hand
side of the index might become brown. Since
you are inserting monotonically values only the
right hand side gets hit You could rebuild or
coalesce forever. Or you could fix it with a
reverse key index (but only if the only goal was
to reclaim space! Range scan issue)
  • Sweeper indexes are candidates for periodic
    rebuilds in order to either
  • Reclaim space
  • Improve the performance of select id from t
    order by id style queries

13
Indexes
?
Well, rebuilding the table over and over could be
an option (especially with dbms_redefinition
ORDERBY_COLS) But so could BTree clustering,
Hash clustering, Index organized tables,
Partitioning anything that forces location on
data
  • If you do large range scans, you should
    reorganize the table to be sorted by the index
    key

14
Indexes corollary
This is false in most all general cases. It is
true in one special case when you access only
the index and not the table and you do so during
a RANGE SCAN (not a fast full scan) Most logical
IOs will be against the table, not the
index And the detrimental side effects?
Increased contention. Special memory set aside
that cannot be used for other stuff. More
management for you.
  • Rebuild indexes that you range scan in a
    tablespace with a larger blocksize than the
    default blocksize. This will reduce logical
    IOs by 50.

Myth4.sql
15
  • How To
  • For
  • Tables

16
How do I reorg a table?
  • Suppose you get the question how do I reorg a
    table
  • You answer quickly Alter table T move, rebuild
    indexes
  • And say they did that, what did they get?
  • Took 2 hours of downtime (big table, lots of
    indexes)
  • Cost , they could have been doing something
    truly productive
  • Did they achieve their goal? Not in this case,
    their perceived problem was chained rows (say
    1,000 of them)
  • After the reorg, they still had 1,000 chained
    rows!
  • We need to ask WHY, what is the goal.
  • Before you do any reorganization you want to know
    why and understand how the operation will solve
    that problem

17
How do I reorg a 50gig database
At a given customer site, I must reorganize a 50
Gigs 8.0.5 Prod DB in a single day. Note that I
do not know the environment yet (I am replacing
somebody leaving on vacation without providing
any analysis report. This guy only told this DB
is bad, we must re-organize it and left on
vacation...). I only know the DB is running on
a Windows (NT) server, the 50 Gigs DB is
mission-critical (hosting the PeopleSoft
Financials suite), the DB is said to be
highly-fragmented at the tablespace-levels and
highly chained at the table-levels but no
bad-perf issue has been apparently reported. I
do not have disk space to create a new db aside
the current one, I only have One Day to
successfully carry out the whole thing. What
would be the right approach ? Keeping in mind,
not to loose anything on the way ... (i.e., low
risk). What would be the best strategy ?
18
How do I reorg a 50gig database
  • I asked why, what is the goal
  • Answer
  • Tablespace fragmented as reported by Toad
  • Chained rows, must get rid of chained rows
  • Turns out peoplesoft uses lots of longs, wont
    matter how many times you rebuild will it.
  • Fragmented tablespace so what? 1 extent or 500
    extents. So what? They dont drop/truncate, so
    so what
  • Only answer is dont even think about doing
    this. What would have happened had I just
    answered the question!

19
How to for Tables
  • Reorganizing Tables
  • Offline
  • Online EE only
  • Entire table
  • Shrinking Space
  • Purging
  • Mostly online
  • Materialized view trick, all editions

20
How to for Tables - offline
  • Two basic approaches
  • ALTER TABLE T MOVE
  • Very offline
  • Queries can proceed while the alter table
    executes
  • Immediately after completion, all indexes go
    invalid
  • That likely kills all queries till they are
    rebuilt
  • Downtime for modifications time to move time
    to rebuild all indexes
  • Downtime for reads time to rebuild all indexes

21
How to for Tables - offline
  • Two basic approaches
  • Make read only Create table as select index
    constrain drop old rename new
  • About same amount of downtime for writes
  • Less downtime for reads (just time to drop and
    rename)
  • Requires more space 2 tables, 2 of each index.
    ALTER MOVE just requires 2 tables for a moment
    plus 1 of each index. And then 1 table and 1
    copy of 1 index plus individual ones as you
    rebuild them
  • Much more work

22
How to for Tables - offline
  • Two basic approaches
  • Never ever use
  • exp/imp (CLM)
  • Expdp/impdp
  • Dump and load
  • Given a choice, I would always choose alter move
  • Simplicity
  • No loss of anything
  • Given need for continuous reads, CTAS

23
How to for Tables - online
  • Enhanced Online Table Redefinition
  • Easy cloning of indexes, grants, constraints,
    etc.
  • Down to 4 easy steps
  • Create new, empty table
  • Start redef (initial copy)
  • Copy_Table_Dependents, instead of manually
    indexing, altering, etc
  • Finish redef
  • Convert from long to LOB online
  • Allow unique index instead of primary key

Redef.sql
24
How to for Tables - online
  • DBMS_REDEFINITION
  • Ranks high on the safety scale
  • Direct path load can use nologging
  • Dependent objects can be
  • Copied automatically
  • Done by hand
  • Combination of above
  • You want primary key (to avoid extra rowid column
    at end of redefed table that will be dropped)
  • High update tables will require frequent syncs

25
Online Segment Shrink
  • Table fills over time
  • You delete rows
  • Lots of whitespace
  • You want to get it back
  • You full scan and want it smaller
  • In the past
  • Alter table move, Alter index rebuild
  • Export/Import
  • Offline
  • Not any more

shrink.sql
26
  • How To
  • For
  • Indexes

27
Here is an email I got once .... HELP!!!!
Riddle me this batman... Why does an rebuilding
an index cause increased redolog generation AFTER
the index has been built? I have a table 35
million rows and an index (nothing is
partitioned) Transactions against this table are
constant. It's always 500,000 rows per day. This
generally creates 10 logs a day Once a month
the indexes are rebuilt. (Alter index rebuild) On
the day following the indexes rebuild 50 logs are
created On the following days 45...40...35...30...
.25....down to 10 at 10 logs this remains
constant at 10 Mining the logs we see that we
have increase INTERNAL INDEX UPDATES Why does
this happen?? Is this always the case??
28
How To For Indexes
  • Remember one of three things will happen as a
    result of a reorganization
  • Itll go better
  • Itll not change at all
  • Itll be much worse than it was before

29
Have metrics and live up to them
  • Keep metrics
  • Statspack/AWR for example, see if LIOs go down
  • Application level statistics
  • Evaluate against them
  • Do an index rebuild
  • Come back tomorrow and verify you did more good
    then harm
  • Rebuilding can be good
  • Coalescing even better (online, without the
    overhead)
  • Most of the time, it is not even needed and can
    do more harm then good
  • Does not mean Ive said you never have to rebuld
    an index
  • Bitmaps
  • Secondary indexes on IOTs
  • Text indexes for example

30
How To For Indexes
  • Reorganizing Indexes
  • Offline
  • Online
  • Entire rebuild
  • Coalesce

31
How to for Indexes - offline
  • Two basic approaches
  • Drop and create
  • Entirely offline
  • Create and swap
  • Almost online in SE
  • Read only during index create

swap.sql
32
How to for Indexes - online
  • Two basic approaches
  • Rebuild
  • Optionally online
  • Need approximately 2 times the storage
  • Use existing index to copy from
  • Skip sort
  • Less physical IO
  • Coalesce
  • Need only current storage
  • Online
  • Combines logically adjacent index blocks as much
    as possible

compare.sql
33
How to for Indexes - online
  • Two basic approaches
  • If I had my way it would be coalesce
  • Always online
  • Least space needs
  • No locking issues
  • Does not skinny up the interior (riddle me this
    batman)

34
ltInsert Picture Heregt
Write a Comment
User Comments (0)
About PowerShow.com