DATA STEWARDSHIP - PowerPoint PPT Presentation

1 / 27
About This Presentation
Title:

DATA STEWARDSHIP

Description:

... CANNOT BE READILY INTEGRATED TO ADEQUATELY MEET THE BUSINESS INFORMATION DEMAND. ... NEED ADEQUATE DATA RESPONSIBILITY. INCLUDES THREE-TIER DATA STEWARD ... – PowerPoint PPT presentation

Number of Views:170
Avg rating:3.0/5.0
Slides: 28
Provided by: michaelb62
Category:

less

Transcript and Presenter's Notes

Title: DATA STEWARDSHIP


1
DATA STEWARDSHIPDATA RESOURCE QUALITY OHIO
DAMAJANUARY 9, 2001MICHAEL H.
BRACKETTCONSULTING DATA ARCHITECTDATA RESOURCE
DESIGN REMODELING
P.O. Box 7006 Olympia, Washington 98507 Voice
(360) 943-4829 Fax (360) 753-7363 mhbrackett_at_aol.c
om members.aol.com/mhbrackett
2
CONTENTS
  • CURRENT SITUATION
  • A NEW DIRECTION
  • ACHIEVING STABILITY
  • HALTING DATA DISPARITY
  • RESOLVING DATA DISPARITY
  • NEW CONCEPTS
  • CONCLUSION

3
CURRENT SITUATION
  • DISPARATE DATA - A TRUISM
  • DATA THAT ARE ESSENTIALLY NOT ALIKE, OR ARE
    DISTINCTLY DIFFERENT IN KIND, QUALITY, OR
    CHARACTER. THEY ARE UNEQUAL AND CANNOT BE
    READILY INTEGRATED TO ADEQUATELY MEET THE
    BUSINESS INFORMATION DEMAND.
  • BASIC DATA PROBLEM
  • UNKNOWN DATA EXISTANCE
  • ORGANIZATION NOT AWARE OF ALL DATA AT ITS
    DISPOSAL
  • USUALLY NOT EVEN INVENTORIED
  • UNKNOWN DATA MEANING
  • CONTENT AND MEANING OF DATA NOT FULLY KNOWN
  • DATA NOT THOROUGHLY UNDERSTOOD
  • HIGH DATA REDUNDANCY
  • DATA HIGHLY REDUNDANT AND INCONSISTENT
  • AVERAGE REDUNDANCY FACTOR OF 10 FOR LARGE
    ORGANIZATIONS
  • HIGH DATA VARIABILITY
  • DATA HIGHLY VARIABLE IN FORMAT AND CONTENT
  • AVERAGE FACTOR OF 15 TO 20 FOR LARGE ORGANIZATIONS

4
CURRENT SITUATION
  • DISPARATE DATA RESOURCE
  • A DATA RESOURCE THAT IS SUBSTANTIALLY COMPOSED
    OF DISPARATE DATA THAT ARE DIS-INTEGRATED AND NOT
    SUBJECT ORIENTED. A STATE OF DISARRAY WHERE THE
    LOW QUALITY DOES NO, AND CANNOT, ADEQUATELY
    SUPPORT THE BUSINESS INFORMATION DEMAND.
  • DISPARATE DATA CYCLE
  • A SELF-PERPETUATING CYCLE WHERE DISPARATE DATA
    CONTINUE TO BE PRODUCED AT AN EVER-INCREASING
    RATE BECAUSE PEOPLE DO NOT KNOW ABOUT EXISTING
    DATA OR DO NOT WANT TO USE EXISTING DATA.

5
CURRENT SITUATION
  • DATA RESOURCE DRIFT
  • THE NATURAL, STEADY DRIFT OF A DATA RESOURCE
    TOWARDS DISPARITY IF ITS DEVELOPMENT IS NOT
    PROPERLY MANAGED AND CONTROLLED.
  • NO STATUS QUO
  • THERE IS NO STATUS QUO FOR DEVELOPING A
    STABLE DATA RESOURCE
  • NATURAL DRIFT TOWARD DISPARITY WILL CONTINUE
  • A STATUS QUO LEADS TO
  • ORGANIZATIONAL FAILURE BY INFORMATION DEPRIVATION!

6
CURRENT SITUATION
  • RAPID CHANGE
  • RAPIDLY CHANGING BUSINESS STRATEGIES AND
    INFORMATION NEEDS
  • RAPIDLY CHANGING TECHNOLOGY - INCLUDING DATA
    MANAGEMENT
  • THE ONLY THING CONSTANT IS INCREASING RATE
    MAGNITUDE OF CHANGE
  • BUSINESS NEEDS INFORMATION
  • HIGH-QUALITY INFORMATION TO SUPPORT BUSINESS
    ACTIVITIES
  • A DATA RESOURCE TO SUPPORT THAT INFORMATION NEED
    IN SPITE OF CHANGE
  • BUSINESS INFORMATION DEMAND
  • AN ORGANIZATIONS CONTINUOUSLY INCREASING,
    CONTANTLY CHAINGING NEED FOR
  • CURRENT, ACCURATE INFORMATION, OFTEN ON SHORT
    NOTICE OR VERY SHORT NOTICE,
  • TO SUPPORT ITS BUSINESS ACTIVITIES
  • DATA DILEMMA
  • NEED INTEGRATED DATA TO MEET BUSINESS INFORMATION
    DEMAND
  • DATA RESOURCE IS DISPARATE AND THE DISPARITY IS
    INCREASING

7
CURRENT SITUATION
  • BUSINESS INTELLIGENCE VALUE CHAIN
  • DATA RESOURCE - TECHNOLOGY REALM
  • DATA - INDIVIDUAL FACTS, OUT OF CONTEXT, HAVE NO
    MEANING
  • DATA IN CONTEXT - FACTS WITH MEANING THAT CAN BE
    READILY UNDERSTOOD
  • INFORMATION - TECHNOLOGY REALM
  • SET OF DATA IN CONTEXT, RELEVANT TO ONE OR MORE
    PEOPLE, AT A POINT IN TIME
  • MANAGE INFORMATION - BUT NOT AS A RESOURCE
  • KNOWLEDGE - HUMAN RESOURCE REALM
  • RETAINED INFORMATION BUSINESS EXPERIENCE
  • MANAGE AN ENVIRONMENT FOR KNOWLEDGE WORKERS, BUT
    NOT THE KNOWLEDGE
  • BUSINESS INTELLIGENCE - HUMAN RESOURCE REALM

8
CURRENT SITUATION
  • I-ORGANIZATION
  • INTELLIGENT LEARNING ORGANIZATION - HUMAN
    RESOURCE REALM
  • HUMAN RESOURCE REALM IS LINK BETWEEN TECHNOLOGY
    REALM
  • BUSINESS REALM
  • REQUIRES GOOD INFORMATION TO SUPPORT BUSINESS
    STRATEGIES GOALS
  • WHICH REQUIRES A STABLE DATA RESOURCE
  • RESOLUTION OF THE DATA DILEMMA
  • BRING STABILITY TO THE DATA RESOURCE TO TRANSCEND
    CHANGE
  • REQUIRES A THOROUGH UNDERSTANDING OF ALL DATA IN
    THE DATA RESOURCE
  • STABILITY REQUIRES A THOROUGH UNDERSTANDING OF
    ALL DATA!

9
NEW DIRECTION
  • COMPARATE DATA
  • DATA THAT ARE ALIKE, SIMILAR IN KIND, QUALITY,
    AND CHARACTER, ARE EASILY UNDERSTOOD, AND CAN BE
    READILY INTEGRATED
  • COMPARATE DATA RESOURCE
  • SUBJECT ORIENTED, INTEGRATED, HIGH QUALITY DATA
    RESOURCE THAT FULLY SUPPORTS THE CURRENT AND
    FUTURE BUSINESS INFORMATION DEMAND
  • SUBJECT ORIENTED
  • BASED ON BUSINESS OBJECTES AND EVENTS IN THE REAL
    WORLD
  • DATA RESOURCE STRUCTURED BY DATA SUBJECTS
  • ALL CHARACTERISTICS ABOUT A DATA SUBJECT STORED
    WITH THE DATA SUBJECT
  • INTEGRATED
  • FULLY INTEGRATED WITHIN A COMMON DATA
    ARCHITECTURE
  • ONE ARCHITECTURE FOR THE ENTIRE DATA RESOURCE
  • ALL DATA MANAGED WITHIN THAT SINGLE ARCHITECTURE
  • DATA PROPERLY DEPLOYED TO MEET BUSINESS NEEDS
  • A COMPARATE DATA RESOURCE MEANS
  • THINKING GLOBALLY AND ACTING LOCALLY!

10
NEW DIRECTION
  • COMPARATE DATA RESOURCE VISION

COMMON DATA ARCHITECTURE
Comparate Data Resource
Information System
BusinessInformation Demand
Data Resource Guide
INFORMATION
DISPARATE DATARESOURCE
11
NEW DIRECTION
  • COMPARATE DATA CYCLE
  • A SELF-PERPETUATING CYCLE WHERE THE USE OF
    COMPARTE DATA IS CONTINUALLY
  • REINFORCED BECAUSE PEOPLE UNDERSTAND AND TRUST
    THE DATA.
  • STOP THE NATURAL DRIFT OF A DATA RESOURCE
  • BY HALTING THE DISPARATE DATA CYCLE
  • AND STARTING A COMPARATE DATA CYCLE!

12
NEW DIRECTION
  • DATA ARCHITECTURE DEFINITION 1
  • THE METHOD OF DESIGN AND CONSTRUCTION OF A DATA
    RESOURCE THAT IS
  • BUSINESS DRIVEN,
  • BASED ON REAL-WORLD SUBJECTS PERCEIVED BY THE
    ENTERPRISE,
  • AND IMPLEMENTED INTO APPROPRIATE OPERATING
    ENVIRONMENTS.
  • IT CONSISTS OF COMPONENTS THAT
  • PROVIDE A CONSISTENT FOUNDATION ACROSS
    ORGANIZATIONAL BOUNDARIES
  • TO PROVIDE EASILY IDENTIFIABLE, READILY AVAILABLE
    HIGH-QUALITY DATA
  • TO SUPPORT THE BUSINESS INFORMATION DEMAND
  • DATA ARCHITECTURE DEFINITION 2
  • THE COMPONENT OF THE DATA RESOURCE FRAMEWORK THAT
  • CONTAINS ALL THE ACTIVITIES, AND THE PRODUCTS OF
    THOSE ACTIVITIES,
  • RELATED TO THE IDENTIFICATION, NAMING,
    DEFINITION, STRUCTURING, INTEGRITY
  • ACCURACY, EFFECTIVENESS, AND DOCUMENTATION OF THE
    DATA RESOURCE
  • NEW DIRECTION REQUIRES FORMAL DEFINITION OF DATA
    ARCHTIECTURE!

13
NEW DIRECTION
  • COMMON DATA ARCHITECTURE
  • COMMON CONTEXT
  • WITHIN WHICH ALL DATA ARE INVENTORIED AND DEFINED
  • RAISING AWARENESS OF EXISTING DATA
  • UNDERSTANDING THE CONTENT AND MEANING OF DATA
  • IMPROVING DATA QUALITY
  • TRANSFORMING AND INTEGRATING DISPARATE DATA
  • DEFINING NEW DATA
  • MANAGING DYNAMIC DATA DEPLOYMENT
  • SCOPE INCLUDES
  • MANUAL AND AUTOMATED
  • INTERNAL AND EXTERNAL
  • TABULAR AND NON-TABULAR
  • OPERATIONAL AND EVALUATIONAL
  • CURRENT AND HISTORICAL
  • ORGANIZATIONAL - DEPARTMENTAL - PERSONAL
  • BUSINESS DATA AND DATA RESOURCE DATA
  • DISPARATE AND COMPARATE

14
ACHIEVING STABILITY
  • DATA RESOURCE QUALITY
  • A MEASURE OF HOW WELL THE DATA RESOURCE MEETS
    THE CURRENT AND FUTURE BUSINESS INFORMATION
    DEMAND
  • ULTIMATE DATA RESOURCE QUALITY
  • A DATA RESOURCE THAT IS STABLE ACROSS CHANGING
    BUSINESS AND CHANGING TECHNOLOGY SO IT CONTINUES
    TO MEET THE BUSINESS INFORMATION DEMAND
  • ACHIEVING DATA RESOURCE STABILITY
  • PHASE 1 - HALT THE CREATION OF DISPARATE DATA
  • ESTABLISH A COMMON DATA ARCHITECTURE
  • DEVELOP ALL NEW DATA WITHIN THAT COMMON DATA
    ARCHITECTURE
  • PROACTIVE PHASE
  • PREVENTS FURTHER DATA DISPARITY
  • MUST BE DONE FIRST
  • PHASE 2 - RESOLVE THE EXISTING DATA DISPARITY
  • TRANSITION DISPARATE DATA TO A COMPARATE DATA
    RESOURCE
  • FORMAL DATA TRANSFORMATION WITHIN A COMMON DATA
    ARCHITECTURE
  • REACTIVE PHASE
  • RESOLVES DISPARATE DATA

15
HALTING DATA DISPARITY
  • BAD HABITS
  • THE THINGS THAT ARE BEING DONE THAT SHOULD NOT
    BE DONE, OR
  • THE THINGS THAT ARE NOT BEING DONE THAT SHOULD
    BE DONE
  • THAT RUIN DATA RESOURCE STABILITY
  • THE IMPACTS OF BAD HABITS
  • DATA CANNOT BE READILY IDENTIFIED
  • DATA UNDERSTANDING INHIBITED
  • LIMITED AWARENESS OF DATA RESOURCE
  • LIMITED DATA SHARING ACROSS BUSINESS ACTIVITIES
  • DISPARATE DATA CYCLE REINFORCED
  • POOR BUSINESS UNDERSTANDING
  • INAPPROPRIATE USE OF DATA
  • INAPPROPRIATE BUSINESS ACTIONS
  • IMPACTS ON BUSINESS AND PEOPLE
  • LOST PRODUCTIVITY OF BUSINESS CLIENTS IT STAFF

16
HALTING DATA DISPARITY
  • GOOD PRACTICES
  • THE THINGS THAT SHOULD BE DONE TO ACHIEVE A
    STABLE DATA RESOURCE
  • THE BENEFITS OF GOOD PRACTICES
  • IMPROVED DATA UNDERSTANDING
  • DATA ARE READILY IDENTIFIED
  • INCREASED AWARNESS OF THE DATA RESOURCE
  • DATA SHARED ACROSS BUSINESS ACTIVITIES
  • DISPARATE DATA CYCLE HALTED
  • IMPROVED BUSINESS UNDERSTANDING
  • IMPROVED BUSINESS SUPPORT
  • FEWER IMPACTS ON BUSINESS AND PEOPLE
  • PRODUCTIVITY IMPROVEMENT FOR BUSINESS CLIENTS
    IT STAFF
  • NO REASON TO DELAY
  • CONCEPTS, PRINCIPLES, TECHNIQUES ARE AVAILABLE
  • THE SITUATION IS ONLY GETTING WORSE

17
HALTING DATA DISPARITY
  • ARCHITECTURAL GOOD PRACTICES
  • DATA NAMES
  • CURRENTLY HAVE INFORMAL DATA NAMES
  • NEED FORMAL DATA NAMES
  • INCLUDES DATA NAMING TAXONOMY VOCABULARY
    CONCEPT
  • DATA DEFINITIONS
  • CURRENTLY HAVE VAGUE DATA DEFINITIONS
  • NEED COMPREHENSIVE DATA DEFINITIONS
  • THOROUGHLY UNDERSTAND DATA IN BUSINESS TERMS
  • DATA STRUCTURE
  • CURRENTLY HAVE IMPROPER DATA STRUCTURES
  • NEED PROPER DATA STRUCTURES
  • INCLUDES TECHNICALLY CORRECT - CULTURALLY
    ACCEPTABLE CONCEPT
  • DATA INTEGRITY RULES
  • CURRENTLY HAVE IMPRECISE DATA INTEGRITY RULES
  • NEED PRECISE DATA INTEGRITY RULES

18
HALTING DATA DISPARITY
  • NON-ARCHITECTURAL GOOD PRACTICES
  • DATA ORIENTATION
  • CURRENTLY HAVE UNREASONABLE DATA ORIENTATION
  • NEED REASONABLE DATA ORIENTATION
  • INCLUDES FIVE TIER - FIVE SCHEMA CONCEPT
  • DATA AVAILABILITY
  • CURRENTLY HAVE UNACCEPTABLE DATA AVAILABIITY
  • NEED ACCEPTABLE DATA AVAILABILITY
  • BALANCES ACCESS WITH PRIVACY SECURITY
  • DATA RESPONSIBILITY
  • CURRENTLY HAVE INADEQUATE DATA RESPONSIBILITY
  • NEED ADEQUATE DATA RESPONSIBILITY
  • INCLUDES THREE-TIER DATA STEWARD CONCEPT
  • DATA VISION
  • CURRENTLY HAVE RESTRICTED DATA VISION
  • NEED EXPANDED DATA VISION

19
RESOLVING DATA DISPARITY
  • DATA RESOURCE TRANSITION
  • FORMALLY MOVING FROM A DISPARATE DATA RESOURCE
    TO A COMPARATE DATA
  • RESOURCE WITHIN THE COMMON DATA ARCHITECTURE
  • FORMALIZE THE UNDERSTANDING OF DISPARATE DATA
  • INTEGRATES DISPARATE DATA WITHIN THE COMMON DATA
    ARCHITECTURE
  • RESOLVES THE EXISTING DATA DISPARITY
  • NOT A MIGRATION - IMPLIES WANDERING OR PERIODIC
    RETURN TO DISPARITY
  • TRANSITION THROUGH FORMAL TRANSITION STATES
  • DISCOVERY PROCESS DUE TO UNCERTAINTY
  • REQUIRES THOUGHT, ANALYSIS, INTUITION,
    PERCEPTION, AND SOME LUCK
  • DATA RESOURCE TRANSITION IS A FORMAL PROCESS!

20
RESOLVING DATA DISPARITY
  • DATA RESOURCE TRANSITION STATES
  • DISPARATE DATA RESOURCE - CURRENT STATE
  • DATA NOT WELL UNDERSTOOD
  • FORMAL DATA RESOURCE - NECESSARY STATE
  • NECESSARY TO THOROUGHLY UNDERSTAND DISPARATE DATA
  • VIRTUAL DATA RESOURCE - DESIRED STATE
  • REAL-TIME PRODUCTION TRANSFORMATION OF DISPARATE
    DATA

ENTERPRISE DATA RESOURCE
COMMON DATA ARCHITECTURE
FORMAL DATA RESOURCE
VIRTUAL DATA RESOURCE
COMPARATE DATA RESOURCE
DISPARATE DATA RESOURCE
PRODUCTION TRANSFORMATION
PERMANENT TRANSFORMATION
UNDERSTAND
21
RESOLVING DATA DISPARITY
  • FORMAL DATA RESOURCE
  • DATA INVENTORY
  • INVENTORY THE EXISTING DISPARATE DATA
  • RAISES AWARENESS OF THE DATA THAT EXIST
  • SOLVES THE FIRST PROBLEM WITH DISPARATE DATA
  • DATA CROSS REFERENCE
  • CROSS DISPARATE DATA TO COMMON DATA ARCHITECTURE
  • PROVIDES AN UNDERSTANDING OF DISPARATE DATA IN A
    COMMON CONTEXT
  • SOLVES THE SECOND PROBLEM WITH DISPARATE DATA
  • DATA VARIABILITY
  • IDENTIFY DATA VARIABILITY
  • DESIGNATE PREFERRED DATA VARIATIONS
  • DEVELOP DATA TRANSLATION SCHEMES
  • PREPARATION TO SOLVE THE THIRD PROBLEM WITH
    DISPARATE DATA
  • DATA REDUNDANCY
  • IDENTIFY DATA REDUNDANCY

22
RESOLVING DATA DISPARITY
  • VIRTUAL / COMPARATE DATA RESOURCE
  • EXTRACT - EXTRACTING DISPARATE DATA FOR DATA
    TRANSFORMATION
  • TARGET DATA IDENTIFICATION
  • SOURCE DATA IDENTIFICATION
  • DATA EXTRACTION - TO DATA DEPOT
  • TRANSFORM - TRANSFORMING DISPARATE DATA TO
    COMPARATE DATA
  • DATA RECONSTRUCTION - REBUILDING HISTORICAL /
    ARCHIVE DATA
  • DATA TRANSLATION - DEVELOPING PREFERRED DATA
  • DATA RECASTING - ALTERING FOR HISTORICAL
    CONTINUITY
  • DATA RESTRUCTURING - ALTER STRUCTURE TO COMPARATE
    DATA RESOURCE
  • DATA DERIVATION - DERIVE NECESSARY DATA
  • LOAD - EDIT AND LOAD COMPARATE DATA
  • DATA INTEGRITY RULES - APPLY DATA INTEGRITY RULES
  • DATA LOADING - LOAD TO TARGET DATABASE
  • DATA REVIEW - REVIEW TO ENSURE CORRECTNESS
  • ACTUALLY CHANGES THE DATA!

23
RESOLVING DATA DISPARITY
  • DATA TRANSFORMATION PROCESS

24
NEW CONCEPTS
  • FIVE-TIER FIVE-SCHEMA CONCEPT

Strategic Logical
Tactical Logical
Operational Business
Operational Data View
Operational Logical
Operational Deployment
Operational Physical
Analytical Business
Analytical Data View
Analytical Logical
Analytical Deployment
Analytical Physical
Predictive Business
Predictive Data View
Predictive Logical
Predictive Deployment
Predictive Physical
25
NEW CONCEPTS
  • SPECIFIC DATA MODEL CONCEPT

26
CONCLUSION
  • ULTIMATE DATA RESOURCE QUALITY IS STABILITY
    ACROSS
  • CHANGING BUSINESS STRATEGIES INFORMATION NEEDS
  • CHANGING TECHNOLOGY
  • STABILITY ACHIEVED BY
  • ESTABLISHING A COMMON DATA ARCHITECTURE
  • HALTING THE CONTINUED DATA DISPARITY
  • RESOLVING EXISTING DATA DISPARITY
  • NO SILVER BULLETS
  • ATTEMPT TO ACHIEVE SOME GAIN WITHOUT ANY PAIN
  • USUALLY ENDURE CONSIDERABLE PAIN WITH MINIMAL
    GAIN
  • DONT DELAY
  • THE SITUATION WILL ONLY GET WORSE
  • TAKE THE INITIATIVE!
  • ESTABLISH A SET OF DATA STEWARDS

27
AUTHORS BIBLIOGRAPHY
  • DEVELOPING DATA STRUCTURED INFORMATION SYSTEMS
  • KEN ORR ASSOCIATES, 1984
  • DEVELOPING DATA STRUCTURED DATABASES
  • PRENTICE HALL, 1987
  • PRACTICAL DATA DESIGN
  • PRENTICE HALL, 1990
  • DATA SHARING USING A COMMON DATA ARCHITECTURE
  • JOHN WILEY SONS, 1994
  • THE DATA WAREHOUSE CHALLENGE TAMING DATA CHAOS
  • JOHN WILEY SONS, 1996
  • DATA RESOURCE QUALITY TURNING BAD HABITS INTO
    GOOD PRACTICES
  • ADDISON WESLEY, 2000
  • DATA RESOURCE QUALITY RESOLVING DATA DISPARITY
Write a Comment
User Comments (0)
About PowerShow.com