Title: A logically interrelated collection of shared data and a description of this data, physically distri
1Distributed Database
- A logically interrelated collection of shared
data (and a description of this data), physically
distributed over a computer network. - A single logical database is split into a number
of fragments, each stored on one or more
computers connected by a communication network. - DDBMS the software that manages the database
and makes the distribution transparent to users.
2Distributed Processing
- A centralized database that can be accessed over
a computer network.
3The Client/Server Architecture
4A Multi-Tier Architecture Environment Example
5Parallel DBMS
- A DBMS running across multiple processors and
disks that has been designed to execute
operations in parallel, whenever possible, in
order to improve performance.
6Homogeneous Distributed Database
7An Oracle Distributed Database System
8Advantages of DDBMS
- Many organizations are naturally distributed over
several locations. - Data can be placed at the site close to the users
who normally use that data. - A failure at one site or communication link does
not make the entire system inoperable. - It costs much less to create a system of smaller
computers with the equivalent power of a single
large computer. - New sites can be added to the network without
affecting the operations of other sites.
9Disadvantages of DDBMS
- Complexity
- Cost
- Security
- Integrity control
- Lack of standards
- Lack of experience
- Database design
10One Reference Architecture for DDBMS
- Global external schemas external views
- Global conceptual schema as a centralized DB
- Fragmentation schema how the data is to be
logically partitioned - Allocation schema where the data is to be
located - Schemas for local DBMS
11Component Architecture for a DDBMS
- Local DBMS component
- Data Communication component
- Global System Catalog
- Distributed DBMS component
12Functions of a DDBMS
- Allow the transfer of queries and data among the
sites using a network. - System catalog should store data distribution
details. - Distributed query processing
- Extended concurrency control
- Extended recovery services
13Objectives of Fragmentation and Allocation
- Locality of reference
- Improved reliability and availability
- Acceptable performance
- Balanced storage capacities and costs
- Minimal communication costs
14Data Allocation
- Centralized
- Partitioned (or fragmented)
- Complete replication
- Selective replication
15Reasons for Fragmentation
- In general, applications work with views rather
than entire relations. - Data not required by local application needs not
be stored efficiency and security. - Increase degree of concurrency or parallelism in
the system.
16Disadvantage of Fragmentation
- Lower performance of applications that require
data from several fragments located at different
sites. - Difficult integrity control if data and
functional dependencies are fragmented and
located at different sites.
17Correctness of Fragmentation
- Completeness each data item that can be found
in the original relation must appear in at least
one fragment. - Reconstruction there must exist a relational
operation that will reconstruct the original
relation from the fragments. - Disjointness if a data item appears in a
fragment, it should not appear in any other
fragment except the primary key in the case of
vertical fragmentation.
18Database Link
19Network Directories and Global Database Names
20Replication
- The process of copying and maintaining database
objects, such as tables, in multiple databases
that make up a distributed database system.
21Reasons for Replication
- Availability it provides application with
alternative data access options. If one site
becomes unavailable, users can continue to query
or even update the remaining locations. - Performance users can access data from the
replication site that has the lowest access cost.
- Disconnected Computing snapshots enable users
to work on a subset of a database while
disconnected from the central database server. - Network Load Reduction applications can access
various regional servers instead of accessing one
central server. - Mass Deployment deployment templates enable you
to create multiple snapshot environments quickly.
22Multimaster Replication
23Read-Only Snapshot Replication
24Updateable Snapshot Replication
25Hybrid Replication