Title: Using Geodatabase Replication To Manage Distributed Data
1Using Geodatabase Replication To Manage
Distributed Data
- David BestRedwood National Park
- Craig Dalby
- Pacific West Regional Office
2008 GIS Data Management Conference
2Contents
- The Problem
- Solutions and Consequences
- Geodatabase Replication
- Pilot Project
- Installing and Configuring SDE Services
- Managing Replication
- Results and Recommendations
- Next Steps
2008 GIS Data Management Conference
3The Problem
- A great challenge is how parks will obtain and
edit national GIS datasets. - Options include moving data (and applications) to
national servers, or - Keeping data local, and pushing changes to
national datasets
2008 GIS Data Management Conference
4Centralized Data
- Field (remote) users give up local applications
and local data in favor of Remote Desktop/Citrix,
or - Repeatedly download and upload data between local
servers and national servers - Few organizations have the network bandwidth to
accommodate local applications directly editing
remote data
2008 GIS Data Management Conference
5Typical NPS Environment
- For most parks there is a dramatic performance
improvement when working with local data. - Local data makes it easier to customize user
interfaces, more efficiently access other data
stored on park servers, and integrate data of
park-only interest.
2008 GIS Data Management Conference
6Geodatabase Replication
- It is inconvenient to move data back and forth
from the edit (local server) and integrated
(national server) environments. - ArcGIS 9.2 introduced new technology for this
problem Distributed Geodatabase. - Allows versioned SDE geodatabases to be deployed
at multiple locations. - Changes are synchronized with replica versions
using either one-way or two-way replication.
2008 GIS Data Management Conference
7Benefits
- A park can house and maintain data for a specific
geographic area, gaining the performance of
local data. - Seamlessly, these data contribute to national
datasets. - Changes made at either the parent (EGIS) or park
(child) can be replicated to the other. - This allows parks and programs to retain
stewardship responsibilities, while ensuring an
authoritative national versions are maintained.
2008 GIS Data Management Conference
82008 GIS Data Management Conference
9Need to Evaluate
- Geodatabase Replication is new want to test
stability and reliability - NPS computing environment is diverse want to
test suitability in small parks with no IT
support and large parks with dedicated GIS
specialists and servers
2008 GIS Data Management Conference
102008 GIS Data Management Conference
11Replication Options
- Conserve bandwidth by
- Re-use schema
- Where target already has data and only needs
updating - Schema only
- Mobile users who input only new data
- Register Existing Data
- Bypass data copy step and use existing data
- Replicate related data
- Include or not include relationship classes
2008 GIS Data Management Conference
12About Synchronization
- Synchronization
- Synchronization is when one replica sends data
changes and the relative replica receives
changes. - Changes include inserts, updates and deletes made
to the replica version. - Must be connected as the same database user that
was used to create the replica - Filters and Relationship Classes
- Filters and classes that were applied during
creation of the replica are also applied during
synchronization - Data transfer based on exchanging replication
messages - Detect and automatically recover from errors by
unreliable messaging.
2008 GIS Data Management Conference
13PWRO Project Funds
- Participating Parks
- Redwood Lassen Volcanic
- Lava Beds Oregon Caves
- Sequoia-Kings Pacific West RO
- Had to overcome reluctance. Difficult for a park
to see benefit. - Common question what am I going to have to do?
2008 GIS Data Management Conference
14Objectives
Can an out-of-the box standardized ArcSDE
deployment be seamlessly and consistently
deployed to park offices?
- Collect and enter building footprints
- Develop methods and procedures to install and
configure two-way replication - Implement two-way replication and train park
staff - Evaluate performance and make recommendations for
EGIS use.
2008 GIS Data Management Conference
15General Steps
- Install SQL Server
- Install SDE
- Perform SDE post-install
- Configure using Access Database
- Create Replica on child
- Schedule Synchronization
2008 GIS Data Management Conference
16Configure Two-way Replication
- MS Access database with embedded VB code is used
to automate - Record information about location, IP addresses,
server name, database locations - Create new Active Directory groups for
administration, editing, and viewing park-based
EGIS data - Create folders/shares for administrative files
- Create SDE connection files
- Enable SQL backup
- Implement SQL database roles and AD security
2008 GIS Data Management Conference
172008 GIS Data Management Conference
182008 GIS Data Management Conference
192008 GIS Data Management Conference
20Database Produces Consistent Share Name with AD
Security
OTHER PARKS
Lassen Volcanic
Redwood
PWRO
EGIS
Lava Beds
21(No Transcript)
22Create database EGIS (SDE created by SDE install)
- Create logins for
- Administrators
- Editors
- Viewers
23- Set permissions
- for SDE and EGIS databases
- Create Database Roles
- Editors
- Viewers
24Connects to parks version on the EGIS database as
the LABE SDE user on the EGIS server
EGIS EGIS Database Park Version as
park SDE use PARK EGIS Database Default
Version as park SDE user
Connects to EGIS database as the LABE SDE user on
the parks SDE server
25SDE Connection Files Can Be Read By ArcCatalog
2008 GIS Data Management Conference
26Right-click to viewConnectionProperties
27Connection Properties
28(No Transcript)
29(No Transcript)
30(No Transcript)
31Automated Backup of SQL Databases Are Consistent
Lassen Volcanic
Lava Beds
PWRO
OTHER PARKS
32(No Transcript)
33Python Script for Synchronization
2008 GIS Data Management Conference
34Create Replica on Child
- In most cases an EGIS responsibility
- Data is mainly pulled instead of pushed.
- No need to disclose EGIS security to parks.
- ArcMap provides spatial filter to limit replica
to parks geographic area (not available in
ArcCatalog) - Filters are great way to eliminate versions and
posting
2008 GIS Data Management Conference
35Activate Distributed Geodatabase Toolbar
36Load data that will participate in replica into
map
37Specify Type of Replica
38Specify Connection and Replica Name
REPLICA OWNER PARENT CHILD
39Specify Spatial Extent
Use to limit geographic area where
synchronization occurs. Useful for avoiding SDE
versions
Use for trusted editors and offices with
oversight responsibility
40Managing Replication
- Synchronization can be manual or automated.
- Synchronization begins for each park as a
separate task within Schedule Tasks. - The task is a batch (.bat) file that calls the
python script created earlier.
FROM EGIS TO
2008 GIS Data Management Conference
41Monitoring Replication
- Right click SDE database
- ? Distributed Geodatabase ?Manage Replicas
2008 GIS Data Management Conference
42Replication Manager
Replication Log
Viewing the Replication Log
43Results and Recommendations
- Complete install about 3 hrs/server
- Get Administrator access
- Other Flavors of SQL/SDE probably not worth coast
savings - SQL backup very easy and reliable
- SDE Service needs monitoring
- Two-way replication avoids permission management
with versions - Replica design should be updatable
- Infrastructure mostly adequate
2008 GIS Data Management Conference
44Results and Recommendations
- Have not trained park users, but do not expect
difficulties. - Unforeseen benefit may be rapid-deployment of
region or network-based integrated datasets. - Datastore should consider one-way data
subscription service for national datasets not
edited by parks. - Annotation is an easy and very useful national
test dataset.
2008 GIS Data Management Conference
45Next Steps
- Complete Klamath Network Parks.
- Encourage frequent park editing.
- Review by EGIS Team.
- Add new theme.
- Move parent to RIMD.
- Work on VPN solution.
- ELA for SQL Server.
- New participants?
2008 GIS Data Management Conference