Title: Benefits in General
1The Benefits of Implementing a Corporate Metadata
Repository at the Census Bureau
Presenter Jerome M. Garrett Systems
Support Division
April 29, 2003
2Agenda
- What is Metadata?
- What is a Metadata Repository?
- What is a Corporate Metadata Repository (CMR)?
- How Can a CMR Benefit the Census Bureau?
- Developments Thus Far
- Next in Store
- Questions
3What is Metadata?
Examples include
- Names and Definitions
- Valid values and codes
- Sample Frames
- Memoranda
- Procedures
- Edits
- Imputation Formulas
- Press Releases
- Questionnaires
- Dataset Definitions
- Specifications
- and much more
Metadata all of the information that makes our
data understandable or can be re-used in our
survey processes.
4What is a Metadata Repository?
- Security
- User accounts
- User groups
- Workflow
- Roles
- Phases
- Web-enabled
- No client s/w
- Input/Output Mechanisms
- standard formats
- custom
- requirements
Metadata Repository an electronic catalog for
controlled, secure access to metadata which
encourages sharing, reuse and exchange.
5What is a Corporate Metadata Repository?
Includes
Corporate Metadata Repository a metadata
repository implementation targeted to specific
business needs, organized with an enterprise
taxonomy and designed to easily accommodate
existing business practices.
- Survey/Census Components
- BOC Survey Lifecycle Phases
- BOC-wide Document Categorization scheme
- BOC Intranet access
- A design which accommodates future requirements
- Functionality that can support Software
Engineering principles and Project Management
practices.
6What is a Corporate Metadata Repository? - contd
In a nut shell, a well designed Corporate
Metadata Repository improves Communication,
Coordination, Efficiency, Quality
Productivity.
7Corporate Metadata Repository Architecture
8How Can a CMR Benefit the Census Bureau?
- Because a CMR can be used by all areas of the
Bureau, it is sometimes hard to comprehend how
it may help each individual.
9Lets ask Joan Sample
- Joan Sample works in the statistical sampling
area. - She is called upon to develop and document the
sampling methodology for new survey programs. - Joan also writes the requirements used by the
programming area to code the sampling programs.
10The benefits for Joan
- Joan can enter the information on the universe,
sampling frames and strata into a SAMPLE REGISTRY - Joan can also route the sample information
through the approval chain for sign off. - Joan can upload any accompanying documents
(memos, spreadsheets, etc.) onto the project
PORTAL site.
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
11Lets hear from Mary Forms
- Mary works in the questionnaire content
development area. - She is called upon to develop the content of new
survey questionnaires and to work with the
questionnaire layout area to develop the final
form. - Mary also works with the processing area to
determine how data captured fields are
represented.
12The benefits for Mary
- Mary can enter all of the information into the
QUESTIONNAIRE REGISTRY (including design spec
information, instruction text, response labels,
and questions numbers) - Mary can also route questionnaire information to
the layout area and receive automatic
notification when the layout is done. - Mary can view a .PDF of the complete form.
- Mary can assign and define response cells for
each field of the questionnaire with the RESPONSE
CELL tool.
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
13A few words from Lisa Layout
- Lisa works in the questionnaire layout area. She
works a lot with Mary Forms - She is called upon to develop the layouts of
survey questionnaires for later printing by the
print shop or by a print contractor. - Lisa works with a COTS publication software tool
to develop the professional looking BOC forms.
14The benefits to Lisa
- Lisa can import the questionnaire content
directly into the COTS publication tool. (The
design specification is also imported) - Since the content is imported directly, Lisa no
longer has to cut and paste or transcribe
content. This eliminates the possibility of
transcription errors. - When Lisa is done, a notification is sent to Mary
Forms. Also a .PDF of the new form is output to
the CMR. - Lisa is notified by the CMR when additional
changes are desired.
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
15How about Lou HQ?
- Lous area makes sure the survey meets the
sponsors needs. He defines the objectives and
terminology. He also develops the clerical
procedures and computer programming requirements. - Additionally, Lous area develops program
schedules and resource requirements. - Lou also coordinates the activities of all
involved areas.
16Benefits to Lou HQ
- Lou can create and maintain all Data Items with
the DATA ELEMENT REGISTRY. (This includes
formats, valid values and size information.) - Lou can also define all files produced by the
system, including record layouts, with the DATA
SET REGISTRY. (This includes information on the
recipient of the data files and the location of
the files.) - Lou can enter and maintain all rules for editing
data elements into the BUSINESS RULE REGISTRY. - Lou can store all external documents (IE.,
requirements, procedures, schedules, etc.) in the
PORTAL site
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
17Dont forget Tina FieldRep
- Tina FieldRep has worked in the Field Division
for years - Tina is responsible for producing all Field
materials including, recruiting pamphlets,
training materials, clerical procedures, job
aids, checklists, planning documents, etc. - Whenever field enumeration is involved, Tina is
there.
18Benefits to Tina FieldRep
- All documents that Tina has to either produce or
reference can be accessed from the PORTAL site. - Tina can go to the DATA ELEMENT REGISTRY to print
out a Code Book of all valid values of the data
elements for a particular survey. - Tina can also go to the SAMPLE REGISTRY for
information on the anticipated workload in a
given Sampling Unit.
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
19A word from Peter Capture
- Peter Capture has worked on data capture systems
for years. - Peter Capture develops the barcodes for scanning.
- Peter also is responsible for check-in and
check-out procedures in the processing offices.
20The benefits to Peter Capture
- Peter can enter/maintain pertinent capture
information about each response cell into the CMR
using the RESPONSE CELL TOOL. - Peter can also output a specification for the
Key from Image system of data capture
information using the RESPONSE CELL TOOL. - All documents that Peter has to produce or access
can be located on the PORTAL site.
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
21What about Larry Processing?
- Larry Processing handles all data that has been
captured. - Larry writes all software for Editing, Imputing,
and other ways to improve data quality. - Larry is also responsible for creating output
files for analysis and publication.
22Benefits to Larry Processing
- Larry can go to the BUSINESS RULE REGISTRY to
retrieve the rules for editing, imputing and
other ways of correcting the data. - Larry can go to the DATA SET REGISTRY for the
layouts of all output files. Larry will also
input the file locations after files are created.
- Larry will update the DATA ELEMENT REGISTRY with
new fields created by his processing. - Larry can go to the SAMPLE REGISTRY for
information needed to perform weighting.
All of the information is easily available to
select members or the entire team. When deemed
appropriate, the information can also be made
available to the entire BOC or reformatted to go
to users of our data.
23How about Pam Tables?
- Pam Tables creates the publication tables.
- Pam provides the publication tables to the
dissemination system (DADS/AFF) . - Pam is also charged with providing DADS/AFF with
all of the other metadata that can be made
available to the public.
24Benefits to Pam Tables
- Pam can use the MATRIX TABLE REGISTRY to create
Matrix tables or modify existing matrix tables. - Pam can electronically solicit approval and
automatically output the matrix tables to
DADS/AFF. - Finally, Pam can use the TIER 2 PRODUCT REGISTRY
to create Tier 2 product files for DADS/AFF, and
a QUALITY TOOL to verify the files before sending
to DADS/AFF.
The technology utilized can be easily modified to
output Matrix table and Tier 2 product metadata
to systems other than DADS/AFF.
25How about Tonya Block?
- Tonya Block has provided geographic support for
years. - In addition to developing the files for sampling,
Tonya also produces the maps needed for Field
activities. - Tonya delivers map metadata to DADS/AFF.
26Benefits to Tonya Block
- Tonya can use the DATA ELEMENT REGISTRY to
maintain geographic definitions, concepts and
variables. - Tonya can use the DATA SET REGISTRY to create the
layouts of the various geographic reference files
and sample universe files - Tonya can also use the TIER 1 PRODUCT REGISTRY to
create the files for DADS/AFF and a QUALITY TOOL
to verify the files before sending to DADS/AFF.
The technology utilized can be easily modified to
output Tier 1 product metadata to systems other
than DADS/AFF.
27How about Bob and Jane Manager?
- Bob and Jane manage census and survey programs.
- Both report to executive level staff and prepare
responses to congressional inquiries. - Bob and Jane approve of changes when systems go
into production.
28Benefits to Bob and Jane
- Bob and Jane can use the
- SAMPLE REGISTRY, DATA ELEMENT
REGISTRY, DATA SET REGISTRY,
QUESTIONNAIRE REGISTRY, MATRIX TABLE REGISTRY,
RESPONSE CELL TOOL, BUSINESS RULE REGISTRY,
PRODUCT REGISTRIES - to stay abreast of project progress and grant
electronic approvals. - Bob and Jane can also use the PORTAL site to post
and access project documents.
All roles required to grant approvals can be
assigned to multiple people so that progress is
not hindered if a key team member is not
available. The designation of roles and
responsibilities is determined by project
managers.
29And finally, Joe and Susie Public?
- Joe and Susie Public rely on metadata to make our
data more understandable. - Good Metadata empowers users to compare data from
different sources.
30Benefits to Joe and Susie Public
- Thanks to a CMR, the data products received by
Joe and Susie Public are accompanied by accurate
and complete metadata with which they can draw
sound conclusions about the data. - Also, if Joe and Susie refer questions to BOC
personnel, the questions can be quickly answered,
even by BOC staff that did not work on the survey
or census, thanks to a well administered CMR.
31A CMR Can Support All Areas of Survey/Census
Program Development
Sampling
Program Management
Repository
Data Elements
Form Content
Geography Files Maps
Forms
Specs
Universe
Procedures
Pub. Tables
Files
Rules
Rqmts
Workflow
Public Users
Form Design
Security
Publication Tables
Intranet
Program Development
Data Collection
Data Processing
Data Capture
32Developments Thus Far
CMR Core Service Components
Applications
Portal Sites
- ACS Pilot
- ASM Pilot
- Econ 2002 Census Design
- Econ 2002 XML Interchange
- Tier 1 GEO Products cleansing/delivery to
DADS/AFF - Econ 2002 Data Capture System
- Tier 2 Table Products cleansing/delivery to
DADS/AFF - TMO SCIF
- Decennial 2003 IBEAM Pilot
- Current Population Survey Portal
http//cmr.ssd.census.gov1919 - Project Management Repository http//cmr.ssd.censu
s.gov8888/pls/portal30/url/page/pmr - Quality Management Repository http//cmr.ssd.censu
s.gov8888/pls/portal30/url/page/qmr - Corporate Metadata Repository http//cmr.ssd.censu
s.gov - Human Resources Division http//cmr.ssd.census.go
v7979/pls/portal30/url/page/hr_portal - Field Directorate http//cmr.ssd.census.gov2929/p
ls/portalfld - Policy Office http//cmr.ssd.census.gov7979/pls/p
ortal30/url/page/pol_portal
- Data Element Registry
- Data Set Registry
- Questionnaire Registry
- Question Registry
- XML Interchange
Additional information about all software
developed so far can be provided at a later time.
33Quick Look - CMR Home Page
34Quick Look Data Element Registry
35Quick Look Dataset Registry
36Quick Look Econ 2002 Census Design
37Quick Look Tier 1 Product Registry
38Quick Look Tier 2 Matrix Table Registry
39Quick Look TMO SCIF Application
40Next in Store
CMR Core Service Components
Applications
Portal Sites
- Economic Survey Design for Select Current Surveys
- Decennial 2004 Test IBEAM Pilot
- Decennial 2006 Test IBEAM
- TMO SCIF Enhancements
- ..
- Numerous sites slated for development
- Business Rule Registry
- Response Cell Registry
- Sample Registry
- .
Future development can be influenced by yet
unknown program area requirements.
- All registries can be enhanced to meet
specific requirements.
- Applications can be developed to meet
applications
41For further Information
Portal
Goals Security Re Use Control Ease of Use
Portal
Jerome M. Garrett Metadata Staff Systems Support
Division 763 - 6624
Jerome M. Garrett Metadata Staff Systems Support
Division 763 - 6624
Corporate Metadata Repository
Focus Security Re Use Control Ease of Use
CMR
Jerome M. Garrett Metadata Staff Systems Support
Division 763 - 6624
Portal
CMR
or
Give us a call
Visit our Exhibits
We welcome your input on specific functions you
require which may be delivered by the Corporate
Metadata Repository.
42Questions?