DATA MANAGEMENT 101 - PowerPoint PPT Presentation

1 / 34
About This Presentation
Title:

DATA MANAGEMENT 101

Description:

Tools and resources for finding and sharing data ... Metadata and data are like Siamese twins: Split them up and the lives of both are at risk ... – PowerPoint PPT presentation

Number of Views:113
Avg rating:3.0/5.0
Slides: 35
Provided by: marlen71
Category:

less

Transcript and Presenter's Notes

Title: DATA MANAGEMENT 101


1
DATA MANAGEMENT 101
  • Marlene Doyle Dianne Watkins
  • EMAN Coordinating Office

2
Outline
  • Key data management concepts
  • Tools and resources for finding and sharing data
  • Tools and resources specific to EMAN recommended
    protocols

3
Key Concepts
  • Data management is important
  • Digitizing your data is important
  • Documenting your data is important
  • Sharing your data is important
  • Archiving your data is important

4
How important is it, really?
  • Excellent monitoring framework
  • Great study design
  • Excellent training
  • Top of the line equipment
  • Intensive study implemented
  • Field quality assurance considered

5
Early consideration of Data Management
  • Hockey Metaphor Early easy
  • Ask yourself key questions early
  • Who needs the data?
  • What kind of data management is needed to meet
    these uses?
  • Who else might be able to use your data in the
    future?

6
Digitize your data
  • Tedious but necessary
  • Accuracy is critical
  • Store your paper copies for reference
  • ..Enough said

7
Document your data
  • Create metadata records Data about data
  • Data descriptors that document the when, where,
    what, why, how and how good of sample
    collection and analysis

8
Document your data
  • Why is this important?
  • Allows your data to live on and be useful in the
    future
  • Your data can be discovered, understood and
    possibly used by others
  • Defend your results
  • Creates credibility, builds trust
  • Enable integration with or calibration against
    other data sets

9
Key Metadata Fields
  • Basic program meta-data
  • who, when, sample period,
  • Sampling Design
  • number and distribution of monitoring stations,
    maps, whats changed since last sampling event
  • Detailed Protocol metadata
  • Processing steps?
  • Whats changed?
  • Data quality
  • Training of participants, certification,
  • Clarify data codes, assumptions, standards used

10
Rules of engagement
  • Metadata and data are like Siamese twins Split
    them up and the lives of both are at risk

To have and to hold, for richer and for poorer,
in sickness and in health
11
Rules of engagement
  • Dont compare apples to oranges
  • Be careful not to integrate data from different
    sources of mixed methods and unknown quality

12
Rules of engagement
  • Dont compare apples
  • to appels
  • to apels.
  • Comply to accepted standards for metadata and
    data
  • Taxonomic keys
  • Pick lists

13
Sharing Finding data
  • Why would you share data
  • How would you find out who was monitoring
  • Climate change impacts on sea level?
  • Amphibians?
  • Permafrost depth?

14
Sharing data
  • Put your metadata somewhere where others can find
    it. Registries available
  • EMAN Site Directory (www.eman-rese.ca)
  • Geospatial GeoConnections Discovery Portal
    (www.geoconnections.ca)
  • Biological National Biological Information
    Infrastructure (NBII) Metadata Clearinghouse
    Gateway (www.nbii.gov)
  • Ecological Knowledge Network for Biocomplexity
    (KNB) (http//knb.ecoinformatics.org//index.jsp)

15
Archiving your data -
  • Good news! Sharing archiving in many ways
  • Do periodic back-ups of data
  • Provide complete set of data and documentation,
    reports and maps in hard copy and digital formats
    to your current organization
  • Post in applicable metadata registry
  • Avoid duplication of data sets!!

16
Invest now. Invest for the
future.
  • People power and will be required to
  • Establish your data management system
  • Get your data into the system
  • Maintain the system in the long term
  • Use your system

Remember me? Im your future data needs. Budget
for me, please!!
17
Refresher How to manage data well
  • Consider who will use your data and how at the
    design stage
  • Digitize your data and ensure its accuracy
  • Write metadata files so data sets can be
    understood
  • Implement standards Dont mix apples and oranges
    or apples and apels
  • Keep your metadata and data together
  • Share your metadata files on appropriate
    registries
  • Archive your data

18
But data management is boring and I dont know
anything about it. Im an ecologist not a
computer scientist. What do I do?
  • Good news! Theres some tools out there to help
    you
  • But, data management will still require some
    careful consideration on your end.

19
Metadata sharing tools EMAN Site
Directorywww.eman-rese.ca
  • On-line searchable database of partner sites and
    research
  • Form based metadata entry
  • Shares data with NBII and GeoConnections
  • Will be updated in 2005- 2006
  • On-line form
  • Sharing metadata tools
  • Doesnt host data

Partner metadata
EMAN metadata
Existing
Planned
NBII
GeoConnections
20
(No Transcript)
21
Metadata registry NBII Metadata Clearinghouse
http//mercury.ornl.gov/nbii/
  • Inventory of biological data sets
  • Relies on standardized metadata
  • Contributors can register metadata directly
  • Harvest data from nodes (e.g. EMAN Coordinating
    Offfice)

22
NBII Metadata Clearinghousehttp//mercury.ornl.go
v/nbii/
23
Metadata Registry GeoConnections Discovery
Portal www.geoconnections.ca
  • Accepts standardized geospatial data that uses
    web services
  • Less focus on biological aspect

24
GeoConnections
25
Tools for community groups to contribute and
share data
  • EMAN Data Management system http//www.on.ec.gc.c
    a/eman/index.cfm
  • OBBN http//obbn.eman-rese.ca
  • WIX http//www.ontariowatersheds.ca
  • Community Mapping Network http//www.shim.bc.ca/
  • Comap http//www.comap.ca/

26
EMAN Data Management Tools and Support
  • Standardized field sheets
  • Metadata forms and tools (2005-2006)
  • On-line databases
  • EMAN Data Management System
  • OBBN/CABIN database Benthos
  • NatureWatch database
  • Databases can talk to one another through web
    services

27
EMAN Data Management THE FUTURE
  • A comprehensive, integrated system for acquiring,
    managing, querying, processing, collating and
    archiving data generated by partners employing
    the EMAN suite of Ecosystem Monitoring Protocols

Government data
Community Data
Academia data
NGO data
http//www.on.ec.gc.ca/eman/index.cfm
28
EMAN Data Management System
  • On-line data entry
  • 3 protocols available currently
  • Terrestrial Vegetation Biodiversity
  • Backyard Call Count
  • Roadside Survey
  • Allows public to search, map, download data from
    EMAN partners
  • http//www.on.ec.gc.ca/eman/index.cfm
  • Live demo

29
Benthic Macroinvertebrate Database
  • Seamless integration of data between the Ontario
    Benthos Biomonitoring Network (OBBN) and the
    Canadian Aquatic Biomonitoring Network (CABIN)
  • System for capturing, managing, analyzing and
    sharing benthic macroinvertebrate data
  • Includes analytical tools for report card style
    output and integration with GIS mapping
  • http//obbn.eman-rese.ca
  • Data to be shared via use of
    common standards for interoperability

30
(No Transcript)
31
Watersheds InfoXchangewww.ontariowatersheds.ca
32
(No Transcript)
33
Viewing data using web services
  • Ontario Region Information System for the
    Environment http//www.on.ec.gc.ca/ORISE
  • Will be EMAN viewer
  • Community Mapping Network http//www.shim.bc.ca/
  • ResEau (planned) www.environmentandresources.ca

34
Phew! You made it to the end
  • For more information contact
  • Marlene Doyle
  • Marlene.doyle_at_ec.gc.ca
  • (905) 336-4418
Write a Comment
User Comments (0)
About PowerShow.com