Title: Long Term Ecological Research Network Office Strategic Planning Follow up
1Ecological Informatics Building Solutions for
Multi-Decadal Research
SLA 2008 Conference Seattle, WA 17 June 2008
2Roadmap
- Why multi-decadal research?
- A brief history of LTER
- Data/information challenges
- Ecological informatics
- Current state-of-the-art
- Future
3Roadmap
- Why multi-decadal research?
- A brief history of LTER
- Data/information challenges
- Ecological informatics
- Current state-of-the-art
- Future
4Long-Term Research is Required to Reveal
- Slow processes or transients
- Episodic or infrequent events
- Trends
- Multi-factor responses
- Processes with major time lags
5(No Transcript)
6(No Transcript)
7Roadmap
- Why multi-decadal research?
- A brief history of LTER
- Data/information challenges
- Ecological informatics
- Current state-of-the-art
- Future
8Data Dispersion
- Data are massively dispersed
- Ecological field stations and research centers
(100s) - Natural history museums and biocollection
facilities (100s) - Agency data collections (100s to 1000s)
- Individual scientists (1000s to 10,000s)
9Data Entropy
Time of publication
Specific details
General details
Retirement or career change
Information Content
Accident
Death
Time
(Michener et al. 1997)
10Data IntegrationJones et al. 2007
- Data are heterogeneous
- Syntax
- (format)
- Schema
- (model)
- Semantics
- (meaning)
11Information and Storage
Petabytes Worldwide
Information
Available Storage
Source John Gantz, IDC Corporation The
Expanding Digital Universe
12Roadmap
- Why multi-decadal research?
- A brief history of LTER
- Data/information challenges
- Ecological informatics
- Current state-of-the-art
- Future
13Ecological Informatics
A discipline that incorporates both concepts
and practical tools for the understanding,
generation, processing, and propagation of
ecological data, information and knowledge.
14Data Archives
15Existing Tools Provide Needed Functionality
16Metacat Data Distribution
17VegBank an example warehouse
18Roadmap
- Why multi-decadal research?
- A brief history of LTER
- Data/information challenges
- Ecological informatics
- Current state-of-the-art
- Future
- Science
- Technology
- Sociocultural dimension
19Global ChangeSmith, Knapp, Collins. In press.
20Critical Areas in the Earth System
21The Coupled Science/CI Vision
- Facilitate the long-term access and use of
preserved biological, socio-ecological, and earth
observation data - Data are diverse and complex (multi-scale,
multi-discipline, and multi-national) - Agile, evolutionary CI development building upon
a progressively refined, robust core framework - Build sustainable communities of practice and CI
enterprises
22Knowledge Pyramid
Adapted from CENR-OSTP
23Technology Directions
- Cyberinfrastructure that enables the science
- Whole-life-cycle data management
- Domain-agnostic solutions
24Focus on CI that Enables the Science(end-to-end
solutions)
- Discovery, access, and use
- Free, open access to holdings (and tools)
25Support the Data Lifecycle
- Reliable, replicated storage infrastructure
- Interoperability across data centers
26Examples of Data Holdings
Metadata Interoperability Across Data Holdings
27Data Interoperability Ontologies and Semantic
Mediation
- Individual Researchers Notebook/Field
Observations - Micro/Meso-scale projects
- Multiple researchers
- Spreadsheets/databases
- Macro-scale projects (LTER, OOI, PISCO)
- Sensor clusters
- Laboratory analyses
- National-scale programs (e.g., NEON)?
- Sensor networks and flux towers
- Earth observation imagery
28Domain-Agnostic Solutions
Domain Agnostic practice or tool that crosses
domains
29Kilo Nalu Workflow
30Kepler Use Cases Represent Many Science Domains
- Ecology
- SEEK Ecological Niche Modeling
- REAPenvironmental sensor networks
- NEON Ecological sensor networks
- Molecular biology
- SDM Gene promoter identification
- ChIP-chip genome research
- CAMERA metagenomics
- Oceanography
- REAP SST data processing
- LOOKING ocean observing CI
- ROADNet real-time modeling
- Ocean Life project
- Physics
- CPES Plasma fusion simulation
- FermiLab particle physics
- Chemistry
- Resurgence Computational chemistry
- DART (X-Ray crystallography)
- Library science
- DIGARCH Digital preservation
- Cheshire digital library archival
- Conservation biology
- SanParks Thresholds of Potential Concerns
- Geosciences
- GEON LiDAR data processing
- GEON Geological data integration
31Workflows Aid Comprehension
- Publish to workflow repository with accession
number - Documents the linkage between publication,
analysis, and data
32Workflow Sharing Portal
33Sociocultural Directions
- Education and training for CI literacy
- Engaging the broad community through citizen
science - Building sustainable communities of practice and
CI enterprises
Mound built by cathedral termites
34- Experiential, Career-long Education and Training
35Citizen Science
36A Toolkit for Citizen Science
- How-to manual for new practitioners
- Resources for designing, executing, evaluating
projects - Solves data management issues
- Common path to share and archive data, regardless
of domain
www.CitizenScience.org
37CI Enterprises to Become Increasingly Global
38a wide range of partnering organizations
- Digital libraries
- Academic institutions
- Research networks
- NSF- and government-funded synthesis
supercomputer centers/networks - Governmental organizations
- International organizations
- Data and metadata archives
- Professional societies
- NGOs
- Commercial sector
39Longevity of CI Enterprises
- Broad, active community engagement
- Involvement of library and science educators
engaging new generations of students in best
practices - Existing outreach and education programs
- Transparent, participatory governance
- Adoption/creation of sustainable business models
- Strong organizational sustainability
40Thanks!
- Suzie Allard University of Tennessee
- Matt Jones University of California Santa
Barbara - Mike Frame National Biological Information
Infrastructure - Bob Cook Oak Ridge National Laboratory DAAC
- DataNetONE Partners Kepler-CORE Team