Title: Open access to biodiversity data: the speciesLink experience
1Open access to biodiversity data: the speciesLink experience
- Dora Ann Lange Canhos
- dora_at_cria.org.br
2How to promote data sharing?
- Cultural barriers (internet, informatics, ...)
- Technical limitations (interoperability, archiving, ...)
- Legal impediments (biosafety, access to biodiversity, ...)
- It doesn't only depend on the will to share data...
- Must be organized: data models, standards, protocols...
- Must be feasible (doable)
- Must be planned: adequate resources, expertise, infrastructure
3Working with data providers
- routine should not be disrupted
- they must have full control over their data
- participation must be easy
- they must be acknowledged
- they must also benefit from sharing their data
4speciesLink data providers: biological collections
(Diagram: collections COL 1 to COL 5 on heterogeneous platforms: Linux/MySQL, Win98/Biota, Win98/Access, FreeBSD/PostgreSQL, Win2000/Brahms; each is marked "?", with one further label, "program")
5Challenges
- Integrate data from
  - different taxonomic groups
  - distributed in different collections
- Regardless of
  - where the collections are located
  - what software the collections use
  - the Internet connectivity available
  - the expertise available
- Without changing the routine
- Maintaining full control of data by the collection
- Not expensive (open source and free software)
- Integrated with other networks (local and international)
6System Architecture
(Diagram labels: DiGIR Portal (Java); speciesLink site, presentation layer; Perl; fast and stable connectivity; slow or unstable connectivity)
7Collections database
Database available on-line
DarwinCore data model
Mapping data fields
Filter for sensitive data
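The "mapping data fields" step can be pictured with a minimal sketch. It assumes the mapping is a simple rename table from a collection's local column names to Darwin Core terms. The local names (`tombo`, `especie`, and so on), the sample record, and the function `to_darwin_core` are hypothetical, and the Darwin Core term spellings follow the current standard, which may differ from the version speciesLink used.

```python
from typing import Any, Dict

# Hypothetical local column names mapped to Darwin Core terms.
FIELD_MAP: Dict[str, str] = {
    "tombo": "catalogNumber",
    "especie": "scientificName",
    "coletor": "recordedBy",
    "localidade": "locality",
    "lat": "decimalLatitude",
    "lon": "decimalLongitude",
}


def to_darwin_core(local_record: Dict[str, Any]) -> Dict[str, Any]:
    """Rename mapped fields; unmapped or empty local fields are not exported."""
    return {
        dwc_term: local_record[local_name]
        for local_name, dwc_term in FIELD_MAP.items()
        if local_record.get(local_name) not in (None, "")
    }


# Made-up sample record, for illustration only.
record = {
    "tombo": "12345",
    "especie": "Caryocar brasiliense",
    "coletor": "J. Silva",
    "lat": -15.78,
    "lon": -47.93,
    "obs_interna": "on loan",  # internal note with no mapping: stays local
}
dwc = to_darwin_core(record)
```

Because only mapped fields are exported, the collection keeps its own database layout and decides field by field what becomes visible.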
8System Architecture
(Diagram labels: DiGIR Portal (Java); speciesLink site, presentation layer; Perl; fast and stable connectivity; slow or unstable connectivity. Collection B and Collection C are each shown with: Collection Management System, SQL, Data, SOAP client, Data Repository)
9Data Migration Client
- Platform independent (Java)
- Connects to any database accessible via JDBC
- (simple text files are also supported)
- Complete control over data
- Low traffic
- Possibility to filter sensitive data using a regular expression
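The regular-expression filter can be illustrated with a minimal sketch. This is not the Data Migration Client itself, which is a Java program using JDBC. Here Python's `sqlite3` stands in for the collection database, and the table `specimens`, its columns, the pattern `SENSITIVE`, and the function `export_rows` are all hypothetical. The sketch also assumes that a match withholds the whole record; the real client may filter differently.

```python
import re
import sqlite3
from typing import Dict, Iterator, Pattern

# Hypothetical rule: withhold any record whose notes flag it as sensitive.
SENSITIVE = re.compile(r"\b(restricted|endangered)\b", re.IGNORECASE)


def export_rows(conn: sqlite3.Connection, pattern: Pattern[str]) -> Iterator[Dict[str, str]]:
    """Yield only the records that may leave the collection."""
    cursor = conn.execute(
        "SELECT catalog_number, scientific_name, notes "
        "FROM specimens ORDER BY catalog_number"
    )
    for catalog_number, scientific_name, notes in cursor:
        if pattern.search(notes or ""):
            continue  # matched the filter: the record is never sent
        yield {"catalogNumber": catalog_number, "scientificName": scientific_name}


# In-memory database with made-up rows, standing in for a real collection.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE specimens (catalog_number TEXT, scientific_name TEXT, notes TEXT)"
)
conn.executemany(
    "INSERT INTO specimens VALUES (?, ?, ?)",
    [
        ("A1", "Genus alpha", "common"),
        ("A2", "Genus beta", "RESTRICTED locality"),
        ("A3", "Genus gamma", None),
    ],
)
exported = list(export_rows(conn, SENSITIVE))
```

Filtering on the collection's side, before anything is transmitted, is what lets the provider keep complete control over the data.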
13Other Applications
- data cleaning
- collection profiles
- indicators
- ecological niche modeling
14Data cleaning
- species name
- georeferencing
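The slides do not describe how the data-cleaning tools work, so the following is only an illustrative sketch of the two kinds of check named above. It assumes a species name should look like a plain "Genus species" binomial and that coordinates are decimal degrees; the pattern `BINOMIAL` and the function `check_record` are hypothetical.

```python
import re
from typing import List, Optional

# Hypothetical pattern: capitalised genus, lower-case specific epithet.
BINOMIAL = re.compile(r"^[A-Z][a-z]+ [a-z-]+$")


def check_record(name: str, lat: Optional[float], lon: Optional[float]) -> List[str]:
    """Return a list of suspected problems; an empty list means none were found."""
    problems: List[str] = []
    if not BINOMIAL.match(name.strip()):
        problems.append("name is not a plain 'Genus species' binomial")
    if lat is None or lon is None:
        problems.append("missing coordinates")
    elif not (-90 <= lat <= 90 and -180 <= lon <= 180):
        problems.append("coordinates out of range")
    return problems


ok = check_record("Caryocar brasiliense", -15.78, -47.93)
bad = check_record("caryocar Brasiliense", 95.0, -47.93)
missing = check_record("Caryocar brasiliense", None, None)
```

Checks like these only flag records as suspect; the correction itself stays with the collection's curators.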
20Collection profile ...
25Indicators
27(Figure values: 18,727 (2.63); 592,185 (83.28))
30Ecological niche modeling
- See demonstration of openModeller with Tim Sutton, Renato De Giovanni, ...
31The speciesLink network
research
education
nomenclature taxonomy
descriptive data
Decision making
primary data
modeling
Data quality
maps
Biological collection
33Building data infrastructure is necessarily a collaborative effort
34- http://splink.cria.org.br
- dora_at_cria.org.br
Thank you