Geen diatitel - PowerPoint PPT Presentation

1 / 25
About This Presentation
Title:

Geen diatitel

Description:

... organisational levels & geography. a scientific discipline ... Technology for CRISs. Essential Building blocks. Metadata. Dictionaries, Thesauri & Ontologies ... – PowerPoint PPT presentation

Number of Views:11
Avg rating:3.0/5.0
Slides: 26
Provided by: Vang5
Category:

less

Transcript and Presenter's Notes

Title: Geen diatitel


1
CURRENT RESEARCH INFORMATION SYSTEMS
TECHNOLOGIES
2
Introduction
  • Who am I?
  • Geert Van Grootel
  • Senior researcher Science division, Ministry of
    the Flemish community.
  • IWETO Flemish CRIS
  • CERIF taskgroup member
  • euroCRIS treasurer.

3
Structure of presentation
  • Introduction Terminology
  • Past present technologies
  • Examples of implementations
  • CERIF Technology

4
Context Integration of CRISs
  • Different organisational levels geography
  • a scientific discipline
  • intra institutional
  • between institutions
  • Different levels of government
  • regional, national, international, global.
  • Different levels of system integration
  • integrated system (ERP)
  • intra process data capture collection
  • extra process

5
CRIS
  • Current research Information System
  • Technologies behind CRISs
  • Document stores
  • Relational Database Managment Systems (RDBMS)
  • Object Oriented Database Managment Systems
    (OODBMS)
  • Information Retrieval systems (IR)

6
Document stores
  • Document systems
  • Based on Markup Languages (SGML, XML)
  • in extistance since the 80s
  • Rise in popularity with XML behind it as semi
    structured database.
  • Querying is usually poor
  • query language is procedural and navigational as
    opposed to declarative predicates
  • Difficult to maintain
  • updating is slow when changes effect several
    entity instances but fast when only with one
    document.
  • Variable report capabilities group, sum,
    average,...

7
Information Retrieval Systems
  • Advantages for databases with many textual
    attributes
  • via Full inverted index
  • very fast retrieval
  • very slow update
  • little or no structural capability ( relations
    between entities)
  • little or no reporting capability
  • group, sum, average,...

8
OODBMS
  • Crucial to OODBMS is the concept of objects
  • Data (structure view)
  • Methods (process view)
  • Messages (event view)
  • Any process has to be codes specifically for any
    object
  • solutions is inheritence to help reduce coding
    efforts
  • Disadvantages
  • performance, worse than RDBMS
  • poorer data representational capabilities

9
RDBMS
  • Pros
  • Mathematically formal
  • easy to understand
  • standard query language (SQL)
  • mature technology
  • Cons
  • hard to represent complex objects
  • High performance needs expert knowledge

Flexible linking relations between business
objects
10
Technology for CRISs
  • Essential Building blocks
  • Metadata
  • Dictionaries, Thesauri Ontologies
  • Keys Binary Relations

11
Data Metadata
  • Incredible amount of data but much of this data
    is unaccesible
  • What we need
  • Find relevant data as information
  • Understand it syntax, semantics
  • Understand any restrictions on its use
  • The key to this is METADATA

12
Metadata
  • Importance
  • Integrity control
  • Access control
  • Support of data
  • Classification, valid terms
  • Interoperability
  • Data exchange
  • Data access
  • Benefits
  • Data quality
  • Access
  • Understanding answers
  • Improving queries
  • Interoperability
  • other CRISs
  • other Systems
  • MIS, RMS
  • Bibliographic systems
  • Scientific data

13
Three Kinds of Metadata
view to users
SCHEMA
NAVIGATIONAL
ASSOCIATIVE
constrain it
how to get it
data (document)
14
SCHEMA METADATA
view to users
SCHEMA
NAVIGATIONAL
ASSOCIATIVE
constrain it
how to get it
data (document)
15
Metadata Kinds Schema
  • intensional description of extensional instances
  • database
  • name
  • size
  • security authorisations
  • attributes
  • name
  • type
  • constraints
  • formal logic relationship to data instances

SCHEMA
constrain it
16
ASSOCIATIVE METADATA
view to users
SCHEMA
NAVIGATIONAL
ASSOCIATIVE
constrain it
how to get it
data (document)
17
Associative Metadata
view to users
  • information for application assistance
  • catalog record (e.g. Dublin Core) - descriptive
  • content rating (e.g. PICS) -
    restrictive
  • security, privacy (cryptography, digital
    signatures) - restrictive
  • information from dictionaries, thesauri,
    hyperglossaries, domain ontologies
    - supportive
  • no formal logic relationship to data instances

ASSOCIATIVE
18
NAVIGATIONAL METADATA
view to users
SCHEMA
NAVIGATIONAL
ASSOCIATIVE
constrain it
how to get it
data (document)
19
NAVIGATIONAL METADATA
  • How to get to information resource direct
  • filename
  • DB name navigational algorithm
  • DB name predicate (query)
  • URL
  • URL predicate (query)
  • or any of the above via
  • web indexing system (eg AltaVista, ExCite)
  • local indexing system bookmarks or proxy server)

NAVIGATIONAL
how to get it
20
Metadata
Collecting observed facts
DATA
Structuring in Context
INFORMATION
Inducing commonly accepted belief
KNOWLEDGE
INSIGHT
21
Technology for CRISs
  • Essential Building blocks
  • Metadata
  • Dictionaries, Thesauri Ontologies
  • Keys Binary Relations

22
ONTOLOGY
  • What is an Ontology
  • A specification of a conceptualization.
  • A formal description of the concepts and
    relationships that can exist for an agent or a
    community of agents
  • The knowledge of a domain defined in a formal
    declarative language
  • The collection of semantic definitions for a
    domain.
  • In practice a resource of terms, their
    definitions and their logical inter-relationships.

23
DOMAIN ONTOLOGY
  • Domain Ontology
  • An ontology covering a specific subject area of
    interest (a domain).
  • The set of objects that can represented can be
    called the universe of discourse.
  • E.g. For a project to exist it must have a
    startdate, a subject, a goal, a promotor and a
    budget
  • Project lt- startdate AND subject AND goal AND
    promotor AND budget gt 0

24
DOMAIN ONTOLOGY
  • Domain Ontologies in IT
  • A representation in first order logic allowing
  • Facts to be expressed
  • Relationships to be expressed
  • Constraints to be expressed
  • New facts and relationships to be deduced or
    induced

25
And so.
  • Metadata is the key to
  • GRIDs
  • SEMANTIC WEB
Write a Comment
User Comments (0)
About PowerShow.com