Title: Access Changes Everything
1Access Changes Everything
- The Benefits of Open Access and Open Semantics
for Researchers
Leslie Carr Intelligence, Agents and Multimedia
Group University of Southampton
2(No Transcript)
3Salutary Warning
- A scholar is just a librarys way of making
another library - Daniel Dennett, Consciousness Explained
4Thanks to Tim Brody and Stevan Harnad
(Southampton University)
5Outline
- Open Access
- Visionary Foundations
- Rationale Research Impact
- Effect of Open Access on Research Impact
- Tools and Services
- Initiatives
- Semantic Web
- Introduction
- Resource Description
- Examples
- Concluding Thoughts
6Open Access
7H. G. Wells, World Brain The Idea of a Permanent
World EncyclopaediaEncyclopédie Française,
August, 1937
- encyclopaedias of the past sufficed for the needs
of a cultivated minority - universal education was unthought of
- gigantic increase in recorded knowledge
- more gigantic growth in the numbers of human
beings requiring accurate and easily accessible
information
8Permanent World Encyclopaedia
- Discontented with the role of universities and
libraries in the intellectual life of mankind - Universities multiply but do not enlarge their
scope - thought knowledge organization of the world
- No obstacle to the creation of an efficient index
to all human knowledge, ideas and achievements
9Vannevar Bush, As We May ThinkAtlantic Monthly,
July 1945
- Director of the Office of Scientific Research and
Development in USA, coordinating 6,000 American
scientists during WWII - Turns to making our bewildering store of
knowledge more accessible - For many years inventions have extended mans
physical powers rather than the powers of his
mind.
10Memex
- The Memex (never built) was to be a mechanised
device to allow a library user to - consult all kinds of written material
- organize it in any way the user wanted
- add private comments and link documents together
at will. - A personal library station which held all written
articles and journals on microfilm. - system of levers allowed users to add links
- create trails
11Doug Engelbart
- Inventor of the mouse, was inspired by Bushs
article. - Computers were too expensive to be used
interactively and for non-numeric tasks - Augment project (1962) to develop computer tools
to augment human capabilities and productivity
12Ted Nelson
- Hypertext is more than text (1965)
- Literature is a system of interconnected
documents - Project Xanadu was a global literature a
repository of documents, their multiple versions
and their interconnections.
13Stevan Harnad, ScholarlySkywriting,
Psychological Science (1990).
- Internet provides improvements in storing and
communicating ideas. - The reward is improvement in generating ideas
research. - Greatest reward is the possibility of much
greater intellectual productivity in one lifetime.
14Tim Berners-Lee
- Inventor of the WWW (1990)
- Intended as a tool for physicists at CERN
- Aim was to help quickly share research results in
collaborative projects - Achieved through simple document, communications
and linking standards. - simple standards caused rapid adoption
15 Paul Ginsparg
- Creator of the Los Alamos preprint archive (1991)
- Now contains 280,000 articles
- High Energy Physics
- Computing
- Maths
- Qualitative Biology
- Founder of the Open Archiving Initiative
16Various Visions
- Wells a centralised, managed global knowledge
repository to combat fragmenting academic
authority. - Bush a cross-disciplinary scholarly paradigm to
combat fragmenting scientific knowledge. - Engelbart computers augment productivity
- Nelson computers create a global literature
- Harnad Internet to boost personal research
impact - Berners-Lee low-impact, standards-based
document dissemination for scientific research - Ginsparg Web to speed up personal scientific
communication against publication delays
17Fast Forward to Open Access
- The Optimal and Inevitable for Researchers.
- The entire full-text refereed corpus online
- On every researchers desktop, everywhere
- 24 hours a day
- All papers citation-interlinked
- Fully searchable, navigable, retrievable
- For free, for all, forever
Stevan Harnad, Les CarrOpCit International DLI
Project Proposal (1999)
18Open Access
19Open Archiving Initiative
- Initially UPS Universal Preprint Service
- discussions initiated by Los Alamos HEP archive
(Paul Ginsparg) - Inaugural meeting October 1999, Santa Fe
- Protocols to facilitate exchange of metadata
- HTTP / XML Schema / Dublin Core
- Data provider / service provider distinction
20EPrint Archiving Software
- A simple, turnkey environment for setting up an
OAI compliant archive - Self archiving
- Institutional archives
- (other software available DSpace, Fedora etc)
21The Literature As We Imagine
22The Literature As It Is
23Twin Peaks Problem
Harvards
financial firewalls
Access
Impact
Have-Nots
24The Research-Impact Cycle
- Open access to research output
- maximizes
- research access
- maximizing (and accelerating)
- research impact
- (hence also research productivity
- and research progress
- and their rewards)
25Impact cycle begins Research is done
Researchers write pre-refereeing Pre-Print
Submitted to Journal
12-18 Months
Pre-Print reviewed by Peer Experts Peer-Review
Pre-Print revised by articles Authors
Refereed Post-Print Accepted, Certified,
Published by Journal
Researchers can access the Post-Print if their
university has a subscription to the Journal
26Impact cycle begins Research is done
Researchers write pre-refereeing Pre-Print
Submitted to Journal
12-18 Months
Pre-Print reviewed by Peer Experts Peer-Review
Pre-Print revised by articles Authors
Refereed Post-Print Accepted, Certified,
Published by Journal
Researchers can access the Post-Print if their
university has a subscription to the Journal
New impact cycles New research builds on
existing research
27Research Impact
- measures the size of a research contribution to
further research (publish or perish) - generates further research funding
- contributes to the research productivity and
financial support of the researchers institution - advances the researchers career
- promotes research progress
28Open Access
- Effect on Research Impact
29Online or Invisible? (Lawrence 2001)
- average of 336 more citations to online
articles compared to offline articles published
in the same venue - Lawrence, S. (2001) Free online availability
substantially increases a paper's impact Nature
411 (6837) 521. - http//www.neci.nec.com/lawrence/papers/online-na
ture01/
30Open vs non-Open Impact(All Physics)
31Open vs non-Open Impact(Nuclear Physics)
32Open vs non-Open Impact(Chemical Physics)
33Open vs non-Open Impact(General Physics)
34Research Assessment, Research Funding, and
Citation Impact
- Correlation between RAE ratings and mean
departmental citations 0.91 (1996) 0.86 (2001)
(Psychology) - RAE and citation counting measure broadly the
same thing - Citation counting is both more cost-effective
and more transparent - (Eysenck Smith 2002)
- http//psyserver.pc.rhbnc.ac.uk/citations.pdf
35Time-Course of Citations (red) and Usage (hits,
green)Witten, Edward (1998) String Theory and
Noncommutative Geometry Adv. Theor. Math. Phys.
2 253
1. Preprint or Postprint appears. 2. It is
downloaded (and sometimes read). 3. Eventually
citations may follow (for more important
papers). 4. This generates more downloads, etc.
36Usage Impact is correlated with Citation Impact
(Physics ArXiv hep, astro, cond, quantum math,
comp)http//citebase.eprints.org/analysis/correla
tion.php
- (Quartiles Q1 (lo) - Q4 (hi))
- All r.27, n219328
- Q1 (lo) r.26, n54832
- Q2 r.18, n54832
- Q3 r.28, n54832
- Q4 (hi) r.34, n54832
- hep r.33, n74020
- Q1 (lo) r.23, n18505
- Q2 r.23, n18505
- Q3 r.30, n18505
- Q4 (hi) r.50, n18505
- (correlation is highest for high-citation
papers/authors)
Most papers are not cited at all
Average UK downloads per paper 10 (UK site
only 18 mirror sites in all)
37Some old and new scientometric (publish or
perish) indices ofresearch impact
- Peer-review quality-level and citation-counts of
the journal in which the article appears - citation-counts for the article
- citation-counts for the researcher
- co-citations, co-text, semantic web (cited with
whom/what else?) - citation-counts for the preprint
- usage-measures (hits, webmetrics)
- time-course analyses, early predictors, etc. etc.
38Open Access
39Tools for(a) creating OAI-compliant university
eprint archives(b) parsing and finding cited
references on the web, (c) reference-linking
eprint archives, (d) doing scientometric
analyses of research impact,(e) creating
OAI-compliant open-access journals
- http//software.eprints.org
- http//paracite.eprints.org/
- http//opcit.eprints.org/evaluation/Citebase-evalu
ation/evaluation-report.html - http//citebase.eprints.org/help/
- http//psycprints.ecs.soton.ac.uk/
-
40Citation Linking Service
- Reference links on PDF copies of papers
- PDF technology from Open Journals Project, David
Brailsford, Steve Probets, David Evans
41Citation-Ranked Search Service
42Citation Visualisation Service
43Open Access
44The Budapest Open Access Initiative
Two open-access strategies Gold and Green
45The two open-access strategiesGold and Green
- Open-Access Publishing
- (OApub) (BOAI-2)
- Create or Convert 23,000 open-access journals
(1000 exist currently) - Find funding support for open-access publication
costs (500-1500) - Persuade the authors of the annual 2,500,000
articles to publish in new open-access journals
instead of the existing toll-access journals
- Open-Access Self-Archiving
- (OAarch) (BOAI-1)
- Persuade the authors of the annual 2,500,000
articles they publish in the existing toll-access
journals to also self-archive them in their
institutional open-access archives.
46Berlin Declarationon Open Access to Knowledge
in the Sciences and Humanitieshttp//www.zim.mpg
.de/openaccess-berlin/berlindeclaration.html
- The pertinent passages
- Open access means
- 1. free... online, full-text access
- 2. A complete version of the open-access
work... is deposited... - in at least one online repository... to
enable open access, unrestricted distribution,
OAI interoperability, and long-term archiving. - We intend to... encourage.. our
researchers/grant recipients to - publish their work according to the principles
of... open access.
47What is needed for open access now
- Universities Adopt a university-wide policy of
making all university research output open access
(via either the gold or green strategy) - Departments Create and fill departmental
OAI-compliant open-access archives - University Libraries Provide digital library
support for research self-archiving and
open-access archive-maintenance. - Promotion Committees Require a standardized
online CV from all candidates, with refereed
publications all linked to their full-texts in
the open-access journal archives and/or
departmental open-access archives - Research Funders Mandate open access for all
funded research (via either the gold or green
strategy). Fund (fixed, fair) open-access journal
peer-review service charges. Assess research and
researcher impact online (from the online CVs). - Publishers Become either gold or green.
48RoMEO Directory of Publishers who have given
theirGreen Light to Self-Archivinghttp//www.she
rpa.ac.uk/romeo.phphttp//romeo.eprints.org
Proportion of journals already formally giving
their green light to author/institution
self-archiving (already 83) continues to grow
49- Percentage of green PUBLISHERS grew from
42 - 58 from 2003-2004 - Percentage of green JOURNALS grew from
55 - 83 from 2003-2004
50OAIster, a cross-archive search engine, now
covers over 250 OAI Archives (about half of them
Eprints.org Archives) indexing over 3 million
items (but not all research papers, and not all
full-texts). http//oaister.umdl.umich.edu/o/oaist
er/
but there are 2.5 million journal articles
published per year!
51Declaration of Institutional Commitmentto
implementingthe Berlin Declaration on
open-access provision
- Our institution hereby commits itself to adopting
and implementing an official institutional policy
of providing open access to our own peer-reviewed
research output -- i.e., toll-free, full-text
online access, for all would-be users webwide --
in accordance with the Budapest Open Access
Initiative and the Berlin Declaration - UNIFIED OPEN-ACCESS PROVISION POLICY
- (OAJ) Researchers publish their research
in an open-access journal if a suitable one
exists
- otherwise
- (OAA) Researchers publish their research
in a suitable toll-access journal and also
self-archive it in their own research
institution's open-access research archive. - To sign http//www.eprints.org/signup/sign.php
- A JISC survey (Swan Brown 2004) "asked authors
to say how they would feel if their employer or
funding body required them to deposit copies of
their published articles in one or more
repositories. The vast majority... said they
would do so willingly. - http//www.jisc.ac.uk/uploaded_documents/JISCOAre
port1.pdf
52Semantic Web
53Archiving More than Articles
- Metadata collection and distribution
- Basis of OAI
- But extra effort for researcher
54Semantic Web
- W3C activity to improve Web resources
- By providing metadata
- Formal descriptions of resources
- Based on strict standards
- RDF - Resource Description Format
- RDF(S) - Schema Language for defining types or
resources and types of properties - OWL - Ontology language for more complex
relationships
55Old Web Service
- Web server sends a document to a user
56Modern Web Services
- Web server sends data to a program
57Semantic Web
- Semantic web provides resources to users and
their semantics to computers
58Semantic Web
59RDF Metadata
- Data about data
- information about documents
- title, author, journal, date, keywords
- information about people
- role, history, salary, expertise
- information about exhibits
- catalogue number, price, date, artist
- information about metadata
- validity, purpose, compiler, authority
60 Catalogue information. artist, title of the
image or picture, date acquired,
dimensions. Syntactic content. primitive
features, e.g. colour, texture and
shapes. Semantic content. what its supposed to
represent, e.g. painting of a landscape or a
representation of happiness.
61RDF Model
http//www.w3c.org/Intro.html
Author
Tim Berners-Lee
62RDF Model
http//www.w3c.org/Intro.html
predicate
Author
subject
Tim Berners-Lee
object
63RDF Model
64Semantic Web Examples
- Example Projects
- CSAKTive Space
- Web Photos
- Ontologies
- Role of ontologies
- How they dovetail in with OAI
- Dspace / SIMILE
- Bridging the semantic gap
65CS AKTive Space
- Integrating info from
- Eprint archives
- Home pages
- Funding agencies
66Web Conference Photo
- Attendees upload photos for public display
- Can then be publicly annotated
- List of known people collected
- community
67Web Photo RDF Model
- Ontologies used
- Dublin Core
- Friend-of-a-Friend
- Creative Commons Rights Management
- Geographical Locations
- Calendar Events
68Simile
- DSpace / MIT / HP / W3C Semantic Web and Digital
Library project - Many resources in many sites catalogued with
different schemes for different purposes - Use ontologies to switch between domains and
perform cross-domain searches
69Simile Scenario
(Taken from Dspace User Group slides)
- Started on ARTstor island
- SUBJECT Abstract
Roamed around island SUBJECT Abstract, CREATOR
Gorky
Travelled over Gorky bridge to OCW
island CREATOR Gorky, IS PART OF ...
Found resource not on ARTstor island
Travelled over Graham bridge
70Semantic Web raison detre
- Bridging between resources
- Through shared semantics of metadata
- Made possible by ontologies
71Lessons for Open Access
- Collect and organise metadata
- and explain to authors the benefits of their
investments - Researchers become responsible maintainers of
their output - For sharing with their community
- For sharing with posterity
- Build value-added services that build on shared
agreements about meaning
72Final Thoughts
- Open access improves science
- Network effect
- more participants -gt better services
- Just do it!
- But start with small steps