Title: Carole Goble
1A Web 2.0 Virtual Research Environment
- Carole Goble
- The University of Manchester, UK
- David De Roure
- University of Southampton, UK
2myexperiment.org
3E. Science laboris
- Workflows are the new rock and roll.
- Machinery for coordinating the execution of
(scientific) services and linking together
(scientific) resources. - The era of Service Oriented Applications
- Repetitive and mundane boring stuff made easier
4Taverna Workflow Workbench
5Taverna domains
- Systems biology
- Proteomics
- Gene/protein annotation
- Microarray data analysis
- Medical image analysis
- Heart simulations
- High throughput screening
- Phenotypical studies
- Phylogeny
- Text mining
- Plants, Mouse, Human
- Astronomy
6Recycling, Reuse, Repurposing
- Paul writes workflows for identifying biological
pathways implicated in resistance to
Trypanosomiasis in cattle - Paul meets Jo. Jo is investigating Whipworm in
mouse. - Jo reuses one of Pauls workflow without change.
- Jo identifies the biological pathways involved in
sex dependence in the mouse model, believed to be
involved in the ability of mice to expel the
parasite. - Previously a manual two year study by Jo had
failed to do this.
7e-Services in the CLOUD
- Independent third party world-wide service
providers of applications, tools and data sets.
In the Cloud. - 850 databases, 166 web servers Nucleic Acids
Research Jan 2006 - My local applications, tools and datasets. In the
Enterprise. In the laboratory. - Easily incorporate new service without coding. So
even more services from the cloud and enterprise.
8e-Scientists in the CLOUD
- Individual life scientists, in under-resourced
labs, who use other peoples applications, with
little systems support. - Exploratory workflows
- Developers (often) the users.
- Consumers are providers.
A distributed, disconnected community of
scientists.
9Taverna has averaged 40 downloads per day since
2006
39,714 total sourceforge downloads to 22 October
2007 Ranked in sourceforge top 200 in June 2007
10Scientific memes. Scientific virusesAccompany
their published outcomes400 Scufl workflows in
the Web Cloud
11New Scientist
12(No Transcript)
13Tryps Twiki World
14Tagging In the Web Cloud
- Picture of workflow in Flicker evidence of
social tagging and networking
15openwetware.org
16Blogging The Lab
17Key evaluator and inspiration
- Cat De Roure
- Her idea.
- 15 years old.
- This is one of her MySpace pictures.
- Facebook, LinkedIn, blah blah
- Amazon, VivaLaDiva
18myExperiment.org is
- A market place.
- A community social network.
- A gateway to other publishing environments.
- A federated repository
- A platform for launching workflows.
- Publishing self-describing Encapsulated
myExperiment Objects. - Mindful publication.
- Started March 2007.
- Closed beta since July 2007
- Open beta Oct 31 2007
19myExperiment.org principles
- Make it easy to publish, easy to participate,
easy to add value through mash-ups - Use familiar techniques
- Shopping, Social networking, gaming
- Use off the shelf, open source web tooling, not
restrictive portals. Keep it funky, keep it
flexible, keep it extensible. Assume other people
will add functionality. - Ruby on Rails, Facebook platform
- Aim it at young people. Make it fun and
attractive. Say no to 1970s library interfaces!
20The world isnt just my workflows
- Kepler, Triana, BPEL
- Music Information Retrieval workflows
- Experimental plans (chemistry)
- And other stuff
- Matlab scripts
- Ontologies
- Computational Economics
- And the associated data!
21The Demo
22(No Transcript)
23(No Transcript)
24(No Transcript)
25(No Transcript)
26Scoping Challenges
- Workflow warehouse / federation of repositories
Open Archives Initiative. Federated
myExperiments. Sharepoint. - Social space organised rich site Social
discourse organised service / workflow space
using curated semantics. - Granularity and identifiers Rolling-up
provenance. Id resolution - Open vs protected content Quality, Reliability,
Validation, Safety, Intellectual Property,
Ownership, Secrecy, A duty of guardianship.
Curation? Policing? Local data mixed with shared
resources - Desktop integration Google gadgets for workflows.
Interacting with workflows through Office
products. - Workflow execution (WHIP) Workflows Hosted in
Portals project - Evolving the myExperiment software Community
development - Enabling Scientists added value through
applications and collaborative tagging
27Front End.A market place.A community social
network.User Participation.
28A Market Place Shoe Shop?
- Shopping for Workflows and Services and Data
should be as easy as shopping for shoes. - Dont need to train people.
- Fuel for diagnostics. Find a similar workflow.
- Organic growth good and bad.
- We need good, organised metadata for automated
use. - Impedance mismatch
- Identity and Ontology Authority
29Screen shot of bio Service shopping site
30A Social Network and Collective Intelligence
- Source of large amount of metadata.
- Open tagging, folksonomies, blogging, profiles,
recommendations. - Masses -gt Curator pipeline?
- Social network analysis and e-tracking are
valuable intelligence. - How do we avoid being deafened by the shouting?
- What are the incentive models for scientists?
31Leveraging and Serving The Long Tail of Users in
the Cloud
- Small labs or individuals
- Specialist workflows, Niche
- Expert and inexpert
- Big labs big groups
- Common de facto workflows
32Challenge Semantic Sweatshop
- Service vocabularies and curation in the wild
- Keywords and tagging only goes some of the way.
- Curation
- Workflow metadata
- Find a workflow that is similar to what I want or
need similar to this one. Or hers. Hmmmm. - Recommendations, people profiles
- Fuel for discovery and diagnostics
- Pipeline from tag cloud to shop
33Challenge Policy and Permissions without Tears
34Back End.Federated repositories.A gateway to
other publishing environments.A platform for
launching workflows.User Participation.
35Warehouse or Federation
- Community web site, federated repository.
- Multiple and My.
- Publish what I want when I want within the group
I want. - Mixed identity regimes an identity authority
- Open Archives Initiative. http//www.openarchives.
org/ - The CombeChem project. http//www.combechem.org/
36A gateway to other publishing services
- Tryps team already has a wiki
- Mash up with Facebook and workflow hosting apps.
- Bring functionality to the user. Cooperate! Dont
Control.
37We Need You! Cooperate, Don't Control
- A network of cooperating data services with
simple interfaces which make it easy to work with
content. - Provides services
- Reuses the service of others.
- Support lightweight programming models so that it
can easily be part of loosely coupled systems. - You add a mashup!
- Workflows as content syndication?
38Developers are Users Too.
- How to Develop and Grow myExperiment?
- Dont just listen to the Scientist.
- Get them to do the work!
39Parties
- 28th 29th Sept 2006
- Hand picked Taverna users Taverna developers
- Facilitated by National Centre for e-Social
Science - AJAX based development
- Previous experiences from other projects
CombeChem, myTea.
- A social networking environment for sharing any
workflow - A Taverna workflow run environment
- A multi-workflow launch environment
26/2/2007 myExperiment Slide 39
40Hack Fests
41Web 2 Implementation
42Experiments are collections of stuff
Mash up
Application
43EMO Examples
- A workflow with its inputs and the products of
executing it (including logs), perhaps multiple
times - Data from instruments, coupled with log book
entries - A collection of all the digital items associated
with one experimentincluding EMOs - A collection of workflows with instructions and
examples - A reproducible academic paper with workflows and
data
44EMO Challenges
- What happens when the parts are scattered across
multiple stores? - What happens if someone updates a part?
- How will my EMO be discovered on the Web?
- How can I work with an EMO offline?
- What is the provenance of the EMO and its parts?
- What happens if a part is unavailable?
- How do I send an EMO by email?
- Can I turn an EMO into a tarball?
- Can I archive an EMO to a CDROM?
- If I delete this file will it break anyones
EMOs? - How do I trust an EMO?
- How do I handle an EMO RESTfully?
- Can my EMO link to objects outside the EMO?
45Our Approach
- Designed for compatibility with Linked Data and
with Open Archives Initiative Object Reuse and
Exchange (OAI-ORE) which deals with compound
object information and aims to build standardised
and interoperable mechanisms
- An EMO file is a Resource Map describing all the
distinct parts contained in the EMO - Like a resource snapshot
- EMOs map to the familiar folders and files
interface
46Microsoft
- Cool front end
- Interface constructed using Silverlight
- Desktop savvy
- Feasibility of blending with desktop tools (e.g.
excel) and exposing myExperiment objects through
existing applications - Mashed up services
- Demonstrate functionality mashups to build new
services over the APIs including other science
content in back end, EMOs and other content and
information surfaced through other services - Research Information Centre
- Windows workflows
24/5/2007 myExperiment Slide 46
47Timeline
- Closed Beta released in July 2007
- Open Beta from November 2007
- In friends and family trials now with
bioinformaticians and chemists - Linking up to Triana
- Enactment and EMOs coming next
- Music and social science in pipeline
- API available
- Open Source
- Join us!
48Take homes
- myExperiment is a Web 2.0 Environment for
Scientists to share experiments - Join us!
- David De Roure
- dder_at_ecs.soton.ac.uk
- Carole Goble
- carole.goble_at_manchester.ac.uk
49Credits
- myGrid and CombeChem
- Matt Lee
- David Withers
- Don Cruickshank
- David Newman
- Mark Borkum
- Rob Procter
- Alex Voss
- Duncan Hull
- Katy Wolstencroft
- June Finch
- Ed Zaluska
- Jeremy Frey
- Simon Coles
- Danius Michaelides
- Marco Roos
- All the users inc. embedders
50Web 2.0 Design Patterns
- The Long Tail
- Data is the Next Intel Inside
- Users Add Value
- Network Effects by Default
- Some Rights Reserved
- The Perpetual Beta
- Cooperate, Don't Control
- Software Above the Level of a Single Device
- http//www.oreillynet.com/pub/a/oreilly/tim/news/2
005/09/30/what-is-web-20.html
511. The Long Tail
- Our target users are not just the specialist
e-Scientists using computing resources to tackle
major scientific breakthroughs, but also the
large number of scientists conducting the routine
processes of science on a daily basis. - Through sharing we have the potential to enable
smart scientists to be smarter and propagate
their smartness, in turn enabling other
scientists to become better and conduct better
science.
522. Data is the Next Intel Inside
- myExperiment understands that scientists are
focused on data, not software or one particular
workflow engine. - Workflows are components of customised
applications, many of which are data-oriented
rather than process-oriented. - Users manipulate, through their own applications,
the product (data, model) yielded by the
workflow. - Furthermore, workflows themselves are the data of
myExperiment and provide its unique value.
533. Users Add Value
- myExperiment makes it easy to find workflows and
is designed to make it useful and straightforward
to share workflows and add workflows to the pool.
- To succeed we draw on the insights into the
incentive models of scientists gained through
experience with Taverna.
544. Network Effects by Default
- myExperiment aggregates user data as a
side-effect of using the VRE. - The ability to execute workflows from
myExperiment, and the integration of tools such
as Taverna with myExperiment, further enable us
to achieve increased value through usage.
555. Some Rights Reserved
- myExperiment users require protection as well as
sharing, but the environment is designed for
maximum ease of sharing to achieve collective
benefits workflows are "hackable" and
"remixable". - Initiatives such as Science Commons provide a
useful context for this.
566. The Perpetual Beta
- myExperiment is an online service (a collection
of online services) and is continually evolving
in response to its users. - To support this, the project commenced with
developers being embedded in the user community. - Through day-to-day contact between designers and
researchers, design is both inspired and
validated.
577. Cooperate, Don't Control
- myExperiment is a network of cooperating data
services with simple interfaces which make it
easy to work with content. - It both provides services and reuses the service
of others. - It aims to support lightweight programming models
so that it can easily be part of loosely coupled
systems.
588. Software Above the Level of a Single Device
- The current model of Taverna running on the
scientists desktop PC or laptop is evolving into
myExperiment being available through a variety of
interfaces and supporting workflow execution.