Title: Jano van Hemert
1http//www.dgemap.org/
Jano van Hemert
23 years EU-funded design study
- Goal design the organisational collaborative
structures, ethical framework, and molecular
genetic informatics technologies necessary for
a new research infrastructure which will
accelerate an integrated European approach to
gene expression in early human development
3Consortium
44 work packages
5Three major goals
- Facilitate collaboration over multiple
laboratories - Improve ways for handling spatial-temporal data
from gene expression studies - Provide integration with other technologies and
databases to help biologists advance their studies
6Laboratory process
7From 2D sections to 3D models
8Framework Edinburgh Mouse Atlas
Space and Anatomy
9Space and Anatomy
anatomical name
10Gene Expression Database
- Query by both space and text...
11emage Query by space
12emage Query by space
13Silicon processes
14The Developing Human e-Portal
15Web services exist
16Where do workflows fit in?
- Advanced queries incorporating other DBs
- Linking genes with diseases (OMIM)
- Genetic pathways (Kegg)
- Mouse-human interoperability
- Using anatomical terms
- Using direct 3D to 3D model mapping
- Using spatial-temporal ontologies
- Data mining and processes
- Hierarchical Clustering
- Association rules
17Mouse-human interoperability
18Hierarchical clustering
McMahon Data TS17
19Hierarchical clustering
McMahon Data TS17
Myt1l
Dlx5
20Let biologists cluster data
21Clustering viewing the output
22What are association rules?
- Based on a set of transactions
- We want to derive rules of the form X gt Y
- Meaning, if X happens then Y happens
- X and
- X and Y are sets of items appearing in the
transactions - The rules come with numbers to express their
quality with respect to the set of transactions
(most common support and confidence)
23Association Rules
- In the context of gene expression if Gene1 and
Gene2 then Gene3 where a transaction equals a set
of genes expressing together at the same time in
the same anatomical component - Alternative if Component1 then Component2 and
Component3 where a transaction equals a number of
components expressing the same gene at the same
time
24Association Rules Results
Transaction genes expressing in the same
anatomical component in the same Theiler stage
Association rules with a minimum confidence of
90
Wnt1, Bmp4 gt Shh 0.053 0.91 Vcam1 gt
Kdr 0.057 0.93 Emx2 gt Otx2 0.054 0.95 Otx1,
Pax6 gt Otx2 0.051 0.92
Techo-fact extracted using web services called
from a Perl script
Source the EMAGE database, using the editorial
spatial annotations extracted on 2006/08/28
25Perl script
26Main issues while using Taverna
- Need for more data mangling functions
- Need for more data formatting controls
- Pipelining and memory concerns
- Library of useful translations services
- Interaction Plug-in Architecture?
- What about Axis version 2?
27Thanks for your attention
- Susan Lindsay
- Demetrius Vouyiouklis
- Marie-Laure Muiras
- Xunxian Wang
- Mark Scott
- Alina Andras
Malcolm Atkinson Jano van Hemert Yin
Chen Richard Baldock Simon Woods Ken Taylor