Title: Drawing, clustering and visualization of biological pathways.
1Drawing, clustering and visualization of
biological pathways.
- Fabien Jourdan,
- LIRMM, Montpellier France.
- fjourdan_at_lirmm.fr
2Visualization
3From data extraction to visualization
4From data extraction to visualization
5From data extraction to visualization
6From data extraction to visualization
Data Extraction
Visualization
7Visualization Process
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
8Visualization Process
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
9Visualization Process
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
10Visualization Process
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
11Visualization loop
Content
Browse
Model
Internal Model
Browsing Strategy
Formulate a Browsing Strategy
Interpret
Interpretation
Spence Diagram
- Visualization is not a linear process !
12Metabolic Pathway visualization
13Metabolic Pathway visualization
KEGG
14Metabolic Pathway visualization
EcoCyc MetaCyc
And many other tools
15Metabolic Pathway visualization
EcoCyc MetaCyc
And many other tools
16Visualization Loop
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
17Importing Data
DB2
DB3
DB4
DB1
- Information is merged in visualization not in
databases - Data is organized under an easy to use and to
exchange format (e. g. XML)
18Importing Data
DB2
DB3
DB4
DB1
- Information is merged in visualization not in
databases - Data is organized under an easy to use and to
exchange format (e. g. XML)
Query engine
19Importing Data
DB2
DB3
DB4
DB1
- Information is merged in visualization not in
databases - Data is organized under an easy to use and to
exchange format (e. g. XML)
Query engine
20Importing Data
- Information is merged in visualization not in
databases - Data is organized using a standard exchange
format (XML)
21KEGG pathways database
Map000100 C1 X10 Y30 C2 X5 Y2 C3 X45 Y99
C1E1-gtC2 C2E3-gtC3 C4E2-gtC3
Map000100 C1-gtC2 C2-gtC3
Map000100 C1 X10 Y30 C2 X5 Y2 C3 X45 Y99
Map000100 C1-gtC2 C2-gtC3
Map000100 C1 X10 Y30 C2 X5 Y2 C3 X45 Y99
Map000100 C1-gtC2 C2-gtC3
Map000100 C1 X10 Y30 C2 X5 Y2 C3 X45 Y99
Map000100 C1-gtC2 C2-gtC3
KGML an XML description for each metabolic
pathway
22Importing Data
- Information is merged in visualization not in
databases - Data is organized using a standard exchange
format (XML)
KGML
23Main steps in visualization.
- Importing Data
- Finding relevant sources
- Organizing data according to future visualization
- Drawing
- Following drawing conventions or porposing new
representations - Providing drawing algorithm
- Linking Data and Drawing
- Assure that data could be access through the
representation (drawing) - Navigation
- Providing synthetical views of data (clustering)
- Enhancing data discovering through navigation
24Drawing
- Providing new representations
- Using deeply rooted drawing conventions in
Metabolic Pathway representations
ViMac
25 26Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
27Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
28Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
29Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
30Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
31Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
32Drawing Algorithms
- Detect strongly connected components
- ? a DAG
- Draw the DAG with a DAG Placement algorithm
- Draw each component with Force Directed Placement
33 34Drawing
- Providing new representations
- Using deeply rooted drawing conventions in
Metabolic Pathway representations
35Drawing
- Providing new representations
- Using deeply rooted drawing conventions in
Metabolic Pathway representations
KEGG
36Drawing
- Providing new representations
- Using deeply rooted drawing conventions in
Metabolic Pathway representations
BIOTAG
37Interacting on metabolic pathwyas
BIOTAG
KEGG
38Drawing
- Our method
- Use KGML files
- The implicit data structure does not match the
KEGG drawing of the network - Data structure transformation
- Place elements according to KGML coordinates
- Compute edge routes
39Drawing
- Our method
- Use KGML files
- The implicit data structure does not match the
KEGG drawing of the network - Data structure transformation
- Place elements according to KGML coordinates
- Compute edge routes
40Drawing
- The network described in KGML is not the one we
want to draw
41Drawing
- Our method
- Use KGML files
- The implicit data structure does not match the
KEGG drawing of the network - Data structure transformation
- Place elements according to KGML coordinates
- Compute edge routes
42Drawing Algorithms
- From KGML data our aim is to compute this
representation
43Drawing Algorithms
- Graphical informations given in KGML files
44Drawing Algorithms
- Graphical informations given in KGML files
45Drawing Algorithms
- Compute barycenter of enzymes
46Drawing Algorithms
- According to the three defined coordinates route
the edge.
47Drawing Algorithms
- According to the three defined coordinates route
the edge.
48Drawing Algorithms
- From KGML data our aim is to compute this
representation
49Drawing Algorithms
- Using KEGG coordinates provided in KGML files
- Routing Edges on a grid.
50Visualization Loop
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
51(No Transcript)
52Linking Data and Drawing
DATA
Visualization
BIOTAG
User
53Linking Data and Drawing
DATA
Visualization
BIOTAG
User
54Visualization Loop
- Import Data
- Clearly separate data from representation
- Organize data according to future visualization
in a separate process - Drawing
- Follow drawing conventions or propose new
representations - Provide drawing algorithms
- Link Data and Drawing
- Make sure that data can be accessed through the
representation (drawing) - Navigation
- Provide direct access to data (multiple views)
- Provide synthetic views of data (clustering)
- Enhance data discovering through navigation
55Navigation Clustering
A. J. Enright PNAS 2002
56Small World Networks
- Short path between each pair of elements
- Each element neighbourhood is densely connected
- Metabolic pathways
- Protein-protein interaction networks
- Social networks
- Software component networks
- Hypermedia networks
- .
57Navigation Clustering
- Giving a synthetical view of data
- According to their values
- Acdording to their organisation (structure)
- Grouping elements
- Manualy
- Automaticaly
Multiscale Visualization of Small World
Networks InfoVis 03.
58Navigation Clustering
- Giving a synthetical view of data
- According to their values
- Acdording to their organisation (structure)
- Grouping elements
- Manualy
- Automaticaly
Multiscale Visualization of Small World
Networks InfoVis 03.
59Navigation Clustering
Software component capture using graph
clustering IWPC 03.
60Navigation Clustering
- Giving a synthetical view of data
- According to their values
- Acdording to their organisation (structure)
- Grouping elements
- Manualy
- Automaticaly
61Navigation keeping context
- When looking closer at an element, keeping the
contextual information - An overview frame
- A Fisheye Semantic Zooming
62Navigation keeping context
- When looking closer at an element, keeping the
contextual information - An overview frame
- A Fisheye Semantic Zooming
63Navigation keeping context
- When looking closer at an element, keeping the
contextual information - An overview frame
- A Fisheye Semantic Zooming
64Conclusion
- Visualization a tool to support data analysis
- Analysis of post-genomic data through metabolic
pathway visualization (Biotag) - Eploratory analysis (Protein-protein / Small
World) - Ongoing work
- Full implementation of fisheye techniques
- Validation of metric-based clustering
65Acknoledgements
- Transcriptome team
- Jacques Marti (Montpellier UM2)
- Oliver Clement (Montpellier UM2)
- David Piquemal (Montpellier UM2)
- Computer Science team
- Guy Melançon (Montpellier LIRMM)
- Isabelle Mougenot (Montpellier LIRMM)
- David Auber (Bordeaux Labri)
- Yves Chiricota (Chicoutimi UQAM)
66Thank you for your attention
67Strength Metric on edges
e
u
v
68Strength Metric on edges
e
u
v
Wuv
?3(e)
Mu Mu Wuv
69Strength Metric on edges
e
u
v
?4(e)