A Systematic approach to the LargeScale Analysis of GenotypePhenotype correlations - PowerPoint PPT Presentation

1 / 32
About This Presentation
Title:

A Systematic approach to the LargeScale Analysis of GenotypePhenotype correlations

Description:

A Systematic approach to the Large-Scale Analysis of Genotype-Phenotype correlations ... Steve Kemp many Others. Preliminary Results. Trypanosomiasis resistance ... – PowerPoint PPT presentation

Number of Views:67
Avg rating:3.0/5.0
Slides: 33
Provided by: AntoonG3
Category:

less

Transcript and Presenter's Notes

Title: A Systematic approach to the LargeScale Analysis of GenotypePhenotype correlations


1
A Systematic approach to the Large-Scale Analysis
of Genotype-Phenotype correlations
Paul Fisher Dr. Robert Stevens Prof. Andrew Brass
2
Genotype
  • The entire genetic identity of an individual that
    does not show any outward characteristics, e.g.
    Genes, mutations

Genes
DNA
Mutations
ACTGCACTGACTGTACGTATATCT ACTGCACTGTGTGTACGTATATCT
3
Phenotype
  • (harder to characterise)
  • The observable expression of genes producing
    notable characteristics in an individual, e.g.
    Hair or eye colour, body mass, resistance to
    disease

vs.
Brown
White and Brown
4
Genotype to Phenotype
5
Current Methods
Genotype
Phenotype
200
?
What processes to investigate?
6
Phenotype
Genotype
200
?
Metabolic pathways
Phenotypic response investigated using microarray
in form of expressed genes or evidence provided
through QTL mapping
Genes captured in microarray experiment and
present in QTL (Quantitative Trait Loci ) region
Microarray QTL
7
Phenotype
Pathway A
CHR
literature
Pathway linked to phenotype high priority
QTL
Gene A
Pathway B
Gene B
literature
Pathway not linked to phenotype medium priority
Gene C
Pathway C
literature
Genotype
Pathway not linked to QTL low priority
8
Issues with current approaches
9
Huge amounts of data
QTL region on chromosome
Microarray
1000 Genes
200 Genes
How do I look at ALL the genes systematically?
10
Hypothesis-Driven Analyses
200 QTL genes
Pick the genes involved in immunological process
Case African Sleeping sickness - parasitic
infection - Known immune response
40 QTL genes
Pick the genes that I am most familiar with
2 QTL genes
  • Result African Sleeping sickness
  • Immune response
  • Cholesterol control
  • Cell death

Biased view
11
Manual Methods of data analysis
No explicit methods
Tedious and repetitive
Human error
Navigating through hyperlinks
12
Implicit methods
13
Issues with current approaches
  • Scale of analysis task
  • User bias and premature filtering
  • Hypothesis-Driven approach to data analysis
  • Constant flux of data - problems with
    re-analysis of data
  • Implicit methodologies (hyper-linking through web
    pages)
  • Error proliferation from any of the listed issues
  • Solution Automate through workflows

14
The Two Ws
  • Web Services
  • Technology and standard for exposing code /
    database with an means that can be consumed by a
    third party remotely
  • Describes how to interact with it
  • Workflows
  • General technique for describing and executing a
    process
  • Describes what you want to do

15
Taverna Workflow Workbench
http//taverna.sf.net
16
Hypothesis
  • Utilising the capabilities of workflows and the
    pathway-driven approach, we are able to provide a
    more
  • - systematic
  • - efficient
  • - scalable
  • - un-biased
  • - unambiguous
  • the benefit will be that new biology results
    will be derived, increasing community knowledge
    of genotype and phenotype interactions.

17
QTL mapping study
Microarray gene expression study
Statistical analysis
Identify genes in QTL regions
Identify differentially expressed genes
Genomic Resource
Annotate genes with biological pathways
Annotate genes with biological pathways
Pathway Resource
Select common biological pathways
Hypothesis generation and verification
Wet Lab
Literature
18
Replicated original chain of data analysis
19
Trypanosomiasis in Africa
Steve Kemp
Andy Brass
many Others
http//www.genomics.liv.ac.uk/tryps/trypsindex.htm
l
20
Preliminary Results
  • Trypanosomiasis resistance
  • A strong candidate gene was found
  • Daxx gene not found using manual investigation
    methods
  • The gene was identified from analysis of
    biological pathway information
  • Possible candidate identified by Yan et al
    (2004) Daxx SNP info
  • Sequencing of the Daxx gene in Wet Lab showed
    mutations that is thought to change the structure
    of the protein
  • Mutation was published in scientific literature,
    noting its effect on the binding of Daxx protein
    to p53 protein p53 plays direct role in cell
    death and apoptosis, one of the Trypanosomiasis
    phenotypes
  • More genes to follow (hopefully) in publications
    being written

21
Shameless Plug!
A Systematic Strategy for Large-Scale Analysis of
Genotype-Phenotype Correlations Identification
of candidate genes involved in African
Trypanosomiasis Fisher et al., (2007) Nucleic
Acids Research doi10.1093/nar/gkm623
  • Explicitly discusses the methods we used for the
    Trypanosomiasis use case
  • Discussion of the results for Daxx and shows
    mutation
  • Sharing of workflows for re-use, re-purposing

22
Recycling, Reuse, Repurposing
Heres the Science!
  • Identified a candidate gene (Daxx) for
    Trypanosomiasis resistance.
  • Manual analysis on the microarray and QTL data
    failed to identify this gene as a candidate.
  • Unbiased analysis. Confirmed by the wet lab.

Heres the e-Science!
  • Trypanosomiasis mouse workflow reused without
    change in Trichuris muris infection in mice
  • Identified biological pathways involved in sex
    dependence
  • Previous manual two year study of candidate genes
    had failed to do this.

Workflows now being run over Colitis/
Inflammatory Bowel Disease in Mice (without
change)
23
Recycling, Reuse, Repurposing
  • Share
  • Search
  • Re-use
  • Re-purpose
  • Execute
  • Communicate
  • Record

http//www.myexperiment.org/
24
What next?
  • More use cases??
  • Can be done, but not for my project
  • Text Mining !!!
  • Aid biologists in identifying novel links between
    pathways
  • Link pathways to phenotype through literature

25
QTL mapping study
Microarray gene expression study
Statistical analysis
Identify genes in QTL regions
Identify differentially expressed genes
Genomic Resource
Annotate genes with biological pathways
Annotate genes with biological pathways
Pathway Resource
Select common biological pathways
Hypothesis generation and verification
Wet Lab
Literature
26
What Does the Text Hold?
Protein Info
Related Proteins
Protein-Protein Interactions
Pathways
Biological processes
27
What Next ?
Biological processes
Generate a Profile for Pathway / Phenotype
Apoptosis Cell Death Stress response ..
28
Two Profiles Phenotype and Pathway
Find common terms
  • Phenotype Terms
  • Apoptosis
  • Cholesterol
  • Diabetes
  • Jak-STAT
  • Ribosome
  • Cell Adhesion Molecules
  • Pathway terms
  • Apoptosis
  • Cholesterol
  • Diabetes
  • Cell Death
  • JNK pathway

High chance pathway is linked to phenotype
29
Link Phenotype to Pathways
Find common terms
  • Phenotype Terms
  • apoptosis
  • Cholesterol
  • Diabetes
  • Jak-Stat
  • Ribosome
  • Cell Adhesion Molecules

apoptosis Cell Death JNK pathway
Cholesterol Cell Death JNK pathway
Simple means of linking pathways
  • apoptosis
  • Cholesterol
  • Diabetes
  • Cell Death
  • JNK pathway
  • Another pathway

30
The Prototype Workflows
2
1
Get terms from abstracts
3
Find common terms
Get abstracts for pathways / phenotype
31
To Sum Up .
  • Need for Genotype-Phenotype correlations with
    respect to disease control
  • High-throughput data can provide links between
    Genotype and Phenotype
  • Highlighted issues with manually conducted in
    silico experiments
  • Improved the methods of current microarray and
    QTL based investigations through systematic
    nature
  • Increased reproducibility of our methods
  • - workflows stored in XML based schema
  • - explicit declaration of services, parameters,
    and methods of data analysis
  • Shown workflows are capable of deriving new
    biologically significant results
  • African Trypanosomiasis in the mouse
  • Infection of mice with Trichuris muris
  • The workflows require expansion to accommodate
    new analysis techniques text mining

32
Many thanks to
including Joanne Pennock, EPSRC, OMII, myGrid,
and lots more people
Write a Comment
User Comments (0)
About PowerShow.com