Title: Bioinformatics
1Bioinformatics
2DNA Sequencer
- Perkin Elmer ABI 3770 Automated Sequencer
3(No Transcript)
4Human Genetic Variation
- Disease risk
- Drug response
- Correlations between biological variation and DNA
sequence variation
5Genetics inherited contribution to phenotypic
variation
- Genetic Diversity
- Genetic Identity
6Inherited contribution to risk of type II Diabetes
- Your neighbor (unrelated) 5-10
- Your sibling 30-40
- Your identical twin gt90
7- Virtually all medical conditions (other than
trauma) have a genetic component - So does ability to react to treatments
- Increasingly, genetic information is being used
to diagnose and treat disease - Gene therapy
- Individualized drugs
8Traditional Approach
- Linkage or recombinatorial mapping
- Successful for single gene disorders
- Little success for common complex traits, such as
heart disease, diabetes, asthma, mental disorders
9- The human genome sequence 3.2 billion bp
10- Human Genetic Variation
- SNPs-Single Nucleotide Polymorphisms
GATTTAGATCGCGATAGAG GATTTAGATCTCGATAGAG
11Pharmacogenomics
- The use of DNA sequence information to measure
and predict the reaction of individuals to drugs.
12Pharmacogenomics
- Personalized drugs
- Faster clinical trials through selected trial
populations - Less drug side effects
13Drug Responses
- Absorption
- Distribution
- Activation
- Metabolism
- Excretion
14- Genetic Factors
- Environmental Factors
15Patients
- Responders
- Non-responders
- Toxic responders
16Goals
- Right Drug
- Right Dose
- Right Patient
17Procedure
18Types of SNPs
- Causative SNPs (synonymous)
- coding SNPS
- non-coding SNPs
- Linked SNPs (non-synonymous)
- non-coding SNPs
19To find SNPs in raw sequence ...
20Single Nucleotide Polymorphisms
- How many?
- What kind?
- How do we find them?
21- 99.9 identity - how many differences?
- O.1 of 3.200 Mb 3.2 Mb
- 3.200.000 SNPs
- One SNP every 1,300 bases.
22GATTTAGATCGCGATAGAGGATTTAGATCTCGATAGAG
- Rare Alleles
- ---o--------------------
- -----o------------------
- -------o----------------
- -----------o------------
- ---------------o--------
- -------------------o----
- Many
- Common Alleles
- ----o-------------------
- ----o-------------------
- ----o-------------------
- --------------------o---
- --------------------o---
- --------------------o---
- Few
23Raw Genome Data
24Database mining
- Discover new SNPs
- Contig overlaps
- Comparison along entire sequence
- Comparing redundant EST sequences
- Validate SNPs
- Sequencing
- Microchips
25SNPs in Overlapping Genomic Sequences
Overlapping BACs from library
50 of overlaps contain polymorphisms
26- gtgbBE588357.1BE588357 194087 BARC 5BOV Bos
taurus cDNA Length369 - Score 272 bits (137), Expect 4e-71
- Identities 258/297 (86), Gaps 1/297 (0)
- Strand Plus / Plus
- Query 17 aggatccaacgtcgctccagctgctcttgacgactccac
agataccccgaagccatggca
- Sbjct 1 aggatccaacgtcgctgcggctacccttaaccact-c
gcagaccccccgcagccatggcc - Query 77 agcaagggcttgcaggacctgaagcaacaggtggagggg
accgcccaggaagccgtgtca -
- Sbjct 60 agcaagggcttgcaggacctgaagaagcaagtggaggg
ggcggcccaggaagcggtgaca - Query137 gcggccggagcggcagctcagcaagtggtggaccaggcca
cagaggcggggcagaaagcc -
- Sbjct 120 tcggccggaacagcggttcagcaagtggtggatcaggcc
acagaagcagggcagaaagcc -
- Query 197 atggaccagctggccaagaccacccaggaaaccatcgac
aagactgctaaccaggcctct -
- Sbjct 180 atggaccaggttgccaagactacccaggaaaccatcga
ccagactgctaaccaggcctct
27(No Transcript)
28(No Transcript)
29Validation
- Correlation between phenotype and SNP
- Recombinatorial linkage vs. SNP linkage
30GATTTAGATCGCGATAGAGGATTTAGATCTCGATAGAG
- Common Alleles
- ----o-------------------
- ----o-------------------
- ----o-------------------
- --------------------o---
- --------------------o---
- --------------------o---
- Few
- Rare Alleles
- ---o--------------------
- -----o------------------
- -------o----------------
- -----------o------------
- ---------------o--------
- -------------------o----
- Many
31Data from 24 individuals for the marker S gene
1118
32GATTTAGATCGCGATAGAGGATTTAGATCTCGATAGAG
- Responders
- ---o----o--------o----o------o----o--o-
- ---o----o--------o----o-----------o--o-
- ---o-o--o--------o----o-----------o--o-
- Non-responders
- --------o--------------------o----o--o-
- ----o---o--------------------o----o--o-
- --------o--------------------o----o--o-
33Microarrays and Genechips
- Expression Studies
- measure RNA levels
- allow study of genes under different conditions,
from different tissues, and/or different
developmental stages - Human Genetic Variation
- Single nucleotide polymorphisms (SNPs)
34cDNA spotted microarrays
355'
Cleavage site
C
5'
C
5'
Release of Probe Arm
G
Probe arm becomes invader in secondary reaction
Cleavage site
F
5'
C
Release of fluorescent molecule
F
Invader Reaction
36Cleavage site
5-CGCGCCGAGG
ATTCCAGCCAGTGGGGA-3 Downstream primary probe
5-CGGCAGCTTCCT CGG CCATTGCT Upstream invader
3-GCCGTCGAAGGAGCCGGTAACGTAAGGTCGGTCACCCCT-5
Target
Cleavage site
5-F-T
CTCGTCTCGG
T
T
Signal Probe
5-CGCGCCGAGGA
T
T
GCGCGGCTCCAGAGCAGAGCC
5-F-T
Invader Reaction for marker S gene 1118
37 Data from 24 individuals for the marker S
gene 1118