Title: presentation ppt
1Human non-synonymous SNP molecular function,
evolution and disease
Shamil Sunyaev Genetics Division, Brigham
Womens Hospital Harvard Medical
School Harvard-M.I.T. Division of HST
2(No Transcript)
3Predicting the effect of mutations in proteins
4Why is this useful?
- Understanding variation in molecular function and
structure - Evolutionary genetics comparison of polymorphism
and divergence rates between different functional
categories is a robust way to detect selection
5Linkage analysis
Rare
6Classical association studies
Common
Control
Disease
7Why is this useful?
- Rare human developmental disorders / mouse
mutagenesis screens linkage studies are
impossible - Genetics of complex disease SNP prioritization
- Genetics of complex disease Rare variants
8Technically, polymorphism should not exist!
9Mendelists Biometricians
Forces to maintain variation
Selection
Mutation
10Common disease / Common variant
Trade off (antagonistic pleiotropy) Balancing
selection Recent positive selection Reverse in
direction of selection
Examples
APOE Alzheimers disease AGT Hypertension C
YP3A Hypertension CAPN10 Type 2 diabetes
11Individual human genome is a target for
deleterious mutations !
Frequency of deleterious variants is directly
proportional to mutation rate (qm/s)
40 of human Mendelian diseases are due to
hypermutable sites
12Multiple mostly rare variants
Many deleterious alleles in mutation-selection
balance
Examples
Plasma level of HDL-C Plasma level of
LDL-C Colorectal adenomas
13What about late onset phenotypes?
14Harmful mutations
- Advantageous pseudogenization (Zhang et al. 2006)
- Gain of function disease mutations
- Sickle Cell Anemia
- Function damaging
- Evolution deleterious
- Phenotype detrimental
15(No Transcript)
16protein
multiple alignment
profile
17PolyPhen
18Prediction rate of damaging substitutions
possibly probably
82 57
Disease mutations
9 3
Divergence
Polymorphism
27 15
1910 of PolyPhen false-positives are due to
compensatory substitutions
20Williamson et al., PNAS 2005
Estimate of selection coefficient
Phylogenetic measures
PAM-120 -5.32 -8.35 -12.76
BLOSUM-45 -8.41 -3.96 -13.39
BLOSUM-62 -8.41 -4.09 -12.75
BLOSUM-80 -8.46 -4.49 -13.52
Site-specific structural/phylogenetic measures
-23.602
-6.072
-11.732
Polyphen
21de novo mutation effect spectrum
Effect of new mutation may range from lethal, to
neutral, to slightly beneficial
22Mutation effect spectrum
?
23Neutral mutation model
Human ACCTTGCAAAT Chimpanzee ACCTTACAAAT Baboon
ACCTTACAAAT
Prob(TAC-gtTGC) ? Prob(TGC-gtTAC)
Prob(XY1Z-gtXY2Z) 64x3 matrix
24Strongly detrimental mutations
25Effectively neutral mutations
26Mildly deleterious mutations
27Mildly deleterious mutations
54 genes, 757 individuals
inflammatory response 236 genes, 46-47 individuals
DNA repair and cell cycle pathways 518 genes,
90-95 individuals
28Frequency itself is a reliable predictor of
function!
The majority of missense mutations observed at
frequency below 1 are deleterious
29Fitness and selection coefficient
30Mildly deleterious mutations
54 genes, 757 individuals
inflammatory response 236 genes, 46-47 individuals
DNA repair and cell cycle pathways 518 genes,
90-95 individuals
31Fraction of detectable polymorphism
32Estimation of selection coefficient - simulation
present
Human effective population size
10010011001111010100100101110101000011110011000111
00010111001
past
33Estimation of selection coefficient - simulation
present
Human effective population size
Fsingl(s)
FMAFgt25(s)
SNP probability to be observed
past
Selection coefficient
-log(s)
34Classical association studies
Common
Control
Disease
35Mutation enrichment association studies
Rare
Control
Disease
36Mutation enrichment association studies
Rare
Control
Disease
37Mutation enrichment association studies
Rare missense variants in NPC1L1 gene contributes
to variability in cholesterol absorption and
plasma levels of low-density lipoproteins
(LDLs) Cohen J et al., PNAS 2006 in
press
Nonsynonymous sequence variants in ABCA1 gene
were significantly more common in individuals
with low HDL-C (ltfifth percentile) than in those
with high HDL-C (gt95th percentile). Coh
en J et al., Science 2004
Multiple rare variants in different genes account
for multifactorial inherited susceptibility to
colorectal adenomas Fearnhead NS et
al., PNAS 2004
38Cholesterol
39Adopted from Brewer et al., 2003
40Effect of rare nsSNPs on HDL-C
41What about common alleles of smaller effect?
- Population of 3500 individuals with known plasma
levels of HDL-C - Population includes both genders and three ethnic
groups - 839 SNPs genotyped
- Independent population of 800 individuals for
validation
42What about common alleles of smaller effect?
- Introduce a linear model (ANCOVA)
- Subsequently add SNPs to the linear model
- Include SNPs based on the likelihood ratio test
- Prioritizing SNPs based on conservation did not
help
43Effect of common SNPs on HDL-C
HDL
44And a different population
HDL
45Acknowledgements
- The lab
- Gregory Kryukov, Steffen Schmidt, Saurabh
Asthana, Victor Spirin, Ivan Adzhubey - Bioinformatics Human genetics
- Vasily Ramensky Jonathan Cohen
-
-
-