Title: What is evolution
1What is evolution?
2Evolution A process of change in a certain
direction. A process of gradual and peaceful
advance. The historical development of a
biological group. A theory that the various types
of animals and plants have their origin in other
preexisting types and that the distinguishable
differences are due to modifications in
successive generations.
3Evolution A change in the composition of the
gene pool.
4Possible Changes 1. allele frequencies 2.
genotype frequencies
1.
2.
51. Changes in allele frequencies are
important. 2. Changes in genotype frequencies are
not so important.
Selection Random genetic drift
1.
2.
Deviations from random mating Migration
6Deviation from random mating By genetic
similarity Assortative mating Disassortative
mating By genetic relatedness Inbreeding Out
breeding
7disassortative
assortative
8Inbreeding
Cousins are preferred mates in many human
cultures.
9The fish Rivulus marmoratus exhibits the most
extreme form of inbreeding Selfing
10Even if extreme deviations from random mating
occur in all generations, allele frequencies
remain constant.
11Selection
12Selection The differential reproduction of
genetically distinct individuals (genotypes)
within a population. Differential reproduction
is caused by differences among individuals in
such traits as (1) mortality, (2) fertility
(offspring), (3) fecundity (gametes), (4) mating
success, and (5) viability of offspring.
13Natural selection is predicated on the
availability of genetic variation among
individuals in characters related to reproductive
success (variation in fitness).
14Variability
15Arashnia levana Non-genetic variability.
16Helix aspersa Genetic variability.
17Variability
18Genetic? Fitness related?
19Synonymous and nonsynonymous genetic variability.
20The fitness (w) of a genotype is a measure of the
individuals ability to survive and reproduce.
The size of a population is constrained by the
carrying capacity of the environment. Thus, an
individuals evolutionary success is determined
not by its absolute fitness, but by its relative
fitness in comparison to the other genotypes in
the population.
21For simplicity
- We assume that fitness is determined solely by
the genetic makeup. - We assume that all loci contribute independently
to fitness (i.e., the different loci do not
interact with one another in a manner that
affects fitness), so that each locus can be dealt
with separately.
22A very simple model (1) One locus A Two
alleles A1 A2 The old allele A1 The new
allele is A2 Three genotypes A1A1, A1A2
A2A2 Each genotype has a typical fitness (w) We
are interested in the fate of A2
23A very simple model (2) The fitness of the old
genotype (A1A1) is set at 1. The relative
fitnesses of the two new possible genotypes (A1A2
A2A2) are defined comparatively as 1 s or 1
t, where s and t are the selection coefficients.
24(No Transcript)
25(No Transcript)
26 Genotype A1A1 A1A2 A2A2 Fitness
w11 w12 w22 Frequency p2 2pq q2
27Change in A2 allele frequency per generation
28 Genotype A1A1 A1A2 A2A2 Fitness
w11 w12 w22 Frequency p2 2pq q2
29codominance
Genotype A1A1 A1A2 A2A2 Fitness 1 1 s 1 2s
30(No Transcript)
31A1 dominance
Genotype A1A1 A1A2 A2A2 Fitness 1 1 1 s
32(No Transcript)
33A2 dominance
codominance
Genotype A1A1 A1A2 A2A2 Fitness 1 1 s 1 s
34Directional Selection
codominance
A2 dominance
A1 dominance
A1 old mutant A2 new mutant
35Tay-Sachs is a recessive allele
36Selection against recessive lethal alleles
b-hexosaminidase A is a lysosomal. It is encoded
by a gene on chromosome 15.
37Selection against recessive lethal alleles
b-hexosaminidase-A catalyzes the removal of
N-acetylgalactosamine from GM2 ganglioside,
thereby degrading and removing it from the
nervous system.
38Selection against recessive lethal alleles
Absence of b-hexosaminidase-A
? Accumulation of GM2 ganglioside in neurons.
39Tay-Sachs is a recessive lethal alleles
Symptoms of classical Tay-Sachs disease first
appear at 4 to 6 months of age when an apparently
healthy baby gradually stops smiling, crawling or
turning over, loses its ability to grasp or reach
out and, eventually, becomes blind, paralyzed and
unaware of its surroundings. Death occurs by age
3-5.
Cherry-red spot from an infant with Tay-Sachs
disease.
40Selection against recessive lethal alleles
allele frequency
recessive allele
41Inefficiency of selection against recessive allele
42It is difficult to rid a population of recessive
alleles, because they hide behind the back of
dominant alleles, and are not exposed to
selection. If q 50, then 50 of all recessive
alleles are in heterozygous state and 50 are
subject to selection. If q 10, then 98 of
all recessive alleles are in heterozygous state
and 2 are subject to selection. If q 1, then
99.98 of all recessive alleles are in
heterozygous state and 0.02 are subject to
selection.
43Selection against dominant lethal alleles
Dr. George Sumner Huntington 1850-1916
Protein huntingtin Gene 180 Kb (chromosome
4) Exons 67 Amino acids 3,141 Mode autosomal
dominant
44Selection against dominant lethal alleles
45It should be easy to rid a population of dominant
alleles, because all of them are exposed to
selection at all frequencies. So why are there
dominant lethal diseases?
461. Recurrent mutations.
472. Late age of onset.
48Balancing Selection
49(No Transcript)
50Overdominance
codominance
Genotype A1A1 A1A2 A2A2 Fitness 1 1 s 1
t s gt 0 s gt t Depending on whether the fitness
of A2A2 is higher than, equal to, or lower than
that of A1A1, t can be positive, zero, or
negative, respectively.
51The change in the frequency of A2 from generation
to generation is
52At equilibrium, i.e., when ?q 0.
53overdominance
underdominance
s 0.04 and t 0.02
s - 0.02 and t - 0.01
54Overdominant selection is inherently inefficient,
even if the two homozygotes are not viable.
RIP
Powderpuff
Chinese Crested
55Random Genetic Drift
56Drift
Allele frequency changes can occur by chance, in
which case the changes are not directional but
random. An important factor in producing
changes in allele frequencies is the random
sampling of gametes during reproduction.
57Niche capacity 10 plants
58- A simple idealized model
- All the individuals in the population have
the same fitness (selection does not operate). - The generations are nonoverlapping.
- Adult population size is finite does not
change from generation to generation. - Gamete population size is infinite.
- The population is diploid (N individuals, 2N
alleles). - One locus with two alleles, A1 and A2, with
frequencies p and q 1 p, respectively.
59(No Transcript)
60When 2N gametes are sampled from the infinite
gamete pool, the probability, Pi, that the sample
contains exactly i alleles of type A1 is given by
the binomial probability function
Since Pi is always greater than 0 for populations
in which the two alleles coexist (i.e., 0 lt p lt
1), the allele frequencies may change from
generation to generation without the aid of
selection.
61(No Transcript)
62The process of change in allele frequency due
solely to chance effects is called random genetic
drift.
63Fixation
64(No Transcript)
65Magnitude of fluctuations depends on population
size. Large population Small fluctuations.
Small population Large fluctuations. Mean
time to fixation or loss depends on population
size. Large population Long time. Small
population Short time.
66(No Transcript)
67yet
68(No Transcript)
69In mathematical terms, the mean and variance of
the frequency of allele A1 at generation t are
given by Expected mean frequency does not
change with time variance increases with time.
70With each passing generation the allele
frequencies will tend to deviate further and
further from their initial values, however the
change in allele frequencies will not be
systematic in its direction.
71Change in probability of not deviating from
initial frequencies with time
72Effective population size
73Population size the total number of individuals
in a population. From an evolutionary point of
view, however, the relevant size consists of only
those individuals that actively participate in
the reproductive process. This part is called
the effective population size and is denoted by
Ne.
74Why isnt the census size satisfactory?
Some individuals may contribute little to the
reproductive potential of a population
75Sewall Wright (1931) introduced the concept of
effective population size, which he rigorously
defined as the size of an idealized population
that would have the same effect of random
sampling on allele frequencies as that of the
actual population.
76Assume a population with census size N and
frequency of allele A1 at generation t p. If
the number of individuals taking part in
reproduction N, then the variance of the
frequency of allele A1 in the next generation,
pt1, may be obtained from the variance equation
by setting t 1.
77(No Transcript)
78Gene Substitution
Gene substitution is the process whereby a mutant
allele completely replaces the predominant or
wild type allele in a population. Gene
substitution occurs when a mutant allele arises
in a population as a single copy and, then,
increases its frequency to 1 (i.e., becomes
fixed) after a certain number of generations.
79Not all mutants, however, reach fixation. In
fact, the majority of them are lost after a few
generations.
80(No Transcript)
81For a neutral mutation, i.e., s 0 For
positive values of s and large values of
N
82Thus, if an advantageous mutation arises in a
large population and its selective advantage over
the rest of the alleles is small (up to 5),
then the fixation probability is approximately
twice its selective advantage. For example, the
probability of fixation of a new codominant
mutation with s 0.01 is 2. Out of all
advantageous mutations with a selective advantage
of 1, 98 will be lost.
83Probabilities of Fixation
84Conditional Fixation Time In the case of a new
neutral mutation whose initial frequency in a
diploid population is by definition q 1/(2N),
the mean conditional fixation time is
approximated by For a mutation with a
selective advantage of s, the mean conditional
fixation time is approximated by
85Conditional Fixation Times
86Rate of Gene (or Allele) Substitution number
of mutants reaching fixation per unit time
87Rate of Gene Substitution Neutral mutations
If neutral mutations occur at a rate of u per
gene per generation, then the number of mutants
arising at a locus in a diploid population of
size N is 2Nu per generation. The probability
of fixation for each neutral mutation is 1/(2N).
The rate of substitution of neutral alleles is
obtained by multiplying the total number of
mutations by the probability of their fixation.
88(No Transcript)
89(No Transcript)
90Intuitive explanation In a large population the
number of mutations arising in every generation
is high, but the fixation probability of each
mutation is low. In a small population the
number of mutations arising in every generation
is low, but the fixation probability of each
mutation is high. The rate of substitution for
neutral mutations is independent of population
size.
91Rate of Gene Substitution Advantageous
mutations If advantageous mutations occur at a
rate of u per gene per generation, then the
number of mutants arising at a locus in a diploid
population of size N is 2Nu per generation. The
probability of fixation for each mutation is
2s. The rate of substitution of advantageous
alleles is 4Nsu.
92Deleterious mutations
Neutral mutations
Overdominant mutations
Advantageous mutations
93 New mutations may be 1. Advantageous 2.
Overdominant 3. Deleterious 4. Neutral
94Probability of fixation 1. Advantageous -
low 2. Overdominant close to zero (genetic
load) 3. Deleterious - very very low 4. Neutral
- very low
95Conditional time to fixation 1. Advantageous -
fast 2. Overdominant - extremely slow 3.
Deleterious - fast 4. Neutral - slow
96Amount of variability created 1. Advantageous -
none 2. Overdominant - a lot 3. Deleterious -
none 4. Neutral - some
973 types of evolutionary explanations
- Mutationism evolutionary phenomena are
explained by the joint effects of mutational
input and random genetic drift.
983 types of evolutionary explanations
- Neutralism evolutionary phenomena are explained
by the joints effects of mutation, random genetic
drift, and purifying selection.
993 types of evolutionary explanations
- Selectionism evolutionary phenomena are
explained by the joints effects of advantageous
selection and balancing selection.
100Selectionism
- Gene substitutions occur as a consequence of
selection for advantageous mutations.
Polymorphism is maintained by balancing
selection. - Gene substitution and polymorphism are two
separate phenomena driven by different
evolutionary forces.
101Selectionism leads to the Panglossian
paradigm It is proved that things cannot be
other than they are, for since everything was
made for a purpose, it follows that everything is
made for the best purpose. Dr. Pangloss in
Candide by Voltaire
102The Panglossian paradigm Observe, for
instance, the nose is formed for spectacles,
therefore we wear spectacles.
103Abbott Handerson Thayer 1849-1921
104(No Transcript)
105(No Transcript)
106(No Transcript)
107(No Transcript)
108Neutralism, or The Neutral Theory of Molecular
Evolution
Motoo Kimura
109The Neutral Theory of Evolution (Neutralism)
considers mutation, random genetic drift, and
purifying selection to be the main driving forces
of the evolutionary process.
110The neutral theory of molecular evolution
contends that at the molecular level the majority
of evolutionary changes and much of the
variability within species are caused by random
genetic drift of mutant alleles that are
selectively neutral or nearly so.
111Neutrality, in the sense of the neutral theory,
does not imply strict equality in fitness for all
alleles. It only means that the fate of alleles
is determined largely by random genetic drift.
112Selection may operate, but its intensity is too
weak to offset the influences of chance effects.
?s?lt 1/(2Ne) Ne effective population size.
113According to the neutral theory, the frequency of
alleles is determined by purely stochastic rules,
and the picture that we obtain at any given time
is merely a transient state representing a
temporary frame from an ongoing dynamic process.
114The neutral theory regards substitution and
polymorphism as two facets of the same
phenomenon. Gene substitution is a long and
gradual process whereby the frequencies of mutant
alleles increase or decrease randomly, until the
alleles are ultimately fixed or lost by chance.
115Polymorphic loci consist of alleles that are
either on their way to fixation or are about to
become extinct.
Thus, at any given time, some loci will possess
alleles at frequencies that are neither 0 nor
100. These are the polymorphic loci.
116All molecular manifestations that are relevant to
the evolutionary process should be regarded as
the result of a continuous process of mutational
input and a concomitant random extinction or
fixation of alleles.
117- A single evolutionary force.
- Genetic polymorphism is transient - allele
frequencies fluctuate with time and the
polymorphic alleles themselves are continuously
replaced.
118A population that is free from selection can
accumulate many polymorphic neutral alleles.
Then, if a change in ecological circumstances
occurs, some of the neutral alleles will no
longer be neutral but deleterious, against which
purifying selection may operate. After these
alleles are removed, the population will become
more adapted to its new circumstances than
before. Kimura (1983)
119(No Transcript)
120(No Transcript)
121The neutralist-selectionistdebate
122(No Transcript)
123(No Transcript)
124Fact 1 The cheetah is about to become extinct.
125Fact 2 Cheetah populations are devoid of
genetic variation.
126Two explanations
1. Selectionist explanation The cheetahs lack
genetic variation and, therefore, they are on the
verge of extinction.
127Two explanations
2. Neutralist explanation The cheetahs are on
the verge of extinction and, therefore, they lack
genetic variation.
128(No Transcript)
1291922, Baja California, 20 animals 2000, Baja
California, 120,000 animals Genetic variation
0
130Helix aspersa (brown garden snail)
H 0.121
Europe
1850
California
few individuals
131Helix aspersa (brown garden snail)
H 0.121
Europe
1990
California
Large populations. Important pests of citrus
trees.
H 0.000
132Neutrality tests
133(No Transcript)
134(No Transcript)
135(No Transcript)
136(No Transcript)
137(No Transcript)
138(No Transcript)
139(No Transcript)
140(No Transcript)
141Numbers of synonymous and nonsynonymous fixed
differences and polymorphisms at the
glucose-6-phosphate dehydrogenase locus between
Drosophila melanogaster and D. simulans The
comparisons are based on 32 sequences from D.
melanogaster and 12 sequences from D. simulans,
with an aligned length of 1,705 bp.
142The essence of the dispute between neutralists
and selectionists basically concerns the
distribution of fitness values of mutant alleles.
143Both theories agree that most new mutations are
deleterious, and that these mutations are quickly
removed from the population so that they
contribute neither to the rate of substitution
nor to the amount of polymorphism within
populations.
144The difference concerns the relative proportion
of neutral mutations among nondeleterious
mutations. While selectionists claim that very
few nondeleterious mutations are selectively
neutral, neutralists maintain that the vast
majority of nondeleterious mutations are
effectively neutral.
145(No Transcript)
146(No Transcript)
147 Detecting Positive Darwinian Selection
148(No Transcript)
149(No Transcript)
150Deleterious mutations
Neutral mutations
synonymous
Overdominant mutations
Advantageous mutations
nonsynonymous
151(No Transcript)
152(No Transcript)
153Prevalence of positive selection
In humans, estimates of the prevalence of
positive selection in protein-coding genes range
from 0.02 to 8.7 (a 435-fold range). In other
words, 91.30 to 99.98 of all genes evolve in a
neutral fashion.
154Codon Usage
155Codon-usage bias
156Measures of codon-usage bias
157The relative synonymous codon usage (RSCU) is the
number of times a codon appears in a gene divided
by the number of expected occurrences under equal
codon usage. n number of synonymous codons
(1 ? n ? 6) for the amino acid under study, Xi
number of occurrences of codon i. If the
synonymous codons of an amino acid are used with
equal frequencies, their RSCU values will equal
1.
158The codon adaptation index (CAI) measures the
degree with which genes use preferred codons.
We first compile a table of RSCU values for
highly expressed genes. From this table, it is
possible to identify the codons that are most
frequently used for each amino acid. The relative
adaptiveness of a codon (wi) is computed
as where RSCUmax the RSCU value for the
most frequently used codon for an amino acid.
159The CAI value for a gene is calculated as the
geometric mean of wi values for all the codons
used in that gene. where L number of
codons.
160The effective number of codons (ENC) where Fi
(i 2, 3, 4, or 6) is the average probability
that two randomly chosen codons for an amino acid
with i codons will be identical. ENC values
range from 20 (the number of amino acids), which
means that the bias is at a maximum, and only one
codon is used from each synonymous-codon group,
to 61 (the number of sense codons), which
indicates no codon-usage bias.
161(No Transcript)
162Universal and species-specific patterns of codon
usage
163The genome hypothesis All genes in a genome
tend to have the same coding strategy. That is,
they employ the codon catalog similarly and show
similar choices between synonymous codons.
Different taxa have different coding
strategies.
Richard Grantham
164Universal patterns Codons that contain the CG
dinucleotide are universally avoided (low-usage
codons). This phenomenon is particularly notable
as far as the arginine codons CGA and CGG are
concerned.
165Codon Usage is related to Translation Efficiency
166The translation efficiency of a codon is related
to the relative quantity of tRNA molecules that
recognize the particular codon.
167(No Transcript)
168Codon Usage is related to Translation Efficiency
169Toshimishi Ikemura
170Rules determining choice of optimal codons in
unicellular organisms ____________________________
______ 1. tRNA availability. 2. Preference
for A over G when thiolated uridine or
5-carboxymethyl are at the anticodon wobble
position. 3. Preference for T and C over A
when inosine is at the anticodon wobble
position. 4. Codons of the AAN, ATN, TAN, and
TTN type prefer C in the third codon
position. __________________________________
171Codon usage in multicellular organisms
172(No Transcript)
173Subramanian S. 2008. Genetics 1782429-2432