Title: Exon Shuffling
1Exon Shuffling
2Module definition and identification
3A plot for the b chain of hemoglobin from humans.
The radius of the protein is 27Å.
4(No Transcript)
5Module 1
Module 2
Module 3
6Module 2
Module 1
Module 3
7AP apple module EG epidermal growth-factor
F1 F2 fibronectin type-1 type-2 GA
g-carboxy-glutamate domain KR kringle
8Mosaic (or chimeric) protein a protein encoded
by a gene that contains regions also found in
other genes. The existence of such proteins is
evidence for exon shuffling.
9(No Transcript)
10exon shuffling ? mosaic proteins
113 types of exon shuffling exon duplication the
duplication of one or more exons within a gene
(internal duplication) exon insertion
exchange of domains between genes or insertions
into a gene exon deletion the removal of a
segment from a gene.
12Phases of introns and classes of exons
13Consequences of exon duplication deletion
14Consequences of exon insertion into introns
15Some module-coding exons are more prone to
participate in exon shuffling more than others.
One such module is the kringle.
16Exonization Pseudoexonization
Exonization is the process through which an
intronic sequence becomes an exon. Pseudoexonizat
ion is the process through which an exonic
sequence becomes an intron.
17Seneb Family (Dynasty IV) Seneb was Chief of
all the Dwarfs of the Clothing
Exonization leading to Laron Syndrome
Growth hormone receptor
18(No Transcript)
19The aggrecan gene in rat contains 18 paralogous
exons 1 paralogous pseudoexon in intron 12.
20Pseudoexon
21For exonization and pseudoexonization to occur
without disruption of the reading frame, the
rules pertaining to exon insertion and exon
deletion must be respected.
22Principal biochemical reactions in the synthesis
of fatty acids from malonyl CoA in eukaryotes and
eubacteria _______________________________________
__________________________________________________
____________________________ Reaction
Enzyme __________________________________________
__________________________________________________
_________________________ 1. acetyl CoA
condensing-enzyme domain ? acetyl-condensing
enzyme acetyl transferase 2. malonyl
CoA acyl-carrier peptide ? malonyl-acyl-carrier
peptide malonyl transferase 3.
acetyl-condensing enzyme malonyl-acyl-carrier
peptide ? b-ketoacyl-carrier peptide
b-ketoacyl synthase 4. b-keto-acyl carrier
peptide NADPH H ? b-hydroxyacyl-carrier
peptide NADP
b-ketoacyl reuctase 5. b-hydroxyacyl-carrier
peptide ? 2-butenoyl-acyl-carrier peptide
H2O b-hydroxyacyl dehydratase 6.
2-butenoyl-acyl-carrier peptide NADPH H ?
butyryl-acyl-carrier peptide NADP enoyl
reductase 7. butyryl-acyl-carrier peptide
condensing-enzyme domain ? butyryl-condensing
enzyme acyl-carrier peptide
thioesterase _____________________________________
__________________________________________________
______________________________
7 enzymatic activities 1 acyl carrier protein
23Principal biochemical reactions in the synthesis
of fatty acids from malonyl CoA in eukaryotes and
eubacteria _______________________________________
__________________________________________________
____________________________ Reaction
Enzyme __________________________________________
__________________________________________________
_________________________ 1. acetyl CoA
condensing-enzyme domain ? acetyl-condensing
enzyme acetyl transferase 2. malonyl CoA
acyl-carrier peptide ? malonyl-acyl-carrier
peptide malonyl transferase 3.
acetyl-condensing enzyme malonyl-acyl-carrier
peptide ? b-ketoacyl-carrier peptide
b-ketoacyl synthase 4. b-keto-acyl carrier
peptide NADPH H ? b-hydroxyacyl-carrier
peptide NADP
b-ketoacyl reuctase 5. b-hydroxyacyl-carrier
peptide ? 2-butenoyl-acyl-carrier peptide
H2O b-hydroxyacyl dehydratase 6.
2-butenoyl-acyl-carrier peptide NADPH H ?
butyryl-acyl-carrier peptide NADP enoyl
reductase 7. butyryl-acyl-carrier peptide
condensing-enzyme domain ? butyryl-condensing
enzyme acyl-carrier peptide
thioesterase ____________________________________
__________________________________________________
_______________________________
In most bacteria, these functions are carried on
by discrete monofunctional proteins.
24Principal biochemical reactions in the synthesis
of fatty acids from malonyl CoA in eukaryotes and
eubacteria _______________________________________
__________________________________________________
____________________________ Reaction
Enzyme __________________________________________
__________________________________________________
_________________________ 1. acetyl CoA
condensing-enzyme domain ? acetyl-condensing
enzyme acetyl transferase 2. malonyl CoA
acyl-carrier peptide ? malonyl-acyl-carrier
peptide malonyl transferase 3.
acetyl-condensing enzyme malonyl-acyl-carrier
peptide ? b-ketoacyl-carrier peptide
b-ketoacyl synthase 4. b-keto-acyl carrier
peptide NADPH H ? b-hydroxyacyl-carrier
peptide NADP b-ketoacyl
reuctase 5. b-hydroxyacyl-carrier peptide ?
2-butenoyl-acyl-carrier peptide
H2O b-hydroxyacyl dehydratase 6.
2-butenoyl-acyl-carrier peptide NADPH H ?
butyryl-acyl-carrier peptide NADP enoyl
reductase 7. butyryl-acyl-carrier peptide
condensing-enzyme domain ? butyryl-condensing
enzyme acyl-carrier peptide
thioesterase ____________________________________
__________________________________________________
_______________________________
In fungi, the activities are distributed between
two proteins encoded by two unlinked intronless
genes, FAS1 and FAS2. FAS1 encodes 3 of the 8
activities (b-ketoacyl synthase, b-ketoacyl
reductase, and acyl-carrier protein). FAS2
encodes the rest of the five enzymatic
activities.
25In animals, all functions are performed by one
polypeptide, fatty-acid synthase. The gene
product has 8 modules, including one that
performs a dual function and another whose
function is unrelated to fatty-acid synthesis but
may determine the 3D structure of this protein.
26introns-early versus introns-late
27The introns-early hypothesis (the exon theory of
genes)
Ancient genes possessed self-splicing introns.
Most self-splicing introns were lost in
Bacteria and Archaea. In Eucarya, self-splicing
introns evolved into spliceosomal introns.
Walter Gilbert
28The introns-late hypothesis
Ancient genes had no introns. The addition of
introns occurred after the emergence of the
eukaryotic cell and the endosymbiotic process
that gave rise to the mitochondria.
Russell Doolittle
29The introns-late hypothesis
Nuclear spliceosomal introns were derived from
group II self-splicing introns, which in turn
were derived from transposable elements (e.g.,
retrons).
Russell Doolittle
30Molecular mechanisms for both intron gain and
intron loss are known.
31Models and examples of intron loss
32Models and examples of intron gain
33 Roy and Gilbert. 2006. Nature Reviews Genetics
7, 211221
34Dollo estimate (maximum likelihood estimate)
35The introns-early hypothesis is supported by the
correspondence between exons and modules
(concordant positions). However, a huge number
of introns are found in positions not
corresponding with the borders of modules
(discordant positions).
36However, not all introns in discordant positions
can be taken as evidence against the
introns-early hypothesis.
37Intron sliding Introns liding
3835 early introns 65 late introns
39Carmel L, Wolf YI, Rogozin IB, Koonin EV. 2007.
Three distinct modes of intron dynamics in the
evolution of eukaryotes. Genome Res 171034-1044.
Intron gain and loss rates. Node sizes are
proportional to their (known or inferred) intron
density, and the branches are color-coded
(green) predominant intron gain (red)
predominant intron loss (blue) balanced gain and
loss (brown) extensive (significantly greater
than the mean over the tree) gains and losses.
40(No Transcript)
41(No Transcript)
42(No Transcript)
43Genetic map of the single-stranded DNA phage FX174
44(No Transcript)
45Causes of overlapping (a) Deletion of the stop
codon or deletion of a segment that includes the
stop codon.
46Causes of overlapping (b) Point mutation in stop
codon.
47Causes of overlapping (c) Frameshift by deletion
or insertion of segment that is not a multiple of
three nucleotides.
48Inferred causes of overlapping deduced from a
genomic comparison between Mycoplasma genitalium
Mycoplasma pneumoniae Deletion 65 Point
mutation 4 Frameshift 7 Unknown 24
49The rate of evolution in stretches of DNA
encoding overlapping genes is expected to be
lower than in stretches of DNA encoding single
genes.
50(No Transcript)
51Eukaryotic genes (No alternative splicing)
RNA splicing removal of introns ligation of
adjacent exons
52Alternative splicing One gene, several proteins!
splice variant II
splice variant I
532 types of exons Constitutive exons are
included within all the mRNAs transcribed from a
gene. Facultative exons are sometimes spliced
in and sometimes spliced out.
54Conditional Unconditional Alternative Splicing
- Alternative splicing may be unconditional, i.e.,
two or more mRNA variants are produced in all
tissues expressing the gene. - Alternative splicing may be conditional, i.e.,
tissue specific, developmental-stage specific or
physiological-state specific.
55(No Transcript)
56(No Transcript)
57(No Transcript)
58Types of alternative splicing
59In the troponin-T gene, the presence of 5
cassette exons in conjunction with 2 mutually
exclusive exons, allows the production of 64
different proteins from a single gene.
60It is (apparently) very easy to create de novo
splicing sites!
61(No Transcript)
62Trans splicing
63(No Transcript)
64cob/box gene in yeast mitochondria codes for
cytochrome c apoprotein
765
14
1240
417
alternative splicing
111
304
14
840
maturase (423 aa)
65(No Transcript)
66(No Transcript)
67...
F8A
...
F8B
26
23
24
25
22
i25
i24
i23
i22
26
23
24
25
F8B
F8A
Coagulation factor VIII gene
68(No Transcript)
69myoglobin Sulculus diversicolor
YEGVSDAPVTMIGGSAAQSSAMQLLDNLLGVTHSPDKQAFLDEISNYMIP
AHKQLL
YEGVWDTPKKFSGGSAGQSSIFQSLDVLLGVKHDV
GKGAFLQEIREYMPPAHKNFL
indoleamine 2,3-dioxygenase Mus musculus
70Perfection isnt evidence for evolution. Imperfec
tion is.
71(No Transcript)
72RNA editing
73Substitutional editing
74(No Transcript)
75Insertion/deletion editing
When RNA editing is extensive, so that the
resulting RNA bears little resemblance to the DNA
sequence, the DNA template is called a cryptogene.
76(No Transcript)
77(No Transcript)
78(No Transcript)
79Organs of extreme perfection and complication
by Charles Darwin To suppose that the eye,
with all its inimitable contrivances... could
have been formed by natural selection, seems, I
freely confess, absurd to the highest possible
degree.
80There is nothing new in evolution. Omni gena
est gena.
81Bricolage moleculaire (Molecular tinkering)
François Jacob
82Lactalbumin was considered a mammalian novelty
83Lactalbumin
Lysozyme
84in mammals only
throughout the living world
Lactalbumin
Lysozyme
85no enzymatic activity
enzyme
Lactalbumin
Lysozyme
86Lactalbumin B chain of lactose synthetase
lactose synthetase heterodimer (A B)
87The A chain of lactose synthetase also serves as
N-acetyllactosamine synthetase.
88Molecular tinkering is the paradigm of molecular
evolution. Does tinkering drive morphological,
anatomical and physiological evolution?
89The best of all possible worlds!
available
90The evolution of imperfection and the
imperfection of evolution
91(No Transcript)
92genome shuffling ? disruption of synteny
93Mouse-human synteny. Human chromosomes can be cut
into a small number of pieces, then reassembled
into a reasonable approximation of the mouse
genome.