Title: Transcription in Eukaryotes
1Transcription in Eukaryotes
2Txn in Eukaryotes
- More complex
- Multiple Rpol (I,II,III)
- Require General Txn Factors (or GTFs)
- Txn in chromatin/nucleosomes complex
- Mediators, various DBP
- More complex promoter structures
3Rpol II Promoters
- Core minimal set of consensus elements to drive
txn on in vitro templates. - Ca. 40 bp long, 5 and 3 of start
Note usually a core has only 2 or 3
Consensus sequences shown for each of these
elements.
4Regulatory element overview
- Promoter proximal
- UAS
- Enhancers
- Slincers
- Boundary elements
- Insulators
10s to 100s kb away
5Initiation by Rpo II in eukarya
- GTFs are like sigma factor
- Help find promoter
- Assist in forming closed/open complex
- Assist in escape of Rpol from initiaion to
elongation phase - Complete set of GTFs on promoter pre-initiation
complex
6Pre-initiation
- Starts at TATA box (-30)
- TATA recognized by TFIID
- Multiptn complex
- Includes TBP TATA binding ptn
- Recognizes and binds TATA site
- TFIID also includes TAFs
- TBP associated factors
7TBP-DNA Complex
- Platform to recruit all factors of GTF and Rpol
II. - Order of assembly studied extensively
- After binding complete melting of template at
promoter occurs - Melt out requires TFIIH ATP energy
- Helicase like activity drives melt
8- The process Txn initiation Rpol II promoter
- DNA/TBP disorts DNA forms plaform
- GTF Assemble IIAgtIIBgtIIF/mediator complex
(chromatin remodeling) - IIEgtIIF then melting
- Abortive initiation/txn (short aborts)
- Rpol II escapes via Phosphorylation of CTD
- Extends tail of RpolI _at_ heptapeptide repeats
(also TFIIH is phosphorylated) - Phosphates alter charge of complex GTFs
dissociate - Other post-tln phos takes place (?)
X52
9TBP Binds/Distorts DNA via b-sheet insert into
minor groove
- TBP binds to TATA via beta-sheet
- Strange! Most DBP are alpha-helices
- TBP widens/flattens TATA minor groove
- BENDS the DNA big time!
- AT pairs favored in the bending event
- residues in beta sheet interact with
sugar-phosphates in DNA further promotes bending
10Beta sheet
DNA BEND
11GTFs
- TAFs TBP associates with ca. 10 TAFs
- Some bind specific DNA sequences (TATA, Inr, DPE)
- Some are like histone ptns (bind similarly)
12TFIIB
- Binds complex AFTER TBP to BRE
Sequence specific binding Ptn bridges BRE and
TATA and Rpol II homology to sigma
13TFIIF
- Two subunit complex
- Recruited to promoter along with Rpol II
- Stabilizes DNA-TBP-TFIIB complex
- Required for IIE and IIH antecedently
14TFIIE and IIH
- 2 subunits in IIE
- Regulates helicase of IIH
- IIH controls transition to open complex
- IIH has many (9) subunits and rivals complexity
of Rpol II - Potent ATPase activity assoc. with TFIIH
15In vivo mediator complex
- GTFs describe Rpol action on naked DNA
- In vivo Nucleosomes
- Chromatin structure requires more factors
- Including MEDIATOR COMPLEX
- Various modifying factors
- Effects are different depend on promoter
- Mediator and modifier factors are very complex
16In vivo mediator complex
- GTFs and Txn activators bind to recruit
nucleosome modifiers/remodeling complexes - Plus Mediator complex helps form the
pre-initiation structure
17Various Mediators
- Variable subunit constituents
- Highly modular modules may dissociate
- Depending on physiological setting
- Due to complexity some question of whether
artifacts or not. - Rpol II holoenzyme like coli?
- A complex can be isolated rpol, GTFs, mediator,
but NOT TFIID - Not clear if actually a holoenzyme
- Note that absence of TFIID raises some concern
18Elongation
- Transition to Elongation Shed GTF/mediator
factors. - Elongation factors recruited (TFIIS)
- Stimulate processive elongation
- Some RNA processing factors enter
- Factors recruited _at_ CTD of Rpol II
- CTD of pol is now phosphorylated
- Phosphorylated form drives exchange of initiation
factors and elongation factors
19- RNA Processing Recruitment strongly localized to
CTD - CTD localized at RNA exit channel up to 800
Angs. Away!
Kinases (P-TEFb) recruited stimulate elongation
CTD
CTD Phos. Domain showing Serine targets Ser 5
Phos splicing factorsSer 2 Phos Capping
factors
20Elongation efficiency
- TFIIS reduces pausing of Rpol at certain
sequences - Not all DNA seq. read by pol at same rate!
- Such sites induce stalling
- TFIIS minimizes this stalling
- TFIIS also contributes to proofreading
- RNase (intrinsic to pol) removes misincorporated
NTPs - Similar to hydrolytic processing via GRE in coli
21ELONGATION AND PROCESSING ARE LINKED
- Euk. RNA processed as primary transcript
- Capping 5
- Poly A addition 3
- Splicing
- Elong. Factors can play roles in processing
- hSPT5 recruits 5 capping enzyme
- TAT-SF1 recruits splicing machinery
Elongation, termination, Processing ALL connected
Ensures proper coordination!
22Capping
- 5 cap is first key modification in transcipt
- Modified G base (methylated) via a weird 5-5
linkage and 3 phosphates
Mechanism of Capping
23- RNA triphosphatase removes g Phos.
- Guan. Trans. Links up GTP as shown
- PPi leaving group helps drive rxn
- Methylation at 7 position of G carried out
PPi
cleaved
245' Caps added when transcript 20-40 nt long
25Poly A and Termination
- Two processes are linked closely
- Rpol CTD recruits factors
- At end of gene Rpol encounters sequences that
attract enzymes for poly A addition at 3 end of
transcript - 3 steps
- Cleavage of transcript
- Addition of poly A
- Termination of txn
26- Two factors
- CPSF (cleavage poly A specificity factor)
- CstF (cleavage stimulation factor)
- Poly A signal seq. stimulates transfer of factors
to RNA - After CPSF/CstF bind other factors recruited
- RNA CLEAVAGE and POLY A FOLLOWS as shown in
Figure left - Rpol continues for few hundred nt after cleavage
of transcript, then stops and extra bit RNA
degraded
Ca. 200 ATemplate independent(only pol II RNA
is poly A)
27Rpol I and III act at distinct promoters and
different sets of Txn factors still need TBP
- Read subset of specialized genes for RNA
- Pol I rRNA
- Single Gene high copy number
- Expressed at VERY high rates
- Explains why dedicated Rpol is required
- Pol III tRNA
- Many different tRNA genes
- High rates of txn still important
28Pol I promoter
2 parts Core and UCE
Recruits Pol I to promoter core
- Txn requires Pol I, UBF and SL1 (includes TBP)
29Pol III promoter
- Unique Feature promoter actually downstream
from start site! - Most have 2 regions (box A,B)
- Some have TATA box as well
- Txn requires TFIIIB, TFIIIC
- TFIIIC binds, recruits B (has TBP) which then
recruits pol III - Thought that IIIC dissociated as Rpol III moves
thru
Pol
30How does Rpol find a promoter?
- Consider 3-D Structure of DNA in nucleo
- Trying to locate a promoter in real time is very
difficult - But cell does so. Here is how
Rpol
31Rpol must find promoter in context of DNA/protein
structure of genome
- Assume promoter is ca. 60 bp (E. coli)
- Genome 4 x 106 bp
- Selectivity Pol must specifically locate
0.0000002 of genome - Stated differently Genome has 4Mbp
- This 4million possible binding sites for Rpol
- Technically must find efficiently one out of
4 million! - In reality its much less since promoters are
redundant (consensus based) and Rpol molecules
are as well.
32- Rpol distribution in cell (in vivo)
- Core and holoenzyme are all thought to be DNA
bound - VERY little is free
- Excess core is in loose complexes (scanning)
33Rpol has general/weak affinity for normal B-form
DNA
- For Rpol to find promoter it must
- Dissociate from site 1Find site 2
- Bind site 2
- Movement of Rpol is DIFFUSION LIMITED (for a 60
bp site rate constant MUST be less than
10-8M-1sec-1 (max diffusion rate for a molecule
to move through medium is less than 10-8M-1sec-1) - Actual rate in vitro is greater than this (or
equal to this value). - If this applies in vivo time required for
successive cycles of dissoc/assoc. is too great
to account for txn responses
34Conceptually Holoenzyme must release and rebind
to find promoter. The rate is limited by
diffusion ie, how fast a macromolecule can
migrate at random through a physiological
solution at 37oC. BUT. This process is MUCH
MUCH faster! Thus Diffusion cannot explain how
Rpol finds a target promoter inside the cell
35Rpol searches are NOT diffusion limited
36- Rpol locating binding sites.
- Significantly speeded up if the initial target
for RNA polymerase is the whole genome, - Not just a specific promoter sequence.
- By increasing the target size (genome) rate
constant for diffusion to DNA increases - No longer limiting.
- MODEL one bound sequence directly displaced by
another sequence. - Thus, enzyme exchanges one sequence with another
sequence very rapidly - Continues to exchange sequences until a promoter
is found.
37- Searching much faster
- WHY?
- - Association/dissociation virtually
simultaneous - - NO time wasted commuting between sites
38Rpol binds VERY rapidly to random DNA sites
Could find promoter by direct displacement of
bound sequence
39Protein exchange of DBP (DNA binding proteins)
- Could be linear diffusion
- Could be 3-D intersegment transfer
- Most probably 3-D transfer
- Important point
- All sequence specific DNA binding proteins bind
DNA in a non-specific (non-seq) dependent mode
first. - This initiates the search for specific site
40What Drives intersegment transfer of DBP in the
search mode?
ENTROPY
HOW?
41Search is entropically driven
- FIRST DNA has an ion atmosphere rich in
counterions depleted in co-ions
42Ligand binds DNA
DPBx
Release of Z counterions upon binding creates
disorder entropy This is a favorable reaction
43Ligand binds DNA
Moves
Rebinds
Ptn exchanges to new site Counterions rearrange
back to ion cloud Upon binding to new contact
site, counterions in cloud get redistributed
44Ligand finds DNA specific sequence
DPBx
- Rapid exchange between sites stops when DPBx
finds a high affinity, sequence specific site it
likes - Usually involves base specific contacts that
either alter structure of protein or (more
likely) bring specific domains of ligand into
play at DNA target sequence.
45Reaction
Add Counterions
Dissociate
46METHOD How one finds DBPs?
- Goal Find whether a protein binds a specific
sequence you believe is regulatory site - You have a 10 bp sequence (in a 100 bp fragment
- Carry out Electrophoretic Mobility Shift Assay
(EMSA)
47EMSA
- The EMSA technique proteinDNA complexes
migrate more slowly than free DNA in
non-denaturing gel electrophoresis ( low ionic
strength gels) - Complexes Shift (retarded) upon protein binding
assay also referred to as a gel shift or gel
retardation assay. - Early expts on proteinDNA interactions
primarily used nitrocellulose filter-binding
assays - Advantages of EMSA
- resolves complexes of different stoichiometry
(and conformation). - Works with crude extracts purified preparations
- Can be used in conjunction with mutagenesis
identify key binding sequence in any regulatory
region. - EMSAs can also be utilized quantitatively to
measure thermodynamic and kinetic parameters. - Combined with antibodies to characterize
specificity
48EMSA
- Ability to resolve complexes depends on stability
of the complex during the brief time
(approximately 30 minutes) it is migrating into
the gel. - Sequence-specific interactions are stabilized by
low ionic strength - Upon entry into the gel, proteins quickly
resolved from free DNA - Freezing the equilibrium between bound and free
DNA. - In the gel, the complex may be stabilized by
caging effects of the gel matrix, meaning that
if the complex dissociates, its localized
concentration remains high, promoting prompt
reassociation. - Even labile complexes can often be resolved by
this method.
49Critical EMSA Reaction Parameters
50Target DNA (probe)
- Linear DNA fragments containing binding
sequence(s) used in EMSAs. - Labeling Probe
- 5 end label with g-32P-ATP and polynucleotide
kinase - 3 end label with Fill-in reactions a-32P-
dXTP. - Need to have high specific activity probe (at
least 1 x 106 cpm/ug) - EMSA binding expts use about 5 -10 ng of DNA
probe (ca. 10,000 cpm) - Non-Radioactive detection DNA biotinylated then
probe with chemiluminescent substrate. - If the target DNA is short (20-50 bp) oligo
bearing the specific sequence work well (annealed
to form a duplex).
51Target DNA (probe)
- Some DNA/ptn complexes involve multiprotein
complexes - Requires multiple proteins and often longer DNA
fragments to accommodate multiprotein complexes - Larger DNA probes (100-500 bp) a restriction
fragment or PCR product is used to prepare probe - DNA/Ptn complexes result in retarded mobility in
the gel. - Circular DNA probes (e.g., minicircles of 200-400
bp) complexes may migrate faster than the free
DNA. - Gel shift assays are also good for resolving
altered or bent DNA conformations that result
from the binding of certain protein factors. -
- Gel shift assays work with RNAprotein
interactions and peptideprotein interactions.
52Non-Specific Competitor DNA
Limiting
Excess
Nonspecific competitor DNA poly(dIdC) or
poly(dAdT) minimizes binding of nonspecific
proteins to the labeled target DNA. These
repetitive polymers do the following -provide an
excess of nonspecific sites to adsorb proteins in
crude lysates that will bind to any general DNA
sequence. -provide a 3-D intersegment transfer
structure for the specific DBP to
act Non-competitor is usually present in
100-1000 fold excess Example 10 ng of labeled
probe 1000-5000ng ng of cold competitor
53Real Data
- Shows self competition
- Rxn contains 1 -2 ng of EBNA DNA probe (32P
Label) and 1 ug polydI-dC cold competitor. - Self competition in lane 3 added 2 ng of cold
EBNA DNA (loss of complex) - Adding 2 ng of heterologous DNA (Oct-1) no
dissociation
54Competition Expt
Heterologous cold DNA
Complex amount
Homologous probe cold probe
DNA Concentration
55Binding Reaction Components
- Factors that affect the strength and specificity
of the proteinDNA interactions - Ionic strength
- pH
- Nonionic detergents, glycerol or carrier proteins
(e.g., BSA), - Divalent cations (e.g., Mg2 or Zn2)
- Concentration and type of competitor DNA present,
- Temperature and time of the binding reaction.
- If a particular ion, pH or other molecule is
critical to complex formation in the binding
reaction, it is often included in the
electrophoresis buffer to stabilize the
interaction prior to its entrance into the gel
matrix.
56Ionic strength
Add Counterions
Dissociate
Usually Keep ionic strength (total z)
LOW. Note Preparing a crude extract from
nuclei, requires HIGH SALT EXTRACTS WHY?
57Applications
- Supershift Reactions To identify ligand and DNA
- Antibody Binds ligand in complex and
supershifts - Antibody may disrupt the proteinDNA interaction
- Proper controls will reveal such negative
results. - Supershifts could include other secondary or
indirectly bound proteins as well. - An alternative identification process would be to
perform a combination Shift-Western blot. - Transfer complexes to stacked nitrocellulose and
anion exchange membranes as blots. - Blot probed with a specific antibody (Westerm)
while autoradiography or chemiluminescent
techniques can detect the DNA captured on the
anion-exchange membrane/
58AB
59SPLICING
60Rate 40nt/sec
Poly A, 5 cap
- Eukaryotic genes are mosaics of Int (non coding)
and Exons (coding) - Exons typically small (150 bp average)
- Introns can be small or huge and MANY
- DHFR Gene 31 kb, 6 exons, 2 kb mRNA (coding DNA
lt10)
61RNA Splicing
- Primary transcript pre-mRNA
- Must be processed
- Splicing converts pre-mRNA to mRNA
- Alternative splicing can increase gene diversity
- Estimated 60 of genes are alt. spliced!
- One gene could encode 1000s of splice variants!
- Accuracy is CRITICAL, mistakes not tolerated
62Mechanisms
- Consensus sequences in the transcript are key to
precise splicing outcomes
Consensus site _at_ splice junctions HIGHLY
conserved especially GU and AG
Branch point mid intronnear poly Pyr tract
Donor site
Acceptor Site
NOTE THAT THE CONSENSUS ELEMENTS ARE IN INTRONS
AND NOT EXONS (CONSTRAINED BY CODING SEQUENCE)
63Intron excision involves formation of a lariat
structure
- Splicing is a continuum
- 2 successive transesterifications
- Phosphodiester linkages break/reseal in a coupled
reaction - Rxn can be visualized as a 2-step process
- 1st is 2OH at conserved A residue
- 2nd is formation of lariat and splice product
64Nucleophilic attack _at_ P
Result Freed 5 end of intron joins A to make
the branch site in lariat
1st rxn
3 way junction2 OH Link at A
Nucleophilic attack _at_ P in splice site junction
2nd rxn
2 Products are made as a result
65Key points
- No net increase in phosphodiester bonds
- 2 bonds are broke and 2 are made
- No energy input required in transesterification
reactions - However, ATP is consumed
- Required for maintenance/assembly of splicing
machinery in vivo
66If no net energy input, what makes splicing
reaction irreversible?
- Entropically driven by
- Breaking a single RNA transcript in two creates
disorder (favorable) - Rearrangement of ion clouds in process
- Exicised intron rapidly degraded
- Thus, cannot go back or reverse the splicing
reaction
67Trans-splicing
- Exons from different transcripts are fused
- Rare in animals but does occur
- More common in C. elegans, trypanosomes
No lariat a Y structure is formed instead
68Splicesomes
- Large complexes or molecular machines carry out
splicing in vivo
69Splicing machines RNPs
- gt150 proteins
- 5 RNAs
- Small nuclear RNAs (snRNAs) U1,2,4,5,6
- Ca. 100 and 300 nt long complexed with protein
(snRNP or snurps) - RNPs and misc. ptns come and go in process
- Process mediated primarily by RNA catalysis with
protein support - Akin to a ribosome
70snRNP Roles
- Recognize 5 splice site and branch site
- Bring these sites into proximity
- Catalyze the splicing reaction
Discuss in detail
RNA-RNA RNA-protein Protein-Protein
71- Different snRNPs recognize same (or overlapping)
sites in transcript - Here U1 and U6 shown to bind to splice site
(donor)
72- snRNP U2 binds branch site
73- RNA pairing between snRNP U2 amd U6 is shown
- Brings 5 splice site and branch site into
proximity
74Branch point binding protein
- Here BBP (not part of splicesome) recognizes A
region and is displaced by U2 during the reaction
sequence
75Other protein roles
- U2AF binds poly-pyr tract helps BBP bind to
branch - RNA-annealing factors
- Help load snRNPs onto transcript
- DEAD Box helicases
- Use ATPase to dissociate RNA duplexes
- Facilitate alternative RNA-RNA interactions
76- Mechanistic overview
- U1 snRNP binds 5 splice site
- U2AF binds Pyr tract and 3 splice site (U2AF has
2 subunits) - U2AF interacts with BBP to help stabilize this
interaction - U2 snRNA binds A branch site and displaces BBP
A complex - A residue extrudes and made available to bond w.
5 splice site - A complex reorganized to bring together all 3
splice sites - U4 and U6 snRNAs along with U5 join to form the
tri-snRNP complex - Entry of tri-snurp complex defines formation of
B complex - 7. U1 exits and is replaced by U6 ( C complex)
or active site.
A complex
B complex
U4 exits and U2 takes over to complete
order not well known
77How did splicing evolve?
- Its complicated lots of players
- Probably evolved from self splicing mechanisms
with catalytic RNA - Summary of 3 classes of RNA Splicing
78Nuclear pre-mRNA
- Abundance
- Very common used in most eukarya
- Mechanism
- Transesterifications branch A site
- Catalytic mechanism
- Major spliceosome
79Group II Introns
- Abundance
- Rare some eukaryotic genes from organelles
- Prokaryotic mechanism
- Mechanism
- Transesterifications branch A site
- Catalytic mechanism
- RNA encoded by intron ( Ribozyme mediated)
80Group I Introns
- Abundance
- Rare nuclear rRNA in some eukaryotes
- Organelles genes
- A few prokaryotic genes
- Mechanism
- Transesterifications branch G site
- Catalytic mechanism
- RNA encoded by intron ( Ribozyme mediated)
- NOTE Not a true enzyme catalytic event!
mediate only one round of events