Title: CS273A
1(No Transcript)
2Lecture 11
- HW1 Feedback (ours)
- (Upcoming Project discuss Wed)
- Non-Coding RNAs
- Halfway Feedback (yours)
3non coding RNAs
4Central Dogma of Biology
5RNA is an Active Player
reverse transcription
long ncRNA
6What is ncRNA?
- Non-coding RNA (ncRNA) is an RNA that functions
without being translated to a protein. - Known roles for ncRNAs
- RNA catalyzes excision/ligation in introns.
- RNA catalyzes the maturation of tRNA.
- RNA catalyzes peptide bond formation.
- RNA is a required subunit in telomerase.
- RNA plays roles in immunity and development
(RNAi). - RNA plays a role in dosage compensation.
- RNA plays a role in carbon storage.
- RNA is a major subunit in the SRP, which is
important in protein trafficking. - RNA guides RNA modification.
- In the beginning it is thought there was an RNA
World, where RNA was both the information carrier
and active molecule.
7RNA Folds into (Secondary and) 3D Structures
AAUUGCGGGAAAGGGGUCAA CAGCCGUUCAGUACCAAGUC UCAGGGGA
AACUUUGAGAUG GCCUUGCAAAGGGUAUGGUA AUAAGCUGACGGACAU
GGUC CUAACCACGCAGCCAAGUCC UAAGUCAACAGAUCUUCUGU UGA
UAUGGAUGCAGUUCA
We would like to predict them from sequence.
Cate, et al. (Cech Doudna). (1996) Science
2731678.
Waring Davies. (1984) Gene 28 277.
8RNA structure rules
- Canonical basepairs
- Watson-Crick basepairs
- G - C
- A - U
- Wobble basepair
- G U
- Stacks continuous nested basepairs.
(energetically favorable) - Non-basepaired loops
- Hairpin loop.
- Bulge.
- Internal loop.
- Multiloop.
- Pseudo-knots
9RNA structure Basics
- Key RNA is single-stranded. Think of a string
over 4 letters, AC,G, and U. - The complementary bases form pairs.
- Base-pairing defines a secondary structure. The
base-pairing is usually non-crossing.
10Ab initio structure prediction lots of Dynamic
Programming
- Maximizing the number of base pairs (Nussinov et
al, 1978)
11Pseudoknots drastically increase computational
complexity
12Nearest Neighbor Model for RNA Secondary
Structure Free Energy at 37 OC
Mathews, Disney, Childs, Schroeder, Zuker,
Turner. 2004. PNAS 101 7287.
13Zukers algorithm MFOLD computing loop dependent
energies
14Energy Landscape of Real Inferred Structures
15Unfortunately
- Random DNA (with high GC content) often folds
into low-energy structures. - What other signals determine non-coding genes?
16Evolution to the Rescue
17(No Transcript)
18Stochastic context-free grammar (SCFG)
S ? aSu L ? aL S ? uSa L ? cL S ? gSc L ? a S
? cSg L ? c S ? L
S
S
L
L
L
L
a a c
g u u
c c c c
u c u
a g a
c
c
- Each derivation tree corresponds to a structure.
19Stochastic context-free grammar (cont)
S ? aSu S ? cSg S ? gSc S ? uSa S ? a S ? c S ?
g S ? u S ? SS 1. A CFG
S ? aSu ? acSgu ? accSggu ? accuSaggu
? accuSSaggu ? accugScSaggu ?
accuggSccSaggu ? accuggaccSaggu ?
accuggacccSgaggu ? accuggacccuSagaggu ?
accuggacccuuagaggu 2. A derivation of
accuggacccuuagaggu
3. Corresponding structure
20(No Transcript)
21MicroRNA
22Genomic context
known miRNAs in human
intergenic
intronic
polycistronic
monocistronic
23tRNA
24tRNA Activity
25(No Transcript)
26(No Transcript)
27Human specific accelerated evolution
rapid change
Human
Chimp
conserved
28Human Accelerated Regions
- Human-specific substitutions in conserved
sequences
rapid change
Human
- HAR1
- Novel ncRNA
- Co-expressed in Cajal-Retzius cells with reelin.
- Similar expression inhuman, chimp, rhesus.
- 18 unique human substitutionsleading to novel
conformation. - All weak (AT) to strong (GC).
Chimp
conserved
Human Derived
Chimp Ancestral
28
Pollard, K. et al., Nature, 2006
Beniaminov, A. et al., RNA, 2008
29Other Non Coding Transcripts
30(No Transcript)
31mRNA
32EST
33lincRNAs (long intergenic non coding RNAs)
34X chromosome inactivation in mammals
X
X
X
Y
35Xist X inactive-specific transcript
36Microarrays, Next Gen(eration) Sequencing etc.
37End Results
38(No Transcript)
39(No Transcript)
40Transcripts, transcripts everywhere
Human Genome
Leaky tx? Functional?
Transcribed (Tx)
Tx from both strands
41Or are they?
42Halfway Feedback