Title: SAGE data in StemBase
1SAGE data in StemBase
- Christopher Porter
- Ottawa Health Research Institute
2Presentation outline
- SAGE protocol
- SAGE analysis
- Integration with Affymetrix data
- Access to SAGE data in StemBase
3Basics of SAGE
- Identification and quantitation of mRNAs in a
mixed population by generation of a (usually)
unique sequence tag. - Assumes that tags are generated in proportion to
mRNA abundance in the population
4(No Transcript)
5Sequence to Tags
6Library database
----------------------------- tagSeq
tagCount -----------------------------
CTCGAGTTTCTTCCTGT 58
GGACAGCCGGGGATGCT 1
GTGGCTCACAACCATCT 461
TGAAGAGAGATATATCA 3
TGACAGAGCCAGGGCTC 6
TGGGAAGTGTGATTTCT 92
TTAATATTTAATTAGAG 56
TTTTATTTATATTCAAG 2 ------------------
-----------
GTGGCTCACAACCATCT TGACAGAGCCAGGGCTC TTAATATTTAATTA
GAG TGGGAAGTGTGATTTCT TTTTATTTATATTCAAG GGACAGCCGG
GGATGCT CTCGAGTTTCTTCCTGT TGAAGAGAGATATATCA
7SAGE tag identification
- Match to tags predicted from known sequences
- e.g. SAGEMap
- Generate mappings from cDNA sequences
8Finding tags for a gene
gtgi7305398refNM_013633.1 Mus musculus POU
domain, class 5, transcription factor 1
(Pou5f1), mRNA GTGAGCCGTCTTTCCACCAGGCCCCCGGCTCGGGG
TGCCCACCTTCCCCATGGCTGGACACCTGGCTTCA GACTTCGCCTCCTC
ACCCCCACCAGGTGGGGGTGATGGGTCAGCAGGGCTGGAGCCGGGCTGGG
TGGATT CTCGAACCTGGCTAAGCTTCCAAGGGCCTCCAGGTGGGCCTGG
AATCGGACCAGGCTCAGAGGTATTGGG GATCTCCCCATGTCCGCCCGCA
TACGAGTTCTGCGGAGGGATGGCATACTGTGGACCTCAGGTTGGACTG G
GCCTAGTCCCCCAAGTTGGCGTGGAGACTTTGCAGCCTGAGGGCCAGGCA
GGAGCACGAGTGGAAAGCA ACTCAGAGGGAACCTCCTCTGAGCCCTGTG
CCGACCGCCCCAATGCCGTGAAGTTGGAGAAGGTGGAACC AACTCCCGA
GGAGTCCCAGGACATGAAAGCCCTGCAGAAGGAGCTAGAACAGTTTGCCA
AGCTGCTGAAG CAGAAGAGGATCACCTTGGGGTACACCCAGGCCGACGT
GGGGCTCACCCTGGGCGTTCTCTTTGGAAAGG TGTTCAGCCAGACCACC
ATCTGTCGCTTCGAGGCCTTGCAGCTCAGCCTTAAGAACATGTGTAAGCT
GCG GCCCCTGCTGGAGAAGTGGGTGGAGGAAGCCGACAACAATGAGAAC
CTTCAGGAGATATGCAAATCGGAG ACCCTGGTGCAGGCCCGGAAGAGAA
AGCGAACTAGCATTGAGAACCGTGTGAGGTGGAGTCTGGAGACCA TGTT
TCTGAAGTGCCCGAAGCCCTCCCTACAGCAGATCACTCACATCGCCAATC
AGCTTGGGCTAGAGAA GGATGTGGTTCGAGTATGGTTCTGTAACCGGCG
CCAGAAGGGCAAAAGATCAAGTATTGAGTATTCCCAA CGAGAAGAGTAT
GAGGCTACAGGACACCTTTCCCAGGGGGGGCTGTATCCTTTCCTCTGCCC
CCAGGTCC CCACTTTGGCACCCCAGGCTATGGAAGCCCCCACTTCACCA
CACTCTACTCAGTCCCTTTTCCTGAGGGC GAGGCCTTTCCCTCTGTTCC
CGTCACTGCTCTGGGCTCTCCCATGCATTCAAACTGAGGCACCAGCCCTC
CCTGGGGATGCTGTGAGCCAAGGCAAGGGAGGTAGACAAGAGAACCTGG
AGCTTTGGGGTTAAATTCTTT TACTGAGGAGGGATTAAAAGCACAACAG
GGGTGGGGGGTGGGATGGGGAAAGAAGCTCAGTGATGCTGTT GATCAGG
AGCCTGGCCTGTCTGTCACTCATCATTTTGTTCTTAAATAAAGACTGGAC
ACACAGT
5
4
3
2
1
0
9Tags in database
-----------------------------------------------
--------------------------------------------------
- tagSeq rank geneName
-----------------------------------------
--------------------------------------------------
------- CATTCAAACTGAGGCAC 0 Mus
musculus POU domain, class 5, transcription
factor 1 (Pou5f1), mRNA TTTCTGAAGTGCCCGAA
1 Mus musculus POU domain, class 5,
transcription factor 1 (Pou5f1), mRNA
TGTAAGCTGCGGCCCCT 2 Mus musculus POU
domain, class 5, transcription factor 1 (Pou5f1),
mRNA AAAGCCCTGCAGAAGGA 3 Mus musculus
POU domain, class 5, transcription factor 1
(Pou5f1), mRNA TCCGCCCGCATACGAGT 4 Mus
musculus POU domain, class 5, transcription
factor 1 (Pou5f1), mRNA GCTGGACACCTGGCTTC
5 Mus musculus POU domain, class 5,
transcription factor 1 (Pou5f1), mRNA
---------------------------------------------
--------------------------------------------------
---
10Finding tags in genomic sequence
gt1 dnachromosome chromosomeNCBIM361472955047
295911 GAAACTGGCTCAGTGTAGCCATGAAGTCCAGGCCACTAACC
T
- 24,837,922 tags generated
- Tags observed at 1 - 36,168 locations
- 96 of tags are from a single location
11Associating tags with probesets
12(No Transcript)
13(No Transcript)
14(No Transcript)
15(No Transcript)
16(No Transcript)
17(No Transcript)
18(No Transcript)
19(No Transcript)
20(No Transcript)
21(No Transcript)
22UCSC Genome Browser controls
23(No Transcript)
24(No Transcript)
25Conclusion
- Please contact ogicinfo_at_ohri.ca if you have any
comments, corrections or questions. - See associated bibliography for references from
this presentation and further reading. - Thanks for your attention!
26Matching genes to tags
- SAGEMap (NCBI)
- From UniGene/ESTs
- 1,074,067 tags
- Build your own
- from RefSeq
- 306,970 tags, 28,903 rank 0 tags
- from Ensembl mRNA
- 322,076 tags, 34,050 rank 0 tags
- from Ensembl genomic sequence
- 24,453,442 tags
27Use of SAGE or Affymetrix
28Tags in different databases
- Pou5f1 (Oct-4)
- RefSeq
- 5 tags
- SAGEMap
- 20 tags
- Nucleolin (Ncl)
- RefSeq
- 9 tags
- SAGEMap
- 223 tags
29What do SAGE data look like?
30How are SAGE data analysed
31Computational generation of SAGE tag libraries
32What SAGE tag libraries are available
33How can SAGE data be associated with Affy data
34SAGE libraries in StemBase
35Associating SAGE with Affy in StemBase
36(No Transcript)