The Genome Access Course Electronic Sequences - PowerPoint PPT Presentation

About This Presentation
Title:

The Genome Access Course Electronic Sequences

Description:

The Genome Access Course Electronic Sequences – PowerPoint PPT presentation

Number of Views:23
Avg rating:3.0/5.0
Slides: 15
Provided by: james858
Category:

less

Transcript and Presenter's Notes

Title: The Genome Access Course Electronic Sequences


1
TheGenomeAccessCourseElectronic Sequences
Insulin, sequenced in 1955
2
This is a sequence
  • MALWTRLRPLLALLALWPPPPARAFVNQHLCGSHLVEALYLVCGERGFFY
    TPKARREVEGPQVGALELAGGPGAGGLEGPPQKRGIVEQCCASVCSLYQL
    ENYCN

So is this
ACCATGATTACGCCAAGCTTGCATGCCTGCAGGTCGGCTGCATTCGAGGC
TGCCAGCAAGCAGGTCCTCGCAGCCCCGCCATGGCCCTGTGGACACGCCT
GCGGCCCCTGCTGGCCCTGCTGGCGCTCTGGCCCCCCCCCCCGGCCCGCG
CCTTCGTCAACCAGCATCTGTGTGGCTCCCACCTGGTGGAGGCGCTGTAC
CTGGTGTGCGGAGAGCGCGGCTTCTTCTACACGCCCAAGGCCCGCCGGGA
GGTGGAGGGCCCGCAGGTGGGGGCGCTGGAGCTGGCCGGAGGCCCGGGCG
CGGGCGGCCTGGAGGGGCCCCCGCAGAAGCGTGGCATCGTGGAGCAGTGC
TGTGCCAGCGTCTGCTCGCTCTACCAGCTGGAGAACTACTGTAACTAGGC
CTGCCCCGACAAATAAACCCTTACGAGCAAG
3
Where do sequences come from?
  • Individual researchers
  • Genome sequencing projects
  • Patent applications

4
Where do the sequences go?
  • GenBank (NCBI)
  • EMBL Nucleotide Sequence Database
  • DDBJ (DNA Data Bank of Japan)

5
How can sequences be obtained?
  • ENTREZ
  • Batch ENTREZ
  • SRS (Sequence Retrieval System)
  • getentry

6
What is available in Entrez?
  • PubMed
  • Protein
  • Nucleotide
  • Structure
  • Genome
  • PopSet
  • OMIM
  • Taxonomy
  • Books
  • ProbeSet
  • 3D Domains

7
ENTREZ Cross-references
8
Sequence Formats
  • Raw
  • Fasta
  • ASN.1
  • GenBank/GenPept
  • DDBJ
  • Ensembl
  • Graphics
  • XML

- convertible in ReadSeq
9
Other Sequence Formats
  • GCG
  • DNA Strider
  • Intelligenetics
  • NBRF

- convertible in ReadSeq
10
Multiple Sequence Formats
  • MSF
  • Phylip
  • PAUP
  • Fitch
  • Pretty

- convertible in ReadSeq
11
Converting Sequence Formats
  • READSEQ
  • SEQIO
  • GCG e.g. FROMEMBL, TOFASTA, etc.

12
How to work with sequences
  • Cut paste sequences
  • Save files as text from sequence repositories
  • Unix vs. Windows format

13
Batch ENTREZ
  • A method for obtaining large numbers of
    sequences by supplying a file containing a list
    of GI or accession numbers.

14
ENTREZ Filters
  • nucleotide allFilter NOT specimen-voucherAll
    Fields
  • You can eliminate most of the BAC-type records
    from the default nucleotide database with the
    following query nucleotide allFilter NOT
    htgKeyword
Write a Comment
User Comments (0)
About PowerShow.com