Genes and Genomes - PowerPoint PPT Presentation

About This Presentation
Title:

Genes and Genomes

Description:

Affymetrix: 60,000 human genes on GeneChips? Incyte: over 120,000 genes? ... Enhancer signals. Poly-A insertion sites. 9/18/09. U. Hilgert. Genes. Genomes. 24 ...

Slides: 25
Provided by: Shir185

Transcript and Presenter's Notes

Title: Genes and Genomes


1
  • Genes and Genomes

2
Francis Collins, HGP
Craig Venter, Celera Inc.
3
(No Transcript)
4
Hierarchical vs. Whole Genome Shotgun
5
(No Transcript)
6
Raw Genome Data
7
Sequences complete
8
How many genes?
9
Celera says that there are only 30,000 genes
  • Affymetrix: 60,000 human genes on GeneChips?
  • Incyte: over 120,000 genes?
  • GenBank: 49,000 gene coding sequences?
  • UniGene: > 89,000 clusters of unique ESTs?

10
What are genes?
11
Eukaryotic Genomes
12
Eukaryotic Genes
13
Dogmas
One Gene, One Protein
Predates the description of the chemical
structure of DNA by Watson and Crick (1953) and
even the identification of DNA as the molecule of
inheritance.
14
A Dynamic Concept
  • Introns/exons
  • Posttranscriptional modifications
  • Alternative splicing
  • Differential expression
  • Genes-in-genes
  • Genes-ad-genes
  • Posttranslational modifications
  • Multi-subunit proteins

15
Current consensus
  • 15,000 known genes (similarity to previously
    isolated genes and expressed sequences from a
    large variety of different organisms)
  • 17,000 predicted (GenScan, GeneFinder, GRAIL)
  • Based on and limited to previous knowledge

16
Sources of Complexity
  • Gene number (2-3 fold over worm and fly)
  • Alternative splicing (ca. 3 transcripts per gene
    vs. 1.3 for worm)
  • Different organization (domains, subunits)
  • New architecture (CNS, brain complexity)
  • New abilities (Cognitive abilities)
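
The first two bullets can be combined into a rough estimate of how many more distinct transcripts the human genome produces than the worm genome. The sketch below uses only the figures on this slide (a 2-3 fold gene count, about 3 transcripts per human gene, 1.3 per worm gene). The function name is hypothetical, and multiplying the two factors is a simplifying assumption, not a claim made in the presentation.

```python
def transcript_ratio(gene_fold: float,
                     human_tx_per_gene: float = 3.0,
                     worm_tx_per_gene: float = 1.3) -> float:
    """Ratio of human to worm transcript counts.

    gene_fold is the human/worm gene-number ratio (2-3 on the slide).
    Assumes transcript count = gene count * transcripts per gene.
    """
    return gene_fold * human_tx_per_gene / worm_tx_per_gene


low = transcript_ratio(2.0)   # lower bound of the 2-3 fold range
high = transcript_ratio(3.0)  # upper bound of the 2-3 fold range
print(f"Human/worm transcript ratio: {low:.1f}x to {high:.1f}x")
```

Under these assumptions a 2-3 fold difference in gene number becomes roughly a 5-7 fold difference in transcripts, which is the sense in which alternative splicing adds complexity beyond gene count.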

17
Challenge and Opportunity
  • To locate all of the genes in the human genome
    and describe their functions may take another
    15-20 years!

18
What's the trouble?
  • 1-3% coding sequences
  • large number and long stretches of repetitions
  • pseudogenes
  • highly specific, rarely expressed genes
  • paralogous genes
  • regulatory regions
  • short RNAs
  • first and last exons
  • splice variations

19
What does it mean?
  • 30,000 - 35,000 genes
  • Average coding length 1.4 kb
  • Average gene extent 30 kb
  • Average gene density 11.5/1 Mb
  • Y chromosome 6.4/1 Mb
  • Chromosome 19 26.8/1 Mb
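
As a sanity check, the figures on this slide can be tied together with simple arithmetic. The sketch below assumes a genome size of about 3,000 Mb; that value does not appear on the slides and is an assumption, as are all function and variable names.

```python
GENOME_MB = 3000.0  # ASSUMPTION: approximate human genome size, not from the slides

# Figures taken from the slide
GENE_COUNT_RANGE = (30_000, 35_000)
AVG_CODING_KB = 1.4
AVG_GENE_EXTENT_KB = 30.0
AVG_DENSITY_PER_MB = 11.5


def fraction_of_genome(genes: int, kb_per_gene: float,
                       genome_mb: float = GENOME_MB) -> float:
    """Fraction of the genome covered by `genes` segments of `kb_per_gene` kb each."""
    return genes * kb_per_gene / 1000.0 / genome_mb


def genes_from_density(density_per_mb: float,
                       genome_mb: float = GENOME_MB) -> float:
    """Total gene count implied by an average density in genes per Mb."""
    return density_per_mb * genome_mb


for n in GENE_COUNT_RANGE:
    coding = fraction_of_genome(n, AVG_CODING_KB)
    extent = fraction_of_genome(n, AVG_GENE_EXTENT_KB)
    print(f"{n} genes: coding {coding:.1%}, gene extent {extent:.0%}")

print(f"Genes implied by density: {genes_from_density(AVG_DENSITY_PER_MB):.0f}")
```

With the assumed genome size, the density of 11.5 genes per Mb implies about 34,500 genes, inside the 30,000 - 35,000 range, and the coding fraction comes out near 1.5%.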

20
Raw Genome Data
21
(No Transcript)
22
(No Transcript)
23
It's all about patterns
  • Promoters
  • Open reading frames (ORF)
  • Transcriptional and translational start and stop
    sites/codons
  • Intron splice sites
  • Enhancer signals
  • Poly-A insertion sites
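
To make the idea of pattern-based gene finding concrete, here is a minimal sketch of one such pattern: scanning a DNA string for open reading frames, meaning an ATG start codon followed in the same frame by a stop codon. It is a toy illustration that assumes the standard genetic code, searches the forward strand only, and ignores introns; it is not how GenScan, GeneFinder, or GRAIL work. The function name and the `min_codons` parameter are hypothetical.

```python
from typing import List, Tuple

START_CODON = "ATG"
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def find_orfs(seq: str, min_codons: int = 2) -> List[Tuple[int, int, str]]:
    """Return (start, end, orf_sequence) for forward-strand ORFs.

    start is the 0-based index of the A in ATG; end is the index just
    past the stop codon. min_codons counts both start and stop codons.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):                      # three forward reading frames
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == START_CODON:
                start = i                       # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                end = i + 3
                if (end - start) // 3 >= min_codons:
                    orfs.append((start, end, seq[start:end]))
                start = None                    # look for the next ORF
    return orfs


for start, end, orf in find_orfs("CCATGAAATAGGG"):
    print(start, end, orf)
```

In this example the only ORF is ATGAAATAG, in the third reading frame. Real gene prediction also has to weigh the other signals listed above (promoters, splice sites, enhancers, poly-A sites), which is why purely pattern-based searches miss or mispredict many genes.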

24
(No Transcript)