Title: Molecular Genetics, Biology And Informatics
1Molecular Genetics, Biology And Informatics
2Biology at the Movies
- Ian was right Life did find a way
- -- Michael Creighton (movie adaptation of
Jurassic Park)
3Biology at the Extremes
- Life is very creative
- archeans able to live at 120 degrees Celcius
- organisms able to transduce solar energy to
chemicals - creatures able to recycle its own waste products
to amino acids - creatures able to lift 50x their weight
- creatures able to contemplate themselvesCogito
ergo sum
4So where are the threads of commonality?
- Evolution
- Molecular Level genetic material
5Genetic Materials exert their influence far and
wide
- Cell function
- Metabolism and homeostasis
- Development
- Apoptosis
- Reproduction
- Etc.
6Natures creativity variations on variations
- Motifs
- Gene structures
- Sequence similarities
- Functional adaptation
- Alleles
- Etc.
7Bioinformatics tries to make sense of natures
methodology
- Underlying themes gene structure
- Clustering of particular biological activities
microarray analyses, proteomics, physionomics - Defining motifs structure predictions
- Understanding the direction of change --
phylogenetics - Understanding the influence of molecular
phenomena QTLs and diseases
8Overview
- DNA and RNA and Proteins
- The Central Dogma of Molecular Biology
- Components of the gene
- Transcription and RNA processing
- Translation and post translational modifications
- Organization of DNA into chromosomes
- Genomes
9DNA
- Deoxyribonucleic acid
- Double stranded backbone is made of sugar
phosphate moieties - Strands connected by nitrogen containing
nucleotide bases
10DNA
11DNA
12DNA and complementarity
13DNA is capable of self-replication information
is copied
14Organisms pass on information via DNA
15Information in DNA can change evolution and
phylogeny
16Information in DNA can be read gene expression
and RNA
17RNA
- Ribonucleic acid
- Thymine -gt Uracil
- Types
- mRNA
- tRNA
- rRNA
- miRNA
18RNA
- Single stranded
- Less stable than DNA
- Folds into complex structures
- Older than DNA
- Some RNAs posses catalytic activities to perform
self-splicing
19Proteins
20Protein Secondary Structures
- Alpha Helices
- Beta Sheets
21Protein Tertiary Structures
22Protein Quaternary Structures
23Information Flow the Central Dogma of Molecular
Biology
- The transfer of information from DNA to make a
protein - Transcription of DNA into mRNA
- Translation convert information encoded by mRNA
into amino acid sequence
24Central Dogma, cont.
- Tremendous number of enzymes needed to
- Uncoil DNA
- Transcribe
- Repair mismatches
- Splice exons together
- Transfer protect unstable mRNA
- Assemble amino acids
- Fold protein
- Etc, etc.
25DNA
sense
GGTCTTCTTGGGTGAGAATAACTTTAA
5
3
5
3
anti-sense
CCAGAAGAACCCACTCTTATTGAAATT
26Transcription
sense
GGTCTTCTTGGGTGAGAATAACTTTAA
5
3
5
3
anti-sense
CCAGAAGAACCCACTCTTATTGAAATT
Transcription of anti-sense strand RNA polymerase
GGUCUUCUUGGGUGAGAAUAACUUUAA
5
RNA
27Transcription
- Precisely controlled process
- Signals that control process part of the DNA
sequence Genes - Signals interact with various molecules to effect
trascription
28Gene Structure
29Existence of Genes
- Want to identify potentially new genes
- Typical coding gene structure
- Non-coding genes vary more
30Gene Elements
- Promoters/Repressors
- Enhancers
- Terminators
- Splice sites
- Removal of introns
- Exonic Splice Enhancers/Repressors
- Binding sites
- Gene-Gene, Gene-RNA, Gene-Protein interactions
31Splicing Alternative Splicing
- Each exon is translated into mRNA sequence
- These are trimmed and spliced together
- The order of the splicing is crucial
- The same gene can be spliced in more than one
way, producing multiple transcriptional units
(TUs) - Some researchers have proposed counting TUs
rather than genes or loci to determine the number
of genes in a given genome (this is why there is
much controversy about counting genes)
32Translating mRNA to proteins
- Occurs in the ribosome
- tRNA carries amino acids from cytoplasm to
ribosome - mRNA is the template for translation
codon
stop
mRNA
GGUCUUCUUGGGUGAGAAUAACUUUAA
5
G L L G
amino acid sequence
33(No Transcript)
34Translation
- Tightly controlled process involving plenty of
control proteins and rRNAs - Precisely defined starting position
Shine-Dalgarno sequence in prokaryotes. - Precisely defined ending points
35Post translational processing
- Protein folds into a precise configuration
- Protein maybe chemically modified to control
activity - Protein interacts with other molecules to control
activity, supply active ingredients - Activity may also be controlled by degradation or
compartmentalization
36Information is in the sequence the secondary,
tertiary and quaternary structure heavily
dependent on the primary structure or the sequence
- Amino acids determine secondary structure due to
propensities to form helixes and sheets - Tertiary structure dependent on the interactions
of helices and sheets, forces for interaction
dependent on the nature and composition of the
amino acids making up the secondary structures - Ditto for quarternary structures and features
that signal for post translational modifications
37Genomes and Chromosomes
- Complete set of instructions for making an
organism almost - Prokaryotic genomes typically composed of one
chromosome - Human genome contains 3Bbp organized into 24
distinct units called chromosomes - Humans have 22 pairs of autosomes and 1 pair of
sex chromosomes (XX in females, XY in males) - Morphological examination of eukaryotic
chromosomes possible - Chromosomes contain roughly equal quantities DNA
and protein - Chromosomes contain DNA tightly wound around
protein cores
38Chromosomes and Genomes
- Genetic information is passed along packaged in
chromosomes - Genes are arranged linearly along the chromosomes
- Arrangement of genes and other elements within
the chromosome is critical to proper functioning
of the organism - Inheritance patterns and the resulting
characteristics of the organism present another
area for bioinformatic analysis -- QTLs
39Genomes are dynamic
40Genomes are dynamic
E.coli K-12
E.coli O157 sakai
- Rearrangements
- Inversions
- Duplications
- Deletions
- Recombinations
- Transposition
S.typhi CT18
S.typhimurium LT2
IS maps
41Genomes are dynamic
4.1 3.4
3.6-4.2
1.7 1.2
1.2 - 1.8
1.3 -1.8
42Recap biological sequences can yield a lot of
information to bioinformatic scrutiny
- Composition and structure of genomic sequences
- Sequence similarity and divergence recurring
themes and phylogeny - Structure of genes
- Coding (protein producing)
- Non-coding (functional RNA)
- Predict structure
- RNA secondary/tertiary structure
- Protein secondary/tertiary structure
43Is sequence everything? Dont bet on it, but
its a great place to start
- Proliferation of the omics
- Proteo
- Pheno
- Physio
- Pharamacogeno