Title: Chapter 8Proteomics
1Chapter 8 Proteomics
- ??????????
- ???
- 2004/06/07
2- proteome
- the sum total of an organisms proteins
- genome
- the sum total of an organisms genetic material
38.1 From Genomes to Proteomes
- We want to know
- what proteins are present in cells
- what those proteins do and how they function.
- However, its not easy.
4Why?
- The longevity (??) of an mRNA and the protein it
codes for are very different. - Many proteins are extensively modified after
translation. - Many proteins are not functionally relevant until
they are assembled into larger complexes or
delivered to an appropriate location.
5- Proteins require more careful handling than DNA.
- Function may change.
- Protein identification requires
- mass spectrometric analysis
- specific antibodies.
- Obtaining large numbers of protein molecules
requires chemical isolation for living cells.
68.2 Protein Classification
- Based on
- protein function
- six categories
- evolutionary history structural similarity
- 1000 homologous families
78.2.1 Enzyme Nomenclature
International Union of Biochemistry and Molecular
Biology
88.2.2 Family and Superfamily
- Modern-day proteins may be derived from 1000
original proteins. - folds ? superfamilies ? families
- databases
- SCOP, CATH, DALI
9- fold
- the same major secondary structure topological
connections - superfamily
- probable evolutionary relationships
- family
- clear evolutionary relationships
10(No Transcript)
11(No Transcript)
128.3 Experimental Techniques
- 2D Electrophoresis
- Mass Spectrometry
132D Electrophoresis
liver
kidney
http//tw.expasy.org/cgi-bin/map1
14(No Transcript)
15(No Transcript)
16- Problems
- tens of thousand v.s. thousands
- under presentation of membrane-bound proteins
- difficult to determine exactly which protein is
represented
178.3.2 Mass Spectrometry
- 2D ? mass spectrometry, for identification
188.3.3 Protein Microarrays
- Use antibodies as probes.
- Problems
- Single proteins will interact with multiple
probes. - The binding kinetics of each probe are different.
- Proteins are sensitive to their environment.
198.4 Inhibitors and Drug Design
- development testing of a new drug
- 15 years, US 700 million
- discovery
- target identification
- lead discovery optimization
- toxicology (???)
- pharmacokinetics
- testing
20- HIV protease
- has an active site
- cuts a single, large polypeptide chain into many
proteins.
218.5 Ligand Screening
228.5.1 Ligand Docking
- Determine how two molecules of known structure
will interact. - Three issues
- Identify the energy of a particular molecular
conformations. - Search for the conformation that minimizes the
free energy.
23- How to deal with flexibility in both the protein
and the putative ligand. - Lock and key approaches
- rigid protein structure, flexible ligand
structure - induced fit docking
- flexible in both protein and ligand
24- Softwares
- AutoDock
- FTDock
- DOCK
- Hammerhead
- Gold
- FlexX
258.5.2 Database Screening
- Primary consideration
- complete and accurate search
- with a reasonable computational complexity
- SLIDE
- Fig. 8.4
26(No Transcript)
278.6 X-Ray Crystal Structures
- W. C. Roentgen (1895) discovered X-rays.
- M. von Laue (1912) discovered crystals diffract
X-rays. - D. Hodgkin, etc. (1950s), crystallized complex
organic molecules and determined their
structures.
28- grow a crystal of the protein
29(No Transcript)
30(No Transcript)
31- File formats
- PDB formatted text
- mmCIF (MacroMolecular Crystallographic
Information File)
32- databases resources
- PDB
- PIR
- ExPASy
33- Visualizing Tools
- Fig. 8.8
- RasMol
- Swiss PDB viewer
- VMD (Visual Molecular Dynamics)
- Spock
- Protein explorer
- DINO
348.7 NMR Structures
- 200 amino acids
- the structures determined are not unique
358.8 Empirical Methods and Prediction Techniques
- Example
- Fig. 8.9
- extracting features
- learning, training
- testing
36(No Transcript)
378.9 Post-Translational Modification Prediction
- Remove segments of a protein.
- Covalently attach sugars, phosphates, or sulfate
groups into surface residues. - Cross-link residues within a protein (disulfide
bond).
388.9.1 Protein Sorting
39- associated with membranes
- not associated with membranes
- Table 8.3 (Case 2)
40- PSORT nearest neighbor classifier
- Prediction of protein subcellular localization
- SignalP artificial neural networks
- Prediction of signal peptide cleavage sites
418.9.2 Proteolytic Cleavage
- chymotrypsin
- cleaves polypeptides on the C-terminal side of
bulky and aromatic residues - trypsin
- cleaves on the carboxyl side
- elastase
- cleaves on the C-terminal side of small residues
42- Prediction
- proteasomes, gt 98, by neural network
438.9.3 Glycosylation
- The process of covalently linking an
oligosaccharide to the side chain of a protein
surface residue (???) - N-linked, 75
- O-linked, 85
by neural network
448.9.4 Phosphorylation
- kinases add
- phosphatases remove
- signal
- NetPhos, gt 70, neural network
45?????????
- Fundamental Concepts of BioinformaticsDan E.
Krane and Michael L. Raymer, Benjamin/Cummings,
2003. - Merrian-Webster Dictionary