How we analyzed proteomic data - PowerPoint PPT Presentation

About This Presentation
Title:

How we analyzed proteomic data

Description:

Digest to peptide fragment. MS analysis. Unexpected variation between gels ... Flicker (NCI, through internet) Spot detection ... – PowerPoint PPT presentation

Number of Views:98
Avg rating:3.0/5.0
Slides: 67
Provided by: chm1
Category:

less

Transcript and Presenter's Notes

Title: How we analyzed proteomic data


1
How we analyzed proteomic data?
  • Session III

??????????
2
Topics for session III
  • Image analysis (for 2-DE gel)
  • Mass data analysis
  • Protein structure analysis

3
1. Image analysis
4
Examples of 2-DE results
Healthy control
Patient
MS analysis
Digest to peptide fragment
5
Unexpected variation between gels
  • Interior variation between 2-DE experiments
  • Same loading amount?
  • Same gel condition?
  • Same staining condition?
  • Exterior variation after gel developed
  • Unwanted spots (dye or reagent deposit)
  • Dirty spots (hair, dust)

6
What image analysis software do?
  • spot detection
  • unwanted spot filtering
  • background subtraction
  • normalization
  • image matching
  • expression comparison
  • pI/MW calibration
  • data organization

7
Currently available 2-DE image analysis software
  • Melanie 4 (Swiss Institute of Bioinformatics,
    SIB)
  • Phoretix 2D (Nonlinear dynamics)
  • Progenesis (Nonlinear dynamics)
  • Z3 and Z4000 (Compugen)
  • Delta2D (Sunergia group)
  • A-GelFox 2D (Alpha innotech)
  • Flicker (NCI, through internet)

8
Spot detection
  • One of the first and most important steps in 2-DE
    analysis.
  • Locating the spots in the gel image
  • defining their shape
  • calculating measurement information (volume and
    area)

9
Spot detection
10
Filtering
  • Removal of unwanted spots
  • Whats unwanted spots
  • dust on gel
  • stain deposit
  • bulky spots

11
Background subtraction
  • General background subtraction method
  • No background
  • Mode of non-spot
  • Manual background
  • Lowest boundary
  • Average boundary

12
Normalization
  • General normalization method
  • Total spot volume
  • Single spot
  • Total volume ratio

13
Image matching
14
Expression comparison
Two fold up or down expression are thought to be
significant.
15
pI/MW calibration
observed or experimental pI/MW
PI calibration
MW calibration
16
Data organization
17
Spot annotation
18
2. Mass data analysis
19
Useful proteomic resource
http//tw.expasy.org/
20
Useful proteomic resource
21
Matrix Science - Mascot
http//www.matrixscience.com/
22
Three major functions in Mascot
  • Peptide Mass Fingerprint (PMF) The experimental
    data are a list of peptide mass values from an
    enzymatic digest of a protein. (MALDI-TOF)
  • Sequence Query One or more peptide mass values
    associated with information such as partial or
    ambiguous sequence strings, amino acid
    composition information, MS/MS fragment ion
    masses, etc. A super-set of a sequence tag query.
  • MS/MS Ion Search Identification based on raw
    MS/MS data from one or more peptides. (LC/MS/MS)

23
Difference between MALDI-TOF and LC/MS/MS
MALDI-TOF
LC/MS/MS
24
2-1 PMF analysis
25
Raw data for PMF
m/z
Relative intensity
899.2076 2980.8123 905.2126 1471.3723 909.1917 231
2.2317 915.2181 1533.8486 925.4637 1881.7635 938.3
972 1528.9462 1044.3007 2111.9482 1050.3141 2396.1
550 1060.2797 4689.0698 1066.2889 7302.0029 1072.2
991 5688.8511 1078.3169 8919.1113 1084.2657 1474.5
900 1088.2793 3180.5122 1094.2947 4573.5195 1104.2
638 1546.4652 1110.2837 1470.9734 1271.3163 1498.0
187
26
Mascot PMF query form
27
Mascot PMF parameters
  • Your name Email
  • Search title
  • Database
  • Taxonomy
  • Enzyme
  • Monoisotopic or Average
  • Modifications
  • Protein Mass
  • Peptide tol.
  • Mass values
  • Missed cleavages
  • Data file
  • Query

28
Database
29
Taxonomy
  • Ensure the hit list will only contain entries
    from the selected species
  • speed up a search
  • bring a weak match

30
Enzyme
31
Enzyme
  • "None" is not an allowed choice for a Peptide
    Mass Fingerprint, where the specificity of an
    enzyme is essential.
  • If the search fails to produce a positive match,
    then try again with semiTrypsin (below) before
    resorting to "None".
  • "semiTrypsin" means that Mascot will search for
    peptides that show tryptic specificity (KR not P)
    at one terminus, but where the other terminus may
    be a non-tryptic cleavage. This is a half-way
    house between choosing "Trypsin" and "None".

32
Monoisotopic or Average
  • nominal mass values calculated from integer
    atomic weights. (H1, C12, N14, O16), not
    practical in proteomics.
  • Average mass values equivalent to taking the
    centroid of the complete isotopic envelope
  • Monoisotopic mass value the mass of the first
    peak of the isotope distribution.

33
Monoisotopic or Average
For peptides and proteins, the difference between
an average and a monoisotopic weight is
approximately 0.06.
Insulin (5.8 kD)
Albumin (66.4 kD)
34
Monoisotopic
Tol 1 Da
Monoisotopic MW
-1.01
35
Average
Tol 1 Da
Average MW
36
Monoisotopic
Tol 2 Da
37
Average
Tol 2 Da
38
Modifications
  • Most protein samples exhibit some degree of
    modification.
  • Natural post-translational modifications
    phosphorylation and glycosylation.
  • Deliberate modifications deliberately
    introduced during sample work-up, such as
    cysteine derivatisation.

39
Modifications
Fixed modifications are applied universally, to
every instance of the specified residue(s) or
terminus. Example Carboxymethyl (Cys) means
that all calculations will use 161 Da as the mass
of cysteine. Variable modifications are those
which may or may not be present. Example if
Oxidation (Met) is selected, and a peptide
contains 3 methionines, Mascot will test for a
match with the experimental data for that peptide
containing 0, 1, 2, or 3 oxidised methionine
residues.
40
Modifications
Fixed modifications are applied universally, to
every instance of the specified residue(s) or
terminus. Example Carboxymethyl (Cys) means
that all calculations will use 161 Da as the mass
of cysteine. Variable modifications are those
which may or may not be present. Example if
Oxidation (Met) is selected, and a peptide
contains 3 methionines, Mascot will test for a
match with the experimental data for that peptide
containing 0, 1, 2, or 3 oxidised methionine
residues.
41
Peptide tol.
The error window on experimental peptide mass
values
42
Missed cleavages
Missed cleavage 0, complete digestion Missed
cleavage gt1, incomplete digestion
43
Submit and processing
44
Concise protein summary
45
protein summary
46
PMF protein view (I)
Protein name
Score and Expect
MW and pI
coverage
47
PMF protein view (II)
Match peptides
RMS error
No match peptides
Protein information
48
2-2 MS/MS analysis
49
Raw data for MS/MS
Parent ion
Daughter ion
50
Mascot MS/MS query form
51
Protein summary
Most possible candidate
52
MS/MS Protein view (I)
The sum of all highest scores within each peptide
group
53
MS/MS Protein view (II)
Protein score The sum of all highest scores
within each peptide group
54
Peptide view
55
3. Protein structure analysis
56
Research Collaboratory for Structural
Bioinformatics (RCSB)
57
Protein data bank (PDB)
Proteosome
58
Example 3D structure for proteosome
59
Example 3D structure for proteosome
60
Example 3D structure for proteosome
61
Example 3D structure for proteosome
62
(No Transcript)
63
(No Transcript)
64
Stereo view
65
Rotation
66
Secondary Structure
Write a Comment
User Comments (0)
About PowerShow.com