Kein Folientitel - PowerPoint PPT Presentation

1 / 18
About This Presentation
Title:

Kein Folientitel

Description:

Dr. Hans Briem Introduction to Drug Discovery - Lecture Summer Semester 2001 ... potential, , steric fields, hydrophobicity, solvent accessible surface area etc. ... – PowerPoint PPT presentation

Number of Views:61
Avg rating:3.0/5.0
Slides: 19
Provided by: hansb
Category:

less

Transcript and Presenter's Notes

Title: Kein Folientitel


1
Introduction to QSAR (Quantitative Structure
Activity Relationships)
2
Introduction to QSAR
  • Example of a Qualitative Structure Activity
    Relationship (SAR)

Affinity to Serotonin Receptor 1. decreases by
N-alkylation (R1/R2) 2. decreases by
bis-methylation (R3/R4) 3. increases by
methoxylation in R5 and/or R7 4. increases
by adding lipophilic substituents in R6
3
Introduction to QSAR
  • In contrast to Qualitative SAR, Quantitative SAR
    (QSAR) seeks to find a mathematical relationship
    between biological activity and molecular
    properties
  • General form of a QSAR equation
  • biol. activity f(P) with P molecular
    property/ies
  • or more specifically
  • biol. activity const. (c1.P1) (c2.P2)
    (c3.P3) ...
  • Molecular properties (descriptors) P are
    calculated for each molecule in the data set
  • Coefficients c and constant term are calculated
    by statistical methods (e.g. multiple linear
    regression)

4
Molecular QSAR-Descriptors
  • 1D Whole-molecule properties (e.g. molecular
    weight, melting point, logP etc.)
  • 2D Substituent constants (e.g. ?, ?, molar
    refractivity), fragment fingerprints,
    topological indices
  • 3D Surface or field properties (e.g.
    electrostatic potential, , steric fields,
    hydrophobicity, solvent accessible surface area
    etc.),

5
Introduction to QSAR
  • Why QSAR?
  • QSAR models are derived from a series of
    (similar) molecules with known activity (training
    set)
  • If a statistically relevant QSAR model has been
    found, it can be applied to new molecules in this
    series (test set) in order to predict their
    activity before biological testing (or even
    before synthesis!)

6
Introduction to QSAR
Example Analgesic activity of Capsaicin analogs
(taken from Walpole et al., Sandoz)
7
Introduction to QSAR
8
Introduction to QSAR
  • The Gibbs-Helmholtz equation (?GRTlnK) tells us
    that there is a logarithmic relationship between
    equilibrium constants (e.g. EC50) and free energy
    of binding
  • Thus, we have to transform the EC50 values to a
    logarithmic scale

9
Introduction to QSAR
  • Now, we require some molecular properties
    (descriptors)...
  • The Sandoz group decided to use two substituent
    constants the hydrophobic constant ? and the
    molar refractivity (MR) (correlated with the size
    and polarizability of the substituents)

10
Introduction to QSAR
  • We can plot the descriptor values vs. Log EC50 ...
  • ...and we can calculate linear equations for both
    parameters
  • Log EC50 0.76 - (0.82)?
  • Log EC50 1.14 - (0.07)MR

Our first QSAR equations!!!
11
Introduction to QSAR
  • How larger are the errors that we make?

?-Equation Log EC50 0.76 - (0.82)?
12
Introduction to QSAR
  • How larger are the errors that we make?

MR-Equation Log EC50 1.14 - (0.07)MR
13
Introduction to QSAR
  • How larger are the errors that we make?

Actual (measured) Log EC50 vs Predicted Log EC50
Correlation coefficients (R2) 0.88 0.53
14
Introduction to QSAR
  • Can we do any better by using both parameters in
    the equation (multiple linear regression (MLR)
    instead of simple linear regression)?

Best MLR Equation Log EC50 0.76 - (0.82)?
(0.0003)MR (Corr. Coeff. R2 0.89)
15
Introduction to QSAR
  • How can we validate these QSAR models?
  • Prediction within the training set (e.g. by
    leave-one-out cross validation)
  • leave out each compound once
  • calculate QSAR model with remaining compounds
    only
  • predict activity of left-out compound
  • compare prediction with "true" affinity
  • calculate "cross validated" R2 (often reported as
    Q2)
  • Prediction of the test set

16
Introduction to QSAR
  • How can we validate these QSAR models?
  • Prediction of the training set (cross
    validation)
  • Log EC50 0.76 - (0.82)? R20.89 Q20.72
  • Log EC50 1.14 - (0.07)MR R20.53 Q20.28
  • Log EC50 0.76 - (0.82)? (0.0003)MR R20.89
    Q20.58

17
Introduction to QSAR
  • How can we validate these QSAR models?
  • Prediction of the test set (compound 6i in this
    example)
  • Log EC50 0.76 - (0.82)? Predicted EC50
    for 6i 1.56
  • Log EC50 1.14 - (0.07)MR Predicted
    EC50 for 6i 0.42
  • Log EC50 0.76 - (0.82)? (0.0003)MR
    Predicted EC50 for 6i 1.57
  • Now we have a problem....

18
Introduction to QSAR
  • Some problems associated with "classical" QSAR
  • Only applicable within a chemical series
  • A good training set must be available
  • Activity data should be evenly spread
  • Activity data should span 3-4 orders of magnitude
    (log units)
  • Choice of meaningful descriptors
  • Problem of extrapolation (e.g. descriptors of
    test compounds lie out of descriptor range of
    training set)
  • Non-linear relationships are hard to detect
Write a Comment
User Comments (0)
About PowerShow.com