Title: Semantical and Algorithmic Aspects of the Living Franois Fages Constraint Programming projectteam, I
1Semantical and Algorithmic Aspects of the
LivingFrançois FagesConstraint Programming
project-team, INRIA Paris-Rocquencourt
- To tackle the complexity of biological systems,
investigate - Programming Theory Concepts
- Formal Methods of Circuit and Program
Verification - Automated Reasoning Tools
- Prototype Implementation in the Biochemical
Abstract Machine BIOCHAM - modeling environment available at
http//contraintes.inria.fr/BIOCHAM
2Systems Biology ?
- Systems Biology aims at systems-level
understanding which - requires a set of principles and methodologies
that links the - behaviors of molecules to systems characteristics
and functions. - H. Kitano, ICSB 2000
- Analyze (post-)genomic data produced with
high-throughput technologies (stored in databases
like GO, KEGG, BioCyc, etc.) - Integrate heterogeneous data about a specific
problem - Understand and Predict behaviors or interactions
in large networks of genes and proteins. - Systems Biology Markup Language (SBML) exchange
format for reaction models
3Issue of Abstraction
- Models are built in Systems Biology with two
contradictory perspectives
4Issue of Abstraction
- Models are built in Systems Biology with two
contradictory perspectives - 1) Models for representing knowledge the more
concrete the better
5Issue of Abstraction
- Models are built in Systems Biology with two
contradictory perspectives - 1) Models for representing knowledge the more
concrete the better - 2) Models for making predictions the more
abstract the better !
6Issue of Abstraction
- Models are built in Systems Biology with two
contradictory perspectives - 1) Models for representing knowledge the more
concrete the better - 2) Models for making predictions the more
abstract the better ! - These perspectives can be reconciled by
organizing formalisms and models into hierarchies
of abstractions. - To understand a system is not to know everything
about it but to know - abstraction levels that are sufficient for
answering questions about it
7Semantics of Living Processes ?
- Formally, the behavior of a system depends on
our choice of observables. - ?
?
Mitosis movie Lodish et al. 03
8Boolean Semantics
- Formally, the behavior of a system depends on
our choice of observables. - Presence/absence of molecules
- Boolean transitions
0
1
9Continuous Differential Semantics
- Formally, the behavior of a system depends on
our choice of observables. - Concentrations of molecules
- Rates of reactions
x
ý
10Stochastic Semantics
- Formally, the behavior of a system depends on
our choice of observables. - Numbers of molecules
- Probabilities of reaction
n
?
11Temporal Logic Semantics
- Formally, the behavior of a system depends on
our choice of observables. - Presence/absence of molecules
- Temporal logic formulas
F x F (x F (? x y)) FG (x v y)
F
x
12Constraint Temporal Logic Semantics
- Formally, the behavior of a system depends on
our choice of observables. - Concentrations of molecules
- Constraint LTL temporal formulas
F (x gt0.2) F (x gt0.2 F (xlt0.1 ygt0.2)) FG
(xgt0.2 v ygt0.2)
F
xgt1
13Language-based Approaches to Cell Systems Biology
- Qualitative models from diagrammatic notation to
- Boolean networks Kaufman 69, Thomas 73
- Petri Nets Reddy 93, Chaouiya 05
- Process algebra pcalculus Regev-Silverman-Shapir
o 99-01, Nagasali et al. 00 - Bio-ambients Regev-Panina-Silverman-Cardelli-Shap
iro 03 - Pathway logic Eker-Knapp-Laderoute-Lincoln-Mesegu
er-Sonmez 02 - Reaction rules Chabrier-Fages 03
Chabrier-Chiaverini-Danos-Fages-Schachter 04 - Quantitative models from ODEs and stochastic
simulations to - Hybrid Petri nets Hofestadt-Thelen 98, Matsuno
et al. 00 - Hybrid automata Alur et al. 01, Ghosh-Tomlin 01
HCC Bockmayr-Courtois 01 - Stochastic pcalculus Priami et al. 03
Cardelli et al. 06 - Reaction rules with continuous time dynamics
Fages-Soliman-Chabrier 04
14Overview of the Talk
- Rule-based Modeling of Biochemical Systems
- Syntax of molecules, compartments and reactions
- Semantics at three abstraction levels boolean,
differential, stochastic - Cell cycle control models
- Temporal Logic Language for Formalizing
Biological Properties - CTL for the boolean semantics
- Constraint LTL for the differential semantics
- PCTL for the stochastic semantics
- Automated Reasoning Tools
- Inferring kinetic parameter values from
Constraint-LTL specification - Inferring reaction rules from CTL specification
- L. Calzone, N. Chabrier, F. Fages, S. Soliman.
TCSB VI, LNBI 422068-94. 2006.
15Molecules of the living
- Small molecules covalent bonds 50-200 kcal/mol
- 70 water
- 1 ions
- 6 amino acids (20), nucleotides (5),
- fats, sugars, ATP, ADP,
- Macromolecules hydrogen bonds, ionic,
hydrophobic, Waals 1-5 kcal/mol - 20 proteins (50-104 amino acids)
- RNA (102-104 nucleotides AGCU)
- DNA (102-106 nucleotides AGCT)
16Structure Levels of Proteins
- 1) Primary structure word of n amino acids
residues (20n possibilities) - linked with C-N bonds
- Example INRIA
- Isoleucine-asparagiNe-aRginine-Isoleucine-Alanine
17Structure Levels of Proteins
- 1) Primary structure word of n amino acids
residues (20n possibilities) - linked with C-N bonds
- Example INRIA
- Isoleucine-asparagiNe-aRginine-Isoleucine-Alanine
- 2) Secondary word of m a-helix, b-strands,
random coils, (3m-10m) - stabilized by hydrogen
bonds H---O
18Structure Levels of Proteins
- 1) Primary structure word of n amino acids
residues (20n possibilities) - linked with C-N bonds
- Example INRIA
- Isoleucine-asparagiNe-aRginine-Isoleucine-Alanine
- 2) Secondary word of m a-helix, b-strands,
random coils, (3m-10m) - stabilized by hydrogen
bonds H---O - 3) Tertiary 3D structure spatial folding
-
stabilized by -
hydrophobic -
interactions
19Syntax of proteins
- Cyclin dependent kinase 1 Cdk1
- (free, inactive)
- Complex Cdk1-Cyclin B Cdk1CycB
- (low activity)
- Phosphorylated form Cdk1thr161-CycB
- at site threonine 161
- (high activity)
- mitosis promotion factor
20BIOCHAM Syntax of Objects
- E compound E-E Ep1,,pn
- Compound molecule, gene binding site, abstract
_at_process - - binding operator for protein complexes, gene
binding sites, - Associative and commutative.
- modification operator for phosphorylated
sites, - Set of modified sites
(Associative, Commutative, Idempotent). - O E Elocation
- Location symbolic compartment (nucleus,
cytoplasm, membrane, ) - S _ OS
- solution operator (Associative, Commutative,
Neutral _)
21Elementary Reaction Rules
- Complexation A B gt A-B
Decomplexation A-B gt A B - cdk1cycB gt cdk1cycB
22Elementary Reaction Rules
- Complexation A B gt A-B
Decomplexation A-B gt A B - cdk1cycB gt cdk1cycB
- Phosphorylation A Cgt Ap
Dephosphorylation Ap Cgt A - Cdk1-CycB Myt1gt Cdk1thr161-CycB
- Cdk1thr14,tyr15-CycB Cdc25Ntermgt
Cdk1-CycB
23Elementary Reaction Rules
- Complexation A B gt A-B
Decomplexation A-B gt A B - cdk1cycB gt cdk1cycB
- Phosphorylation A Cgt Ap
Dephosphorylation Ap Cgt A - Cdk1-CycB Myt1gt Cdk1thr161-CycB
- Cdk1thr14,tyr15-CycB Cdc25Ntermgt
Cdk1-CycB - Synthesis _ Cgt A.
Degradation A Cgt _. - _ E2-E2f13-Dp12gt CycA cycE _at_UbiProgt _
-
(not for cycE-cdk2 which is stable)
24Elementary Reaction Rules
- Complexation A B gt A-B
Decomplexation A-B gt A B - cdk1cycB gt cdk1cycB
- Phosphorylation A Cgt Ap
Dephosphorylation Ap Cgt A - Cdk1-CycB Myt1gt Cdk1thr161-CycB
- Cdk1thr14,tyr15-CycB Cdc25Ntermgt
Cdk1-CycB - Synthesis _ Cgt A.
Degradation A Cgt _. - _ E2-E2f13-Dp12gt CycA cycE _at_UbiProgt _
-
(not for cycE-cdk2 which is stable) - Transport AL1 gt AL2
- Cdk1p-CycBcytoplasm gt Cdk1p-CycBnucleus
25From Syntax to Semantics
- R SgtS S Ogt S S ltgt S S ltOgt S
- where A Cgt B stands
for AC gt BC - A ltgt B
stands for AgtB and BgtA, etc. - kinetic for R (import/export
SBML format) - In SBML no semantics (exchange format)
26From Syntax to Semantics
- R SgtS S Ogt S S ltgt S S ltOgt S
- where A Cgt B stands
for AC gt BC - A ltgt B
stands for AgtB and BgtA, etc. - kinetic for R (import/export
SBML format) - In SBML no semantics (exchange format)
- In BIOCHAM three abstraction levels
- Boolean Semantics presence-absence of molecules
- Concurrent Transition System (asynchronous,
non-deterministic)
27From Syntax to Semantics
- R SgtS S Ogt S S ltgt S S ltOgt S
- where A Cgt B stands
for AC gt BC - A ltgt B
stands for AgtB and BgtA, etc. - kinetic for R (import/export
SBML format) - In SBML no semantics (exchange format)
- In BIOCHAM three abstraction levels
- Boolean Semantics presence-absence of molecules
- Concurrent Transition System (asynchronous,
non-deterministic) - Differential Semantics concentration
- Ordinary Differential Equations or Hybrid system
(deterministic)
28From Syntax to Semantics
- R SgtS S Ogt S S ltgt S S ltOgt S
- where A Cgt B stands
for AC gt BC - A ltgt B
stands for AgtB and BgtA, etc. - kinetic for R (import/export
SBML format) - In SBML no semantics (exchange format)
- In BIOCHAM three abstraction levels
- Boolean Semantics presence-absence of molecules
- Concurrent Transition System (asynchronous,
non-deterministic) - Differential Semantics concentration
- Ordinary Differential Equations or Hybrid system
(deterministic) - Stochastic Semantics number of molecules
- Continuous time Markov chain
291. Differential Semantics
- Associates to each molecule its concentration
Ai Ai / volume ML-1 - volume of diffusion
301. Differential Semantics
- Associates to each molecule its concentration
Ai Ai / volume ML-1 - volume of compartment
- Compiles a set of rules ei for SigtSI i1,,n
(by default ei is MA(1)) - into the system of ODEs (or hybrid automaton)
over variables A1,,Ak - dA/dt Sni1 ri(A)ei - Snj1 lj(A)ej
- where ri(A) (resp. li(A)) is the stoichiometric
coefficient of A in Si (resp. Si) - multiplied by the volume ratio of the location of
A.
311. Differential Semantics
- Associates to each molecule its concentration
Ai Ai / volume ML-1 - volume of compartment
- Compiles a set of rules ei for SigtSI i1,,n
(by default ei is MA(1)) - into the system of ODEs (or hybrid automaton)
over variables A1,,Ak - dA/dt Sni1 ri(A)ei - Snj1 lj(A)ej
- where ri(A) (resp. li(A)) is the stoichiometric
coefficient of A in Si (resp. Si) - multiplied by the volume ratio of the location of
A. - volume_ratio (15,n),(1,c).
- mRNAcycAn ltgt mRNAcycAc.
- means 15Vn Vc and is equivalent to
15mRNAcycAn ltgt mRNAcycAc.
32Numerical Integration
- Adaptive step size 4th order Runge-Kutta can be
weak for stiff systems - Rosenbrock implicit method using the Jacobian
matrix ?xi/?xj - computes a (clever) discretization of time
- and a time series of concentrations and their
derivatives - (t0, X0, dX0/dt), (t1, X1, dX1/dt), , (tn, Xn,
dXn/dt), - at discrete time points
332. Stochastic Semantics
- Associate to each molecule its number Ai in its
location of volume Vi
342. Stochastic Semantics
- Associate to each molecule its number Ai in its
location of volume Vi - Compile the rule set into a continuous time
Markov chain - over vector states X(A1,, Ak)
- and where the transition rate ti for the
reaction ei for SigtSI - (giving probability after normalization) is
obtained from ei by replacing concentrations by
molecule numbers
352. Stochastic Semantics
- Associate to each molecule its number Ai in its
location of volume Vi - Compile the rule set into a continuous time
Markov chain - over vector states X(A1,, Ak)
- and where the transition rate ti for the
reaction ei for SigtSI - (giving probability after normalization) is
obtained from ei by replacing concentrations by
molecule numbers - Stochastic simulation Gillespie 76, Gibson 00
- computes realizations as time series (t0, X0),
(t1, X1), , (tn, Xn),
363. Boolean Semantics
- Associate to each molecule a Boolean denoting its
presence/absence in its location
373. Boolean Semantics
- Associate to each molecule a Boolean denoting its
presence/absence in its location - Compile the rule set into an asynchronous
transition system
383. Boolean Semantics
- Associate to each molecule a Boolean denoting its
presence/absence in its location - Compile the rule set into an asynchronous
transition system where a reaction like ABgtCD
is translated into 4 transition rules taking into
account the possible complete consumption of
reactants - AB?ABCD
- AB??AB CD
- AB?A?BCD
- AB??A?BCD
393. Boolean Semantics
- Associate to each molecule a Boolean denoting its
presence/absence in its location - Compile the rule set into an asynchronous
transition system where a reaction like ABgtCD
is translated into 4 transition rules taking into
account the possible complete consumption of
reactants - AB?ABCD
- AB??AB CD
- AB?A?BCD
- AB??A?BCD
- Necessary for over-approximating the possible
behaviors under the stochastic/discrete semantics
(abstraction N ? zero, non-zero)
40Hierarchy of Semantics
abstraction
Theory of abstract Interpretation
Cousot Cousot POPL77 Fages Soliman
TCSc07
Boolean model
Discrete model
Differential model
Stochastic model
Models for answering queries The more abstract
the better Optimal abstraction w.r.t. query
Syntactical model
concretization
41Query what are the stationary states ?
Boolean circuit analysis
abstraction
abstraction
Discrete circuit analysis
Boolean model
abstraction
Jacobian circuit analysis
Discrete model
abstraction
Differential model
Positive circuits are a necessary condition for
multistationarity Thomas 73 de Jong 02 Soulé
03 Remy Ruet Thieffry 05
Stochastic model
Syntactical model
concretization
42Type Inference / Type Checking
abstraction
Fages Soliman CMSB06
Boolean model
Discrete model
Differential model
Influence graph of proteins Protein functions
(kinase, phosphatase,) Compartments topology
Stochastic model
Syntactical model
concretization
43Type Inference / Type Checking
abstraction
Fages Soliman CMSB06
Boolean model
Influence graph of proteins (activation/inhibition
)
Discrete model
Differential model
Influence graph of proteins Protein functions
(kinase, phosphatase,) Compartments topology
Stochastic model
Syntactical model
concretization
44Cell Cycle G1 ? DNA Synthesis ? G2 ? Mitosis
- G1 CdK4-CycD S Cdk2-CycA
G2,M Cdk1-CycA - Cdk6-CycD
Cdk1-CycB (MPF) - Cdk2-CycE
45Example Cell Cycle Control Model Tyson 91
- MA(k1) for _ gt Cyclin.
- MA(k2) for Cyclin gt _.
- MA(K7) for Cyclinp1 gt _.
- MA(k8) for Cdc2 gt Cdc2p1.
- MA(k9) for Cdc2p1 gtCdc2.
- MA(k3) for CyclinCdc2p1 gt
Cdc2p1-Cyclinp1. - MA(k4p) for Cdc2p1-Cyclinp1 gt
Cdc2-Cyclinp1. - k4Cdc2-Cyclinp12Cdc2p1-Cyclinp1
for - Cdc2p1-Cyclinp1 Cdc2-Cyclinp1 gt
Cdc2-Cyclinp1. - MA(k5) for Cdc2-Cyclinp1 gt Cdc2p1-Cyclinp
1. - MA(k6) for Cdc2-Cyclinp1 gt Cdc2Cyclinp1.
46Interaction Graph
47Stochastic Simulation
48Differential Simulation
49Boolean Simulation
50(No Transcript)
51Mammalian Cell Cycle Control Map Kohn 99
52Transcription of Kohns Map
- _ E2F13-DP12-gE2 gt cycA.
- ...
- cycB APCp1 gt_.
- cdk1p1,p2,p3 cycA gt cdk1p1,p2,p3-cycA.
- cdk1p1,p2,p3 cycB gt cdk1p1,p2,p3-cycB.
- ...
- cdk1p1,p3-cycA Wee1 gt cdk1p1,p2,p3-cycA
. - cdk1p1,p3-cycB Wee1 gt cdk1p1,p2,p3-cycB
. - cdk1p2,p3-cycA Myt1 gt cdk1p1,p2,p3-cycA
. - cdk1p2,p3-cycB Myt1 gt cdk1p1,p2,p3-cycB
. - ...
- cdk1p1,p2,p3 cdc25Cp1,p2 gt
cdk1p1,p3. - cdk1p1,p2,p3-cycA cdc25Cp1,p2 gt
cdk1p1,p3-cycA. - cdk1p1,p2,p3-cycB cdc25Cp1,p2 gt
cdk1p1,p3-cycB.
165 proteins and genes, 500 variables, 800 rules
Chiaverini Danos 02
53Overview of the Talk
- Rule-based Modeling of Biochemical Systems
- Syntax of molecules, compartments and reactions
- Semantics at three abstraction levels boolean,
differential, stochastic - Cell cycle control models
- Temporal Logic Language for Formalizing
Biological Properties - CTL for the boolean semantics
- Constraint LTL for the differential semantics
- PCTL for the stochastic semantics
- Automated Reasoning Tools
- Inferring kinetic parameter values from
Constraint-LTL specification - Inferring reaction rules from CTL specification
54A Logical Paradigm for Systems Biology
- Biological model Transition System
- Biological property Temporal Logic Formula
- Biological validation Model-checking
- Formalize properties of the biological system in
- Computation Tree Logic CTL for the boolean
semantics - Linear Time Logic with numerical constraints for
the concentration semantics - Probabilistic CTL with numerical constraints for
the stochastic semantics - Evaluate the formulas by model checking
techniques - Lincoln et al. PSB02 Chabrier Fages CMSB03
Bernot et al. TCS04
55A Logical Paradigm for Systems Biology
- Biological model Transition System
- Biological property Temporal Logic Formula
- Biological validation Model-checking
- In the Biochemical Abstract Machine environment,
- Model BIOCHAM
- - Boolean - simulation
- - Differential
- - Stochastic
- (SBML)
56A Logical Paradigm for Systems Biology
- Biological model Transition System
- Biological property Temporal Logic Formula
- Biological validation Model-checking
- In the Biochemical Abstract Machine environment,
- Model BIOCHAM
Biological Properties - - Boolean - simulation
- CTL - - Differential - query evaluation
- LTL with constraints - - Stochastic
- PCTL with constraints - (SBML)
57A Logical Paradigm for Systems Biology
- Biological model Transition System
- Biological property Temporal Logic Formula
- Biological validation Model-checking
- In the Biochemical Abstract Machine environment,
- Model BIOCHAM
Biological Properties - - Boolean - simulation
- CTL - - Differential - query evaluation
- LTL with constraints - - Stochastic
- PCTL with constraints - (SBML)
58A Logical Paradigm for Systems Biology
- Biological model Transition System
- Biological property Temporal Logic Formula
- Biological validation Model-checking
- In the Biochemical Abstract Machine environment,
- Model BIOCHAM
Biological Properties - - Boolean - simulation
- CTL - - Differential - query evaluation
- LTL with constraints - - Stochastic - rule learning
- PCTL with constraints - (SBML) - parameter search
592.1 Computation Tree Logic CTL
- Extension of propositional (or first-order) logic
with operators for time and choices Clarke et
al. 99
60Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P)
61Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P) - Can the cell produce P, Q and not R?
reachable(PQ?R)
62Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P) - Can the cell produce P, Q and not R?
reachable(PQ?R) - Can the cell always produce P? AG(reachable(P))
63Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P) - Can the cell produce P, Q and not R?
reachable(PQ?R) - Can the cell always produce P? AG(reachable(P))
- About pathways
- Can the cell reach a (partially described) set
of states s while passing by another set of
states s2? EF(s2EFs)
64Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P) - Can the cell produce P, Q and not R?
reachable(PQ?R) - Can the cell always produce P? AG(reachable(P))
- About pathways
- Can the cell reach a (partially described) set
of states s while passing by another set of
states s2? EF(s2EFs) - Is it possible to produce P without Q? E(?Q U P)
65Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P) - Can the cell produce P, Q and not R?
reachable(PQ?R) - Can the cell always produce P? AG(reachable(P))
- About pathways
- Can the cell reach a (partially described) set
of states s while passing by another set of
states s2? EF(s2EFs) - Is it possible to produce P without Q? E(?Q U P)
- Is (set of) state s2 a necessary checkpoint for
reaching (set of) state s? - checkpoint(s2,s) ?E(?s2U s)
66Biological Properties formalized in CTL (1/3)
- About reachability
- Can the cell produce some protein P?
reachable(P)EF(P) - Can the cell produce P, Q and not R?
reachable(PQ?R) - Can the cell always produce P? AG(reachable(P))
- About pathways
- Can the cell reach a (partially described) set
of states s while passing by another set of
states s2? EF(s2EFs) - Is it possible to produce P without Q? E(?Q U P)
- Is (set of) state s2 a necessary checkpoint for
reaching (set of) state s? - checkpoint(s2,s) ?E(?s2U s)
- Is s2 always a checkpoint for s? AG(?s -gt
checkpoint(s2,s))
67Biological Properties formalized in CTL (2/3)
- About stationarity
- Is a (set of) state s a stable state? stable(s)
AG(s)
68Biological Properties formalized in CTL (2/3)
- About stationarity
- Is a (set of) state s a stable state? stable(s)
AG(s) - Is s a steady state (with possibility of
escaping) ? steady(s)EG(s)
69Biological Properties formalized in CTL (2/3)
- About stationarity
- Is a (set of) state s a stable state? stable(s)
AG(s) - Is s a steady state (with possibility of
escaping) ? steady(s)EG(s) - Can the cell reach a stable state s?
EF(stable(s)) not in LTL
70Biological Properties formalized in CTL (2/3)
- About stationarity
- Is a (set of) state s a stable state? stable(s)
AG(s) - Is s a steady state (with possibility of
escaping) ? steady(s)EG(s) - Can the cell reach a stable state s?
EF(stable(s)) not in LTL - Must the cell reach a stable state s?
AG(stable(s))
71Biological Properties formalized in CTL (2/3)
- About stationarity
- Is a (set of) state s a stable state? stable(s)
AG(s) - Is s a steady state (with possibility of
escaping) ? steady(s)EG(s) - Can the cell reach a stable state s?
EF(stable(s)) not in LTL - Must the cell reach a stable state s?
AG(stable(s)) - What are the stable states? Not expressible in
CTL. Needs to combine CTL with search (e.g.
constraint programming Thieffry et al. 05 )
72Biological Properties formalized in CTL (3/3)
- About oscillations
- Can the system exhibit a cyclic behavior w.r.t.
the presence of P ? oscil(P) EG(F ?P F P) - CTL formula that can be approximated in CTL
by - oscil(P) EG((P ? EF ?P) (?P ? EF P))
- (necessary but not sufficient condition for
oscillation)
73Biological Properties formalized in CTL (3/3)
- About oscillations
- Can the system exhibit a cyclic behavior w.r.t.
the presence of P ? oscil(P) EG((P ? EF ?P)
(?P ? EF P)) - (necessary but not sufficient condition)
- Can the system loops between states s and s2 ?
- loop(P,Q) EG((s ? EF s2) (s2 ? EF s))
74Symbolic Model-Checking
- Still for finite Kripke structures, use boolean
constraints to represent - sets of states as a boolean constraint c(V)
- the transition relation as a boolean constraint
r(V,V) - Binary Decision Diagrams BDD Bryant 85 provide
canonical forms to Boolean formulas (decide
Boolean equivalence) - (x?y)?(y?z)?(z?x)
- and
- (x?z)?(z?y)?(y?x)
- are equivalent, they
- have the same BDD(x,y,z)
-
75Mammalian Cell Cycle Control Map Kohn 99
76Cell Cycle Model-Checking (with BDD NuSMV)
- biocham check_reachable(cdk46p1,p2-cycDp1).
- Ei(EF(cdk46p1,p2-cycDp1)) is true
- biocham check_checkpoint(cdc25Cp1,p2,
cdk1p1,p3-cycB). - Ai(!(E(!(cdc25Cp1,p2) U cdk1p1,p3-cycB)))
is true - biocham nusmv(Ai(AG(!(cdk1p1,p2,p3-cycB) -gt
checkpoint(Wee1, cdk1p1,p2,p3-cycB))))). - Ai(AG(!(cdk1p1,p2,p3-cycB)-gt!(E(!(Wee1) U
cdk1p1,p2,p3-cycB)))) is false - biocham why.
- -- Loop starts here
- cycB-cdk1p1,p2,p3 is present
- cdk7 is present
- cycH is present
- cdk1 is present
- Myt1 is present
- cdc25Cp1 is present
- rule_114 cycB-cdk1p1,p2,p3cdc25Cp1gtcycB-
cdk1p2,p3. - cycB-cdk1p2,p3 is present
- cycB-cdk1p1,p2,p3 is absent
- rule_74 cycB-cdk1p2,p3Myt1gtcycB-cdk1p1,p2
,p3. - cycB-cdk1p2,p3 is absent
77Mammalian Cell Cycle Control Benchmark
- 500 variables, 2500 states. 800 rules.
- BIOCHAM NuSMV model-checker time in sec.
Chabrier et al. TCS 04
782.2 LTL with Constraints for the Differential
Semantics
- Constraints over concentrations and derivatives
as FOL formulae over the reals - M gt 0.2
- MP gt Q
- d(M)/dt lt 0
79LTL with Constraints for the Differential
Semantics
- Constraints over concentrations and derivatives
as FOL formulae over the reals - M gt 0.2
- MP gt Q
- d(M)/dt lt 0
- Linear Time Logic LTL operators for time X, F, U,
G - F(Mgt0.2)
- FG(Mgt0.2)
- F (Mgt2 F (d(M)/dtlt0 F (Mlt2
d(M)/dtgt0 F(d(M)/dtlt0)))) - oscil(M,n) defined as at least n alternances of
sign of the derivative - Period(A,75) ? t ?v F(T t A v
d(A)/dt gt 0 X(d(A)/dt lt 0) - F(T t 75 A v
d(A)/dt gt 0 X(d(A)/dt lt 0)))
80How to Evaluate a Constraint LTL Formula ?
- Consider the ODEs of the concentration semantics
dX/dt f(X)
81How to Evaluate a Constraint LTL Formula ?
- Consider the ODEs of the concentration semantics
dX/dt f(X) - Numerical integration methods produce a
discretization of time (adaptive step size
Runge-Kutta or Rosenbrock method for stiff syst.) - The trace is a linear Kripke structure
- (t0,X0,dX0/dt), (t1,X1,dX1/dt), ,
(tn,Xn,dXn/dt), - over concentrations and their derivatives at
discrete time points - Evaluate the formula on that Kripke structure
with a model checking alg. -
82Simulation-Based Constraint LTL Model Checking
- Hypothesis 1 the initial state is completely
known - Hypothesis 2 the formula can be checked over a
finite period of time 0,T - Run the numerical integration from 0 to T
producing
values at a finite sequence of time points - Iteratively label the time points with the
sub-formulae of f that are true - Add f to the time points where a FOL formula
f is true, - Add F f (X f) to the (immediate) previous
time points labeled by f, - Add f1 U f2 to the predecessor time points
of f2 while they satisfy f1, - (Add G f to the states satisfying f until
T) - Model checker and numerical integration methods
implemented in Prolog
83Constraint-LTL Instanciation Algo. Fages Rizk
CMSB07
842.3 PCTL Model Checker for the Stochastic
Semantics
- Compute the probability of realisation of a TL
formula (with constraints) by Monte Carlo method - Perform several stochastic simulations
- Evaluate the probability of realization of the TL
formula - Costly
- PRISM Kwiatkowska et al. 04 PCTL model
checker based on BDDs or using Monte Carlo method.
85Overview of the Talk
- Rule-based Modeling of Biochemical Systems
- Syntax of molecules, compartments and reactions
- Semantics at three abstraction levels boolean,
differential, stochastic - Cell cycle control models
- Temporal Logic Language for Formalizing
Biological Properties - CTL for the boolean semantics
- Constraint LTL for the differential semantics
- PCTL for the stochastic semantics
- Automated Reasoning Tools
- Inferring kinetic parameter values from
Constraint-LTL specification - Inferring reaction rules from CTL specification
86Example Cell Cycle Control Model Tyson 91
- MA(k1) for _ gt Cyclin.
- MA(k2) for Cyclin gt _.
- MA(K7) for Cyclinp1 gt _.
- MA(k8) for Cdc2 gt Cdc2p1.
- MA(k9) for Cdc2p1 gtCdc2.
- MA(k3) for CyclinCdc2p1 gt
Cdc2p1-Cyclinp1. - MA(k4p) for Cdc2p1-Cyclinp1 gt
Cdc2-Cyclinp1. - k4Cdc2-Cyclinp12Cdc2p1-Cyclinp1
for - Cdc2p1-Cyclinp1 Cdc2-Cyclinp1 gt
Cdc2-Cyclinp1. - MA(k5) for Cdc2-Cyclinp1 gt Cdc2p1-Cyclinp
1. - MA(k6) for Cdc2-Cyclinp1 gt Cdc2Cyclinp1.
873.1 Inferring Parameters from Temporal Properties
- biocham learn_parameter(k3,k4,(0,200),(0,200)
,20, -
oscil(Cdc2-Cyclinp1,3),150).
883.1 Inferring Parameters from Temporal Properties
- biocham learn_parameter(k3,k4,(0,200),(0,200)
,20, -
oscil(Cdc2-Cyclinp1,3),150). - First values found
- parameter(k3,10).
- parameter(k4,70).
893.1 Inferring Parameters from Temporal Properties
- biocham learn_parameter(k3,k4,(0,200),(0,200)
,20, - oscil(Cdc2-Cyclinp1,3)
F(Cdc2-Cyclinp1gt0.15), 150). - First values found
- parameter(k3,10).
- parameter(k4,120).
903.1 Inferring Parameters from LTL Specification
- biocham learn_parameter(k3,k4,(0,200),(0,200)
,20, -
period(Cdc2-Cyclinp1,35), 150). - First values found
- parameter(k3,10).
- parameter(k4,280).
91Linking the Cell and Circadian Cycles through Wee1
Cell cycle
Leloup and Goldbeter (1999)
Wee1P
Wee1
.. ..
preMPF
MPF
APC
APC
Cdc25P
.. ..
.. ..
Cdc25
.. ..
L
L. Calzone, S. Soliman 2006
92PCN
Wee1m
BN
Wee1
MPF
Cdc25
93Condition on Wee1/Cdc25 for the Entrainment in
Period
Entrainment in period constraint expressed in LTL
with the period formula
943.2. Inferring Rules from Temporal Properties
- Given
- a BIOCHAM model (background knowledge)
- a set of properties formalized in temporal logic
- learn
- revisions of the reaction model, i.e. rules to
delete and rules to add such that the revised
model satisfies the properties
95Model Revision from Temporal Properties
- Background knowledge T BIOCHAM model
- reaction rule language complexation,
phosphorylation, - Examples f biological properties formalized in
temporal logic language - Reachability
- Checkpoints
- Stable states
- Oscillations
- Bias R Reaction rule patterns or parameter
ranges - Kind of rules to add or delete
- Find a revision T of T such that T f
96Model Revision Algorithm
- General idea of constraint programming replace a
generate-and-test algorithm by a
constrain-and-generate algorithm. - Anticipate whether one has to add or remove a
rule. - Positive ECTL formula if false, remains false
after removing a rule - EF(f) where f is a boolean formula (pure state
description) - Oscil(f)
- Negative ACTL formula if false, remains false
after adding a rule - AG(f) where f is a boolean formula,
- Checkpoint(a,b) E(aUb)
- Remove a rule on the path given by the model
checker (why command) - Unclassified CTL formulae
97Example Cell Cycle Control Model Tyson 91
- MA(k1) for _ gt Cyclin.
- MA(k2) for Cyclin gt _.
- MA(K7) for Cyclinp1 gt _.
- MA(k8) for Cdc2 gt Cdc2p1.
- MA(k9) for Cdc2p1 gtCdc2.
- MA(k3) for CyclinCdc2p1 gt
Cdc2p1-Cyclinp1. - MA(k4p) for Cdc2p1-Cyclinp1 gt
Cdc2-Cyclinp1. - k4Cdc2-Cyclinp12Cdc2p1-Cyclinp1
for - Cdc2p1-Cyclinp1 Cdc2-Cyclinp1 gt
Cdc2-Cyclinp1. - MA(k5) for Cdc2-Cyclinp1 gt Cdc2p1-Cyclinp
1. - MA(k6) for Cdc2-Cyclinp1 gt Cdc2Cyclinp1.
98Automatic Generation of True CTL Properties
- Ei(reachable(Cyclin)))
- Ei(reachable(!(Cyclin))))
- Ai(oscil(Cyclin)))
- Ei(reachable(Cdc2p1)))
- Ei(reachable(!(Cdc2p1))))
- Ai(oscil(Cdc2p1)))
- Ai(AG(!(Cdc2p1)-gtcheckpoint(Cdc2,Cdc2p1))))
- Ei(reachable(Cdc2-Cyclinp1,p2)))
- Ei(reachable(!(Cdc2-Cyclinp1,p2))))
- Ai(oscil(Cdc2-Cyclinp1,p2)))
- Ei(reachable(Cdc2-Cyclinp1)))
- Ei(reachable(!(Cdc2-Cyclinp1))))
- Ai(oscil(Cdc2-Cyclinp1)))
- Ai(AG(!(Cdc2-Cyclinp1)-gtcheckpoint(Cdc2-Cyclin
p1,p2,Cdc2-Cyclinp1))) - Ei(reachable(Cdc2)))
- Ei(reachable(!(Cdc2))))
- Ai(oscil(Cdc2)))
- Ei(reachable(Cyclinp1)))
- Ei(reachable(!(Cyclinp1))))
99Rule Deletion
- biocham delete_rules(Cdc2 gt Cdc2p1).
- biocham check_all.
- First formula not satisfied
- Ei(EF(Cdc2-Cyclinp1))
100Model Revision from Temporal Properties
- biocham revise_model.
- Rules to delete
- Rules to add
- Cdc2 gt Cdc2p1.
101Model Revision from Temporal Properties
- biocham revise_model.
- Rules to delete
- Rules to add
- Cdc2 gt Cdc2p1.
- biocham learn_one_addition.
- (1) Cdc2 gt Cdc2p1.
- (2) Cdc2 Cdc2gt Cdc2p1.
- (3) Cdc2 Cyclingt Cdc2p1.
102Conclusion
- Temporal logic with constraints is powerful
enough to express both qualitative and
quantitative biological properties of systems
103Conclusion
- Temporal logic with constraints is powerful
enough to express both qualitative and
quantitative biological properties of systems - Three levels of abstraction in BIOCHAM
- Boolean semantics CTL formulas (rule
learning) - Differential semantics LTL with constraints over
reals (parameter search) - Stochastic semantics Probabilistic CTL with
integer constraints
104Conclusion
- Temporal logic with constraints is powerful
enough to express both qualitative and
quantitative biological properties of systems - Three levels of abstraction in BIOCHAM
- Boolean semantics CTL formulas (rule
learning) - Differential semantics LTL with constraints over
reals (parameter search) - Stochastic semantics Probabilistic CTL with
integer constraints - Parameter search from temporal properties proved
useful and complementary to bifurcation theory
tools (Xppaut)
105Conclusion
- Temporal logic with constraints is powerful
enough to express both qualitative and
quantitative biological properties of systems - Three levels of abstraction in BIOCHAM
- Boolean semantics CTL formulas (rule
learning) - Differential semantics LTL with constraints over
reals (parameter search) - Stochastic semantics Probabilistic CTL with
integer constraints - Parameter search from temporal properties proved
useful and complementary to bifurcation theory
tools (Xppaut) - Rule inference from temporal properties still in
infancy, to be optimized and improved by types
(e.g. protein functions, computed by abstract
interpretation)
106Collaborations
- STREP APrIL2 Luc de Raedt, Univ. Freiburg,
Stephen Muggleton, IC , - Learning in a probabilistic logic setting
(finished) - ARC MOCA
- modularity, compositionality and abstraction
- NoE REWERSE semantic web, François Bry, Münich,
Rolf Backofen, - Connecting Biocham to gene and protein ontologies
(types) - STREP TEMPO Cancer chronotherapies, INSERM
Villejuif, F. Lévi Bang J. Clairambault,
Contraintes S. Soliman - Coupled models of cell cycle, circadian cycle,
cytotoxic drugs. - INRA Tours E. Reiter, D. Heitzler, Sysiphe F.
Clément - Model of FSH signalling.