Inductive Logic Programming - PowerPoint PPT Presentation

About This Presentation
Title:

Inductive Logic Programming

Description:

Inductive Logic Programming t.k.prasad_at_wright.edu http://www.knoesis.org/tkprasad/ Includes s by Luis Tari CS774 * L16ILP * * * Life sciences mutagenecity ... – PowerPoint PPT presentation

Number of Views:89
Avg rating:3.0/5.0
Slides: 25
Provided by: Luis50
Learn more at: http://cecs.wright.edu
Category:

less

Transcript and Presenter's Notes

Title: Inductive Logic Programming


1
Inductive Logic Programming
  • t.k.prasad_at_wright.edu
  • http//www.knoesis.org/tkprasad/
  • Includes slides by Luis Tari

2
Logic Programming
  • Consider the following example of a logic program

parent_of(charles,george). parent_of(george,diana)
. parent_of(bob,harry). parent_of(harry,elizabeth)
. grandparent_of(X,Y) - parent_of(X,Z),
parent_of(Z,Y).
  • From the program, we can ask queries about
    grandparents.
  • Query grandparent_of(X,Y)?
  • Answers
  • grandparent_of(charles,diana).
  • grandparent_of(bob,elizabeth).

3
(Machine) Learning
  • The process by which relatively permanent changes
    occur in behavioral potential as a result of
    experience. (Anderson)
  • Learning is constructing or modifying
    representations of what is being experienced.
    (Michalski)
  • A computer program is said to learn from
    experience E with respect to some class of tasks
    T and performance measure P, if its performance
    at tasks in T, as measured by P, improves with
    experience E. (Mitchell)

4
Forms of Reasoning
  • Deduction From causes to effect (Prediction)
  • fact a, rule a gt b
  • INFER b
    (First-order logic)
  • Abduction From effects to possible causes
    (Explanation)
  • rule a gt b, observe b
  • AN EXPLANATION a
  • Induction From correlated observations to rules
    (Learning)
  • observe correlation between a1, b1, ... an,
    bn
  • LEARN a -gt b

5
What is ILP?
  • Inductive Logic Programming (ILP)
  • Automated learning of logic rules from examples
    and background knowledge
  • E.g., learn the rule for grandparents, given
    background knowledge of parents and examples of
    grandparents
  • ILP can be used for classification and prediction

6
Why ILP ? multiple relations
  • Genealogy example
  • Given known relations
  • father(Old,Young) and mother(Old,Young)
  • male(Somebody) and female(Somebody)
  • learn new relations
  • parent(X,Y) - father(X,Y).
  • parent(X,Y) - mother(X,Y).
  • brother(X,Y) -
  • male(X),father(Z,X),father(Z,Y).
  • Most ML techniques cannot use more than one
    relation
  • e.g., decision trees, neural networks,

7
ILP formal definitions
  • Given
  • a logic program B representing background
    knowledge
  • a set of positive examples E
  • a set of negative examples E-
  • Find hypothesis H such that

8
ILP logical foundation
  • Prolog Programming with Logic
  • is used to represent
  • Background knowledge (of the domain) facts
  • Examples (of the relation to be learned) facts
  • Theories (as a result of learning) rules
  • Supports two forms of logical reasoning
  • Deduction
  • Induction

9
Logical reasoning deduction
  • From rules to facts

B ? T -
E
mother(penelope,victoria). mother(penelope,arthur)
. father(christopher,victoria). father(christopher
,arthur).
parent(penelope,victoria). parent(penelope,arthur)
. parent(christopher,victoria). parent(christopher
,arthur).

parent(X,Y) - father(X,Y). parent(X,Y) -
mother(X,Y).
10
Logical reasoning induction
  • From facts to rules

B ? E -
T
mother(penelope,victoria). mother(penelope,arthur)
. father(christopher,victoria). father(christopher
,arthur).
parent(penelope,victoria). parent(penelope,arthur)
. parent(christopher,victoria). parent(christopher
,arthur).

parent(X,Y) - father(X,Y). parent(X,Y) -
mother(X,Y).
11
Example
  • Background knowledge B
  • parent_of(charles,george).
  • parent_of(george,diana).
  • parent_of(bob,harry).
  • parent_of(harry,elizabeth).
  • Positive examples E
  • grandparent_of(charles,diana).
  • grandparent_of(bob,elizabeth).
  • Generate hypothesis H
  • grandparent_of(X,Y) - parent_of(X,Z),
    parent_of(Z,Y).

12
Example Same Generation
13
Why ILP ? - Structured data
  • Seed example of East-West trains (Michalski)
  • What makes a train to go eastward ?

14
Why ILP ? multiple relations
  • This is related to structured data

has_car
car_properties
Train Car
t1 c11
t1 c12
t1 c13
t1 c14
t2 c21

Car Length Shape Axes Roof
c11 short rectangle 2 none
c12 long rectangle 3 none
c13 short rectangle 2 peaked
c14 long rectangle 2 none
c21 short rectangle 2 flat

15
Induction of a classifier example
  • Example of East-West trains
  • B relations has_car and car_properties (length,
    roof, shape, etc.)
  • e.g., has_car(t1,c11)
  • E the trains t1 to t10
  • C east, west
  • Possible T
  • east(T) -
  • has_car(T,C), length(C,short), roof(C,_).

16
ILP systems
  • Two of the most popular ILP systems
  • Progol
  • FOIL
  • Progol Muggleton95
  • Developed by S. Muggleton et. al.
  • Learns first-order Horn clauses (no negation in
    head and body literals of hypotheses)
  • FOIL Quinlan93
  • Developed by J. Quinlan et. al.
  • Learns first-order rules (no negation in head
    literals of the hypotheses)

17
Rule Learning (Intuition)
  • How to come up with a rule for grandparent_of(X,Y)
    ?
  • Take the example grandparent_of(bob,elizabeth).
  • Find the subset of background knowledge relevant
    to this example parent_of(bob,harry),

    parent_of(harry,elizabeth).
  • Form a rule from these facts
  • grandparent_of(bob,elizabeth) -
  • parent_of(bob,harry), parent_of(harry,elizabeth).
  • Generalize the rule
  • grandparent_of(X,Y) - parent_of(X,Z),
    parent_of(Z,Y).
  • Check if this rule is valid w.r.t the positive
    and the negative examples

18
Top-down induction of logic programs
  • Employs refinement operators
  • Typical refinement operators on a clause
  • Apply a substitution to clause
  • Add a literal to the body of clause
  • Refinement graph
  • Nodes correspond to clauses
  • Arcs correspond to refinements

19
Part of refinement graph
  • has_a_daughter(X).
  • has_a_daughter(X) - has_a_daughter(X) -
    has_a_daughter(X) -
  • male(Y). female(Y).
    parent(Y,Z).
  • has_a_daughter(X) - has_a_daughter(X) -
    has_a_daughter(X) -
  • male(X). female(Y),
    parent(X,Z).
  • parent(S,T).
  • ... ... ... ... ...
    has_a_daughter(X) -

  • parent(X,Z),

  • female(U).

  • has_a_daughter(X) -

  • parent(X,Z),

20
Progol Algorithm Outline
  1. From a subset of positive examples, construct the
    most specific rule rs.
  2. Based on rs, find a generalized form rg of rs so
    that score(rg) has the highest value among all
    candidates.
  3. Remove all positive examples that are covered by
    rg.
  4. Go to step 1 if there are still positive examples
    that are not yet covered.

21
Scoring hypotheses
  • score(r) is a measure of how well a rule r
    explains all the examples with preference given
    to shorter rules.
  • pr number of ve examples correctly deducible
    from r
  • nr number of -ve examples correctly deducible
    from r
  • cr number of body literals in rule r
  • score(r) pr (nr cr)

22
Applications of ILP
  • Constructing Biological Knowledge Bases by
    Extracting Information from Text Sources (M.
    Craven J. Kumlien) Craven99
  • The automatic discovery of structural principles
    describing protein fold space (A. Cootes, S.H.
    Muggleton, and M.J.E. Sternberg) Cootes03
  • More from UT-ML group (Ray Mooney)
  • http//www.cs.utexas.edu/ml/publication/ilp.html

23
Example of relation from text
  • Sample sentence from biomedical articles
  • We want to extract the following relation

24
References
  • Quinlan93 J. R. Quinlan, R. M. Cameron-Jones.
    FOIL A Midterm Report. Proceedings of Machine
    Learning ECML-93
  • Muggleton95 S. Muggleton. Inverse Entailment
    and Progol. New Generation Computing Journal,
    13245-286, 1995.
  • Craven99 M. Craven J. Kumlien (1999).
    Constructing Biological Knowledge Bases by
    Extracting Information from Text Sources. ISMB
    99.
  • Cootes03 A. Cootes, S.H. Muggleton, and M.J.E.
    Sternberg. The automatic discovery of structural
    principles describing protein fold space. Journal
    of Molecular Biology, 330(4)839-850, 2003.
Write a Comment
User Comments (0)
About PowerShow.com