Companions: Explorations in Machine Personality - PowerPoint PPT Presentation

1 / 59
About This Presentation
Title:

Companions: Explorations in Machine Personality

Description:

I think it may all be a publicity stunt for Ellen. PROGRAM[10:20:01]Ellen Degeneres was one of them she was kissing her lover ... – PowerPoint PPT presentation

Number of Views:139
Avg rating:3.0/5.0
Slides: 60
Provided by: Nick159
Category:

less

Transcript and Presenter's Notes

Title: Companions: Explorations in Machine Personality


1
Companions Explorations in Machine Personality
  • Yorick Wilks
  • Computer Science, University of Sheffield
  • and
  • Oxford Internet Institute, Balliol College

2
The talk will not start in quite the place you
may expect but this (serendipitous) quote will
give you an idea of where we are going
..claim the internet is killing their trade
because customersseem to prefer an electronic
serf with limitless memory and no conversation.
(Guardian 8.11.03)
  • Kings College, London
  • 11 December, 2003

3
What the talk contains
  • Two natural language technologies I work within
  • Information extraction from the web
  • Human dialogue modelling
  • The new vision of the Semantic Web and its
    relation to
  • Ontologies
  • Agents
  • Conversational agents as essential for
  • personalizing the web
  • making it tractable
  • Companions for the non-technical as a cosier kind
    of agent

4
The underlying technology Human Language
Processing by computer (HLT or NLP)
  • Whose Research Issues are not solved, but..
  • It is generally agreed what the issues are
  • Information access issues (Semantic Web..)
  • Dialogue and interface issues
  • Information presentation issues
  • We shall touch only on the first two
  • Good working versions of these technologies exist
    now and are increasingly being deployed, though I
    cannot give any full evidence of that today.

5
(No Transcript)
6
(No Transcript)
7
  • What then is Information Extraction?
  • getting information from content of huge
    document collections by computer at high speed
  • looking not for key words but information that
    fits some template pattern or scenario.
  • delivery of information as a structured database
    of the template fillers (usually pieces of text)
  • the technology has now moved on to one based on
    machine learning (ML) rather than people writing
    these patterns down out of their heads.
  • it has fused with machine Question-Answering.
  • it is a technology created since 1990 by the US
    Defense Department

8
  • A new core text processing technology

9
IE a new core text processing technology
  • The Web text world is huge and expanding (now
    maybe 300 bn. pages in English alone)
  • Text will not be bypassed by computer speech as
    seemed possible a decade ago
  • Critical data will continue to come as text, and
    all the PAST is text
  • Think about processing Japanese company finance
    reports (all on the same day!) without IE.

10
The semantic web
  • Unexpected outcome of the text tagging movement
    from the Humanities! SGML--HTML--XML,
    VRML,voxML, anythingML)
  • Purpose is making the web comprehensible for
    software agents (Berners-Lees dentist example)
  • See this as a process of inverting IE doing
    markup now at author time not analysis time.
  • This is an old dream of AI/NLP revived that
    information be stored and accessed by CONTENT
    (not by text string or index).

11
The Semantic Web is now another name for
Knowledge Management
  • The Semantic Web (SW) will allow the Internet to
    consist not only of documents but also
    machine-readable structured data. Key application
    areas are
  • Business apps B2B and grid applications
  • Personal assistants to permit effective
    management of personal data, activities and
    interests
  • Agents/Personal assistants will display real
    power of the Semantic Web
  • Some analysts now argue that up to 50 of the
    value of companies is contained in knowledge
    intangibles that will find their way into its
    Knowledge Management structure.

12
Agents Interacting
  • Fundamental justification for the SW is for
    software agents to use machine-readable data in
    order to perform tasks
  • Scenario (from Berners-Lee et al.2001) for
    establishing physical therapy appointments

13
Ontologies are the structures of knowledge that
SW agents will consult
  • Each agent must have internal representation,
    i.e. ontology, knowledge base or data base
  • For each new piece of info, knowledge update or
    maintenance must take place
  • The system must determine the consequences of the
    change in state and this raises the Frame Problem
  • For agents to interact, events or new information
    trigger communication, BUT who should an agent
    communicate the info to? All? None? Some? If
    some, how does the system decide?

14
Ontologies are really just an old AI idea back in
a new form
  • A philosophical word pressed into duty to serve
    for what used to be called Knowledge
    Representation in AI, usually a form of applied
    logic.
  • The interacting agents of the Semantic Web bring
    back another old AI chestnut, the Frame Problem
    (remember spotting the aliens with questions
    about turtles in  Blade Runner ??)

15
Much novelty in the SW concept is classic
Artificial Intelligence
  • The logic representation and reasoning
  • The ontological hierarchies of knowledge
  • These are all part of the A I schoolman
    tradition remember they are not new and we still
    have no real proof they are NEEDED.
  • Accept them only with suspicion and wariness
    look how old they are---------

16
An early local AI man
  • John Wilkins (1614-1672)
  • A founder of the Royal Society
  • Book An Essay Towards a Real Character and
    Philosophical Language, London, 1668
  • An Ontology
  • A Thesaurus
  • A system of logical shorthand

17
(No Transcript)
18
The Need
  • Ontologies and Taxonomies are needed for
  • Ontologies for the Semantic Web
  • Central component for agent services over the
    Web
  • Knowledge acquisition for knowledge management
  • Minds of employees Intangible assets
  • Ontologies act as index to memory of an
    organisation
  • Many organisations have built or are building
    their own ontologies/taxonomies (e.g. BBC,
    British Council, Clifford Chance, etc.---the
    majority of major UK companies say yes, if asked
    if they are).

19
What is an ontology--in its current sense?
  • Widely-held Assumption knowledge can be codified
    in an ontology
  • Ontology formal explicit specification of a
    shared conceptualisation (Gruber)
  • a document or file that formally defines the
    relations among terms (Berners-Lee)

20
The way out?
  • May be to learn these representational objects
    from scratch, from text, like we do..
  • the only escape from the schoolmen??
  • But the labour to write them all, to describe all
    of medicine formally, or all the knowldge of
    Rolls Toyceis inconceivable
  • How to get ontologies from texts?

21
Text and Knowledge
  • We want to generate ontologies from text
  • But if an ontology shared/background knowledge,
    then a writer assumes the ontology to generate
    the text

Ontology
22
Background Knowledge is elusive in an explicit
form
  • If ontology background knowledge,
  • and background knowledge is implicit,
  • then the text(s) will not express the domain
    ontology
  • Less true of introductory textbooks, manuals,
    glossaries etc.
  • We expect to find specification of the
    ontological knowledge at the borders of a domain
  • Remember the old (John McCarthy, Stanford) AI
    goal of formally expressing COMMON SENSE
    knowledge, not expertise
  • But, NLP techniques can extract plausible
    ontologies from pedagogic texts.

23
Change of topic dialogue with machine agents
  • Leaving, for the moment, the internet and the
    semantic web of text, whose content has been
    made more explicit
  • By means of ontologies as knowledge
    representations, and
  • Whose services will be provided by
    agents---simple software packages with a
    purpose, like find me the best camera under 500.

24
Machine dialogue problems with theory
  • It can be vacuous dialogues are systems of
    turn-taking
  • Speech act analysis initially has led to
    implausibly deep levels of reasoning.
  • For some researchers, dialogue theory is still a
    question of how best to deploy logic

25
Two important historical systems have all the
modern traits and functionalities in miniature
  • Colbys PARRY (Stanford, 1971)
  • Perrault, Cohen, Allens speech act system
    (Toronto, 1979)

26
Colbys PARRY
  • Perhaps best ever performance, many users,
    robust, but not a normal subject (I.e. a
    paranoid)
  • primitive individual models, some control of
    dialogue process but it had lots to say!
  • Primitive simulation of intentionality
  • not syntax analysis but fast pattern matching
  • Far better than ELIZA

27
PARRY conversation
  • Have you been hospitalized before?
  • THIS IS THE FIRST TIME
  • How long have you been there?
  • ABOUT TWO WEEKS
  • Any headaches?
  • MY HEALTH IS FINE
  • Are you have memory difficulties?

28
More PARRY
  • NO
  • Then can you tell me who is the President of the
    US?
  • NIXON IS PRESIDENT
  • And what day is today?
  • TODAY IS THURSDAY--CAN WE GET ON WITH THE
    INTERVIEW?

29
How is this achieved in PARRY?
  • WHAT IS YOUR MAIN PROBLEM _____________________
    __________
  • WHAT BE YOU MAIN PROBLEM
  • BE YOU MAIN PROBLEM
  • WHAT BE MAIN PROBLEM
  • WHAT BE YOU PROBLEM ?
  • WHAT BE YOU MAIN

30
Perrault, Cohen, Allen system
  • Based on speech act reasoning
  • User must have one of two goals, meeting or
    catching a train
  • Passenger/User Do you know when the Windsor
    train arrives?
  • This is labelled as a REQUEST not a
    REQUEST-INFORM (Y/N) because the system knows the
    user knows it knows!

31
Perrault et al. At Toronto
  • System has domain knowledge and reasoning power
  • was the first to assign speech act labels to
    dialogue items
  • But speech act reasoning is often implausible
    Can you pass the salt?
  • It has a simple rigid model of nested belief
  • but virtually no performance

32
Fixed nested beliefs passengers view of
systems view of passengers beliefs.
passenger
system
passenger
33
1970s division of approaches to machine
conversation.
  • Domain-dependent systems with coded world
    knowledge and some parsing and reasoning VERSUS
  • Wide shallow systems with little knowledge and
    high performance
  • Published AI academic systems all in first group
  • Only the second group performed at all

34
Academic systems have moved towards performance
  • Best is Traums TRAINS system--descendant of
    Allens work (Toronto-Rochester tradition)
  • Semi-empiricist uses corpora but retains
    reasoning
  • gone to the movies in California at USC!!
  • Also TRINDIKIT at Gothenburg/Edinburgh uses a
    large rule base
  • Pressure from the Loebner competition to perform?

35
TRAINS CORPUSDavid Traum (Rochester)
  • utt1 s hello can I help you
  • utt2 u yeah I want t- I want to determine the
    maximum number of boxcars of oranges by
    seven a.m. tomorrow morning
  • utt3 so hm
  • so I guess all the boxcars will have to go
    through oran- through Corning because
    thats where the orange juice orange
    factory is
  • utt4 so from Corning to Bath how far is
    that
  • utt5 s two hours
  • utt6 u and its gonna take us also an hour to
    load boxcars right
  • utt7 s right
  • utt8 u okay so hm so every
    trip will take at least three hours
    then
  • utt9 um
  • utt10 s right we can unload any amount of
    cargo onto a train in one hour
  • utt11 so we can so a maximum of
    three boxcars in an hour

36
AI people join the performersthe Loebner
Competition
  • Well designed competition over the last ten
    years---essentially a modified Turing test.
  • Domain dependent entries till 1997, then domain
    free.
  • The game is for journalists to separate and rank
    human and machine partners
  • Search Loebner on web for full transcripts

37
Loebner 1994
  • NEW JUDGE
  • 150
    Judge 1
  • Do you think the country is headed generally in a
    more conservative or more liberal direction right
    now?
  • 151
    Terminal
  • Didnt you say would you discribe that old texas
    billionaire as a conservative?? Answer me one
    question why are liberals portrayed as donkees?
    Are you a caring person? Youre not really
    talking about me, are you?
  • 152
    Judge 1
  • Ive got the feeling Ive heard these answers
    before. Is our nation swinging leftward or
    rightward right now?

  • Terminal
  • We are at the beginning of a strong and
    wonderful economic recovery, and we can thank the
    liberals!

38
Sheffield University/Intelligent Systems won in
1997!
  • The CONVERSE program pretended to be Catherine, a
    34-year old British journalist living in New York
    (to explain any usage problems).
  • She could talk about 80 appropriate topics
  • (80domain free???) clothes, shopping, food,
    abortion, fitness, sex.etc.

39
Loebner Competition 1997---Sheffields winning
entry!
  • PROGRAM101804Hi, my name is Catherine, whats
    yours?
  • JUDGE04101832My name is Elliott.
  • PROGRAM101834Well, its nice to meet you,
    Elliott.
  • PROGRAM101848Did you see that story on CNN
    last night about the lesbian
  • PROGRAM101904couple who came out at a White
    House party on Sunday?
  • JUDGE04101955Yes, I did. I think it may all
    be a publicity stunt for Ellen.
  • PROGRAM102001Ellen Degeneres was one of them
    she was kissing her lover
  • PROGRAM102011in front of Bill Clinton. Do you
    think he was very
  • PROGRAM102025embarrassed?
  • JUDGE04102100I think he want the gay vote. He
    will do anything to get it and keep it.
  • JUDGE04102115He does not have a conservative
    base.
  • PROGRAM102121
  • PROGRAM102121Are you still a Friend of Bill
    Clinton?

40
The CONVERSE prototype 1997
  • Push-me-pull-you architecture
  • strong driving top-down scripts (80) in a
    re-enterable network with complex output
    functions
  • bottom-up parsing of user input adapted from
    statistical prose parser
  • minimal models of individuals
  • contained Wordnet and Collins PNs
  • some learning from past Loebners BNC
  • It owed something to PARRY, nothing to Toronto.

41
Why the dialogue task is still hard
  •  Where am I  in the conversation what is
    being talked about now, what do they want?
  • Does topic stereotopy help or are just
    Finite-State pairs enough (VoiceXML!)?
  • How to gather the beliefs/knowledge required ,
    preferably from existing sources?
  • Are there distinctive procedures for managing
    conversations?
  • How to learn the structures we need--assuming we
    do---and how to get and annotate the data?
  • Some of this is the general NLP empiricist
    problem.

42
Dimensions of conversation construction the
Sheffield view
  • Resources to build/learn world knowledge
    structures and belief system representations
  • Quasi-linguistic learnable models of dialogue
    structure, scripts, finite state transitions etc.
  • Effective learnable surface pattern matchers to
    dialogue act functions (an IE approach to
    dialogue)
  • A stack and network structure that can be trained
    by reinforcement.
  • Ascription of belief procedures to give dialogue
    act reasoning functionality

43
VIEWGENa belief model that computes agents
states
  • Not a static nested belief structure like that of
    Perrault and Allen.
  • Computes other agents RELEVANT states at time of
    need
  • Topic restricted search for relevant information
  • Can represent and maintain conflicting agent
    attitudes

44
VIEWGEN as a knowledge basis for
reference/anaphora resolutionprocedures
  • Not just pronouns but grounding of descriptive
    phrases in a knowledge basis
  • Reconsider finding the ground of
    that old Texas billionaire as
    Ross Perot, against a background of what the
    hearer may assume the speaker knows when he says
    that.

45
(No Transcript)
46
What is the most structure that might be needed
and how much of it can be learned?
  • Steve Young (Cambridge) says learn it all and no
    a priori structures (cf MT history and Jelinek at
    IBM)
  • Availability of data (dialogue is unlike MT)?
  • Learning to partition the data into structures.
  • Learing the semantic speech act interpretation
    of inputs alone has now reached a (low) ceiling
    (75).

47
Youngs strategy not like Jelineks MT strategy
of 1989!
  • Which was non/anti-linguistic with no
    intermediate representations hypothesised
  • Young assumes rougly the same intermediate
    objects as we do but in very simplified forms.
  • The aim to to obtain training data for all of
    them so the whole process becomes a single
    throughput Markov model.

48
(No Transcript)
49
There are now four not two competing approaches
to machine dialogue
  • Logic-based systems with reasoning (Old and still
    unvalidated by performance)
  • Extensions of speech engineering methods, machine
    learning and no structure (New)
  • Simple handcoded finite state systems in VoiceXML
    (Chatbots and commercial systems)
  • Rational hybrids based on structure and machine
    learning.

50
The Companions a new economic and social goal
for dialogue systems
51
An idea for integrating the dialogue research
agenda in a new style of application...
  • That meets social and economic needs
  • That is not simply a product but everyone will
    want one if it succeeds
  • That cannot be done now but could in six years by
    a series of staged prototypes
  • That modularises easily for large project
    management, and whose modules cover the research
    issues.
  • Whose speech and language technology components
    are now basically available

52
A series of intelligent and sociable COMPANIONS
  • The SeniorCompanion
  • The EU will have more and more old people who
    find technological life hard to handle, but will
    have access to funds
  • The SC will sit beside you on the sofa but be
    easy to carry about--like a furry handbag--not a
    robot
  • It will explain the plots of TV programs and help
    choose them for you
  • It will know you and what you like and dont
  • It wills send your messages, make calls and
    summon emergency help
  • It will debrief your life.

53
(No Transcript)
54
Other COMPANIONS
  • The JuniorCompanion
  • Teaches and advises, maybe from a backpack
  • Warns of dangerous situations
  • Helps with homework and web search
  • Helps with languages
  • Always knows where the child is
  • Explains ambient signals and information
  • Its what e-learning might really mean!

55
(No Transcript)
56
The Senior Companion is a major technical and
social challenge
  • It could represent old people as their agents and
    help in difficult situations e.g. with landlords,
    or guess when to summon human assistance
  • It could debrief an elderly user about events
    and memories in their lives
  • It could aid them to organise their life-memories
    (this is now hard!)(see Lifelog and Memories for
    Life)
  • It would be a repository for relatives later
  • Has  Loebner chat aspects  as well as
    information--it is to divert, like a pet, not
    just inform
  • It is a persistent and personal social agent
    interfacing with Semantic Web agents

57
Other issues for Companions we can hardly begin
to formulate
  • Companion identity as an issue that can be
    settled many ways---
  • like that of the owner--web identity now a hot
    issue?
  • Responsibilities of Companion agents--who to?
  • Communications between agents and our access to
    them
  • Are simulations of emotional behaviour or
    politeness desirable in a Companion?
  • Protection of the vulnerable (young and old here)
  • What happens to your Companion when you are gone?

58
Companions and the Web
  • A new kind of agent as the answer to a passive
    web
  • The web/internet must become more personal to be
    tractable, as it gets bigger (and more structured
    or unstructured?)
  • Personal agents will need to be autonomous and
    trusted (like space craft on missions)
  • But also personal and persistent, particularly
    for large sections of populations now largely
    excluded from the web.
  • The semantic web is a start to structure the web
    for comprehension and activity, but web agents
    are currently abstract and transitory.
  • The old are a good group to start with (growing
    and with funds).

59
The technologies for a Companion are all there
already
  • Remember Tamagochi?
  • Quite intelligent people rushed home to feed one
    (and later Furby) even though they knew it was a
    simple empty mechnaism.
  • And Tamaogochi could not even talk!
  • People with pets live longer.
  • Wouldnt you like a warm pet to remind you what
    happened in the last episode of Coronation
    Street?
  • OK, but perhaps millions of your compatriots
    would?!
Write a Comment
User Comments (0)
About PowerShow.com