Embodied Machines - PowerPoint PPT Presentation

About This Presentation
Title:

Embodied Machines

Description:

Embodied Machines The Grounding (binding) Problem Real cognizers form multiple associations between concepts Affordances - how is an object interacted with – PowerPoint PPT presentation

Number of Views:84
Avg rating:3.0/5.0
Slides: 20
Provided by: Sumnicht
Learn more at: http://grammar.ucsd.edu
Category:

less

Transcript and Presenter's Notes

Title: Embodied Machines


1
Embodied Machines
  • The Grounding (binding) Problem
  • Real cognizers form multiple associations between
    concepts
  • Affordances - how is an object interacted with
  • Frames - Background structure against which
    concept is understood -- sometimes highly complex
    (Educational system, family relationships)
  • Emotions - witnessing event/seeing object
    conjures up emotional states
  • Mental simulation - comprehending language may
    trigger imagistic modeling of event based on
    experience

2
Embodied Machines
  • Mouse
  • Mammal, Small, furry, grey to brown, long
    whiskers, cats like to play with them and then
    eat them, theyre used in experiments, ladies
    stand on chairs when theyre around, they squeak,
    theyre prolific breeders, theyre sold live as
    snake food, theyre one kind of rodent, they look
    a lot like rats, they are sometimes pets, they
    like to run on a wheel
  • Play
  • The opposite of work, its fun, kids do it,
    scheduled in during grade school, you play games,
    you play with words,

3
Embodied Machines
  • Approaches to meaning construction
  • NLP
  • Text/speech is considered comprehended when
    parsed syntactically, and when word meanings have
    been assigned
  • Meaning is pre-determined by humans in some way
  • Embodied approach
  • World has no structure until body begins to
    interact in it
  • Need goals sensorimotor system
  • Experience --gt meaning
  • Words map onto meaning

4
Embodied Machines
  • Steels talking heads
  • Simple robots
  • Auditory visual systems
  • Motivating goal language game
  • Simple environment
  • 2 dimensional world containing objects
  • Robots determine their own categories for objects
  • Robots determine their own labels for categories
  • Robots and environment are real physical entities

5
Embodied Machines
  • Cangelosi Parisi
  • Virtual agents, virtual world
  • A kind of embodied learning
  • Agents have physical location, orientation,
    movement capabilities within their environment
  • Agents consume mushrooms which affects their
    energy status
  • Agents (collectively) have a motivating task --gt
    increase fitness of species
  • They sense perceptual characteristics, not
    mushrooms --gt they learn which characteristics
    describe real vs. poisonous mushrooms
  • Agents (collectively) learn to categorize and
    label mushrooms

6
Embodied Machines
  • CELL (Deb Roy)
  • Cross channel Early Lexical Learning
  • Models embodied language learning using input
    that approximates input to human infants
  • Instantiated in robot body with microphone/camera
  • CELL learns to form word meaning correspondences
    from raw (unsegmented) audio and visual input

7
Embodied Machines
  • First Task
  • Segmentation
  • Audio stream parsing into segments
  • Video stream parsing into objects
  • Segmentation process produces channel of words
    and channel of shapes
  • Second Task
  • Build a lexicon by identifying frequently
    co-occurring pairs of audio visual segments

8
Embodied Machines
  • Illustrative example (not from actual data)
  • Imagine an utterance
  • dont throw the ball at the cat
  • Uttered in a scene containing these identified
    objects
  • (Noise present)

9
Embodied Machines
  • Objects not necessarily identified in same order
    as named in utterance
  • Time delays between utterance and object
    recognition highly likely

throw the ball at the
cat
10
Embodied Machines
  • Short term memory (STM) look at a temporal
    window surrounding each word
  • Aim is to go back or forward far enough in time
    to have the word and referent in same window

throw the ball at the
cat
Short term memory
11
Embodied Machines
  • Window marches through data stream collecting
    segmented objects and words for possible mapping

throw the ball at the
cat
Short term memory
12
Embodied Machines

throw the ball at the
cat
Short term memory
13
Embodied Machines

throw the ball at the
cat
Short term memory
14
Embodied Machines
  • Audio and visual segments that have a high degree
    of mutual informationare likely semantically
    linked and should be saved in long term memory
    (LTM)

Objects Words

Ball 5
Cat 6
The 40 50
?Unique occurrences

57
100
90,000
?unique 59 116
15
Embodied Machines
  • Mutual information
  • MI P(ab) ? co-occurrence (ab)
  • ------------- ----------------------------------
    -
  • P(a) P(b) occurrence (a) occurrence (b)

P (cat ) 40/(100 59) 0.0067
P (the ) 40/(90,000 59)
0.0000075
Words like the are promiscuous. They co-occur
with so many categories, they lack predictive
power.
16
Embodied Machines
  • Two implementations of CELL
  • Robot
  • Learning from observing Infant/Caregiver
    interaction

17
Embodied Machines
  • Robot
  • Input spoken utterances and images of objects
    acquired from video camera mounted on robot
  • Experimenter places objects in front of the robot
    and describes them
  • Acquisition of lexicon
  • Robot gathers visual information about
    environment while listening to speech (discovers
    high MI pairs)
  • Speech generation
  • Search for objects in environment then describe
  • Speech understanding (maps word to object)

18
Embodied Machines
  • Learning from infant-caregiver interaction
  • Infants played with 7 classes of objects
  • Balls, shoes, keys, toy cars, trucks, dogs,
    horses
  • Care-giver/infant interaction was natural
  • CELL attempted to build up lexicon from observing
    these interactions
  • Segmentation accuracy (segment boundaries
    correspond to word boundaries?)
  • Word discovery (segments correspond to single
    word?)
  • Semantic accuracy (if word segmented properly, is
    it properly mapped to an object?)

19
Embodied Machines
  • Segmentation accuracy 28 (compared to 7 for
    acoustic only model)
  • Word discovery 72 of segmented items were
    single words (compared to 31 for acoustic only
    model)
  • Semantic accuracy 57 of hypothesized lexical
    candidates are both valid words and were linked
    to semantically relevant visual categories
Write a Comment
User Comments (0)
About PowerShow.com