Does the Brain Use Symbols or Distributed Representations? - PowerPoint PPT Presentation

1 / 43

About This Presentation

Title:

Does the Brain Use Symbols or Distributed Representations?

Description:

A typical property that. a particular object lacks. e.g., pine has leaves. An infrequent, ... Medial Temporal Lobe. Rogers et al (2005) model of semantic ... – PowerPoint PPT presentation

Number of Views:57

Avg rating:3.0/5.0

Slides: 44

Provided by: JayMcCl5

Category:

more less

Transcript and Presenter's Notes

Title: Does the Brain Use Symbols or Distributed Representations?

1
Does the Brain Use Symbols or Distributed
Representations?

James L. McClelland
Department of Psychology andCenter for Mind,
Brain, and ComputationStanford University

2
Parallel Distributed Processing Approach to
Semantic Cognition

Representation is a pattern of activation
distributed over neurons within and across brain
areas.
Bidirectional propagation of activation underlies
the ability to bring these representations to
mind from given inputs.
The knowledge underlying propagation of
activation is in the connections.

3
Development and Degeneration

Learned distributed representations in an
appropriately structured distributed
connectionist system underlies the development of
conceptual knowledge.
Gradual degradation of the representations
constructed through this developmental process
underlies the pattern of semantic disintegration
seen in semantic dementia.

4
Differentiation, Illusory Correlations, and
Overextension of Frequent Names in Development
5
(No Transcript)
6
The Rumelhart Model
7
The Training Data
All propositions true of items at the bottom
levelof the tree, e.g. Robin can grow, move,
fly
8
Target output for robin can input
9
Forward Propagation of Activation
10
Back Propagation of Error (d)
aj
wij
ai
di Sdkwki
wki
dk (tk-ak)
Error-correcting learning At the output
layer Dwki edkai At the prior layer Dwij
edjaj
11
(No Transcript)
12
(No Transcript)
13
Early Later LaterStill
Experie nce
14
(No Transcript)
15
Why Does the Model Show Progressive
Differentiation?

Learning in the model is sensitive to patterns of
coherent covariation of properties
Coherent Covariation
The tendency for properties of objects to co-vary
in clusters

16
Patterns of Coherent Covariation in the Training
Set

Patterns of coherent covariation are reflected in
the principal components of the property
covariance matrix of the training patterns.
Figure shows attribute loadings on the first
three principal components
1. Plants vs. animals
2. Birds vs. fish
3. Trees vs. flowers
Same color features covary in
component
Diff color anti-covarying
features

17
Illusory Correlations

Rochel Gelman found that children think that all
animals have feet.
Even animals that look like small furry balls and
dont seem to have any feet at all.
A tendency to over-generalize properties typical
of a superordinate category at an intermediate
point in development is characteristic of the PDP
network.

18
A typical property thata particular object
lacks e.g., pine has leaves
An infrequent, atypical property
19
A One-Class and a Two-Class Naïve Bayes
Classifier Model
20
Accounting for the networks representations with
classes at different levels of granularity
Regression Beta Weight
Epochs of Training
21
Overgeneralization of Frequent Names to Similar
Objects
goat
tree
dog
22
Why Does Overgeneralization of Frequent Names
Increase and then decrease?

In the simulation shown, dogs are experienced 10
times as much as any other animal, and there are
4 other mammals, 8 other animals, and ten plants.
In a one-class model, goat is a living thing
P(name is Dogliving thing) 10/32 .3
In a two-class model, goat is an animal
P(name is Doganimal) 10/22 .5
In a five class model, goat is a mammal
P(name is Dogmammal) 10/15 .67
In a 23 class model, goat is in a category by
itself
P(name is Doggoat) 0

23
(No Transcript)
24
Sensitivity to Coherence Requires Convergence
A
A
25
Inference and Generalizationin the PDP Model

A semantic representation for a new item can be
derived by error propagation from given
information, using knowledge already stored in
the weights.
Crucially
The similarity structure, and hence the pattern
of generalization depends on the knowledge
already stored in the weights.

26
Start with a neutral representation on the
representation units. Use backprop to adjust the
representation to minimize the error.
27
The result is a representation similar to that of
the average bird
28
Use the representation to infer what this new
thing can do.
29
Differential Importance (Marcario, 1991)

3-4 yr old children see a puppet and are told he
likes to eat, or play with, a certain object
(e.g., top object at right)
Children then must choose another one that will
be the same kind of thing to eat or that will
be the same kind of thing to play with.
In the first case they tend to choose the object
with the same color.
In the second case they will tend to choose the
object with the same shape.

30
Adjustments to Training Environment

Among the plants
All trees are large
All flowers are small
Either can be bright or dull
Among the animals
All birds are bright
All fish are dull
Either can be small or large
In other words
Size covaries with properties that differentiate
different types of plants
Brightness covaries with properties that
differentiate different types of animals

31
(No Transcript)
32
(No Transcript)
33
Similarities of Obtained Representations
Brightness is relevant for Animals
Size is relevant for Plants
34
Development and Degeneration

Sensitivity to coherent covariation in an
appropriately structured Parallel Distributed
Processing system underlies the development of
conceptual knowledge.
Gradual degradation of the representations
constructed through this developmental process
underlies the pattern of semantic disintegration
seen in semantic dementia.

35
Disintegration of Conceptual Knowledge in
Semantic Dementia

Progressive loss of specific knowledge of
concepts, including their names, with
preservation of general information
Overgeneralization of frequent names
Illusory correlations

36
Picture namingand drawing in Sem. Demantia
37
(No Transcript)
38
Proposed Architecture for the Organization of
Semantic Memory
name
action
motion
Temporal pole
color
form
valance
39
Rogers et al (2005) model of semantic dementia
40
Errors in Naming for As a Function of Severity
Simulation Results
Patient Data
Severity of Dementia
Fraction of Neurons Destroyed
41
Simulation of Delayed Copying

Visual input is presented, then removed.
After several time steps, pattern is compared to
the pattern that was presented initially.

42
IFs camel
DCs swan
Simulation results
43
Conclusion

Distributed representations gradually
differentiate in ways that allow them to capture
many phenomena in conceptual development.
Their behavior is approximated by a blend of
Naïve Bayes classifiers across several levels of
granularity, with the blending weights shifting
toward finer grain categories as learning
progresses.
Effects of damage are approximated by a reversal
of this tendency degraded representations
retain the coarse-grained level knowledge but
loose the finer-grained information.
We are currently extending the models to address
the sharing of knowledge across structurally
related domains, Ill be glad to discuss this
idea in response to questions.