Title: Motor Control
1Motor Control
2Beyond babbling
- Three problems with motor babbling
- Random exploration is slow
- Error-based learning algorithms are faster but
error signals are available in sensory
coordinates only - Real arms have two many degrees of freedom
3Degrees of freedom
Average position
4General Learning Principles
Y
Yd
Error
W
X
5General Learning Principles
Y
X
?
Error
?
W
X
6Forward Modeling
Hand Displacement
Arm
Reaching Motor command
Cortex Inverse Model
Desired Position of the hand
7Forward Modeling
Sensory Change
Plant
Motor command
Inverse Model
Desired Sensory Change
8Forward Modeling
(DSd- DS)
-
Sensory Change DS
Arm
Learning starts with desired change, not a
spontaneous movement.
Motor command (M)
Inverse Model
Desired Sensory Change DSd
9Forward Modeling
(DSd- DS)
-
Sensory Change DS
Arm
Motor command (M)
Major problem How do we compute
?
Inverse Model
Desired Sensory Change DSd
10Forward Modeling
(DSd- DSp)
-
Predicted Sensory Change DSp
Sensory Change DS
Forward Model
Plant
Motor command
Inverse Model
Desired Sensory Change DSd
11Forward Modeling
- We cant compute
- but we can use instead,
12Forward Modeling
- This works as long as
- This means that the forward model only needs to
be approximately correct -
13Forward Modeling
Predicted Sensory Change DSp
Sensory Change DS
-
(DS- DSp)
Forward Model
Plant
Motor command
Training the forward model with motor babbling
using prediction error
Inverse Model
Desired Sensory Change DS
14Forward Modeling
Predicted Sensory Change DSp
-
(DSd- DSp)
Forward Model
Motor command
Off line training of inverse model with predicted
performance error
Inverse Model
Desired Sensory Change DSd
15Forward Modeling
Predicted Sensory Change DSp
Sensory Change DS
-
(DSd- DS)
Forward Model
Plant
Motor command
On line training of inverse models using
performance error
Inverse Model
Desired Sensory Change DSd
16Forward Modeling
- Forward models can be used to predict the
consequences of motor commands - The prediction can be used to drive a linear
inverse model (e.g. Jacobian) since they are not
subject to long delays - The prediction can be subtracted from current
sensory input to compute the prediction error. - Errors can be used to improve prediction.
17Modular Approach
- Sensorimotor transformations are context
dependent, e.g., the force required to move an
object depends on the mass of the objects. - A central controller for all contexts might not
be feasible and would take too many resources - Alternative use a family of controllers and mix
them smoothly across contexts.
18Modular Approach
Inverse Model 1
X
or
19Modular Approach
- The weights of the inverse model are adjusted
according to
20Modular Approach
- To compute the responsibility, use the forward
models
Favors the model which makes the best prediction
at the previous time step
Forward Model 1
ut
1
-
X
21Modular Approach
- The weights of the forward model are adjusted
according to
22Modular Approach
- We can also add a responsibility predictor.
- where yt are the sensory cues used for the
predictions.
23Modular Approach
- The overall responsibility is computed according
to - plays the role of a likelihood
function, while is the prior.
24Controlling a trajectory
- The inverse problem (inverse dynamics)
- Ex Controlling the trajectory of a spaceship
25Controlling a trajectory
- The inverse problem (inverse dynamics)
- Ex a multi-joint arm (t(t) joint torques)
26Controlling a trajectory
- The inverse problem (inverse dynamics)
- Feedback models
- Feedforward models
- Equilibrium point models
27Controlling a trajectory
- The inverse problem feedback models
- Generate force according to the difference
between desired position and actual position
(easy to compute in linear systems). - Unstable because of sensory delays
- Works better if the actual position is
internally estimated by a forward model - Adapts on the fly to change of context
- Popular model of the oculomotor system
28Controlling a trajectory
- The inverse problem feedforward models
- Estimating torque and its differentials from a
desired trajectory is a nonlinear mapping - Use a basis function network to implement the
mapping - Subject to the curse of dimensionality
- Unable to adapt to change of context (unless you
add context units)
29Controlling a trajectory
- The inverse problem (inverse dynamics)
- Ex a multi-joint arm (t(t) joint torques)
30Controlling a trajectory
- Equilibrium models
- Basic idea the motor system only specifies the
muscle length that maintain the arm at the
desired location (in other words, it only
specifies the joint coordinates).
31Controlling a trajectory
- Equilibrium models
- Monkeys can reach accurately without
proprioceptive or visual feedback - If the arm is moved to the end point right at the
onset of a movement, it goes back to the starting
point and resume its trajectory. This implies
that trajectory are specified by moving the
equilibrium point
32Controlling a trajectory
- Equilibrium models
- How do you control trajectories? The motor system
may specify the trajectory of the equilibrium
point (virtual trajectory). Unless muscles are
very stiff, the actual trajectory will end up
being very different. This makes it difficult to
control trajectories very precisely.
33Controlling a trajectory
- There is an infinite number of trajectories
between two points. How do we choose one? Do you
have to specify one? - Minimum Jerk (Flash Hogan)
- Maximum accuracy (Harris Wolpert)
- Optimal control (Todorov Jordan)
34Controlling a trajectory
- Minimum Jerk
- Goal find q(t) minimizing C. The solution can be
found using calculus of variations.
35Controlling a trajectory
- Minimum Jerk
- Its unclear how Jerk is computed in the brain.
- No principled explanation for why the brain would
minimize such a quantity.
36Controlling a trajectory
- Maximum accuracy
- Hypothesis trajectory are optimized to maximize
accuracy - Assumption motor commands are corrupted by noise
with standard deviation proportional to mean
(this is different from Poisson noise!!) - Question for a fixed duration and amplitude of a
movement, whats the optimal control signal?
37Controlling a trajectory
- Maximum accuracy
- Large control signals lead to fast but inaccurate
movements - Small control signals lead to accurate but slow
movements. - Goal select the control signal that leads to
maximum accuracy for a given duration.
38Controlling a trajectory
- Maximum accuracy
- wt white noise with mean zero and variance kut2
39Controlling a trajectory
- Maximum accuracy
- Goal minimize covxt under the constraint that
Ext is equal the desired location for several
time steps after the end of the movement.
Quadratic problem.
40Controlling a trajectory
- Maximum accuracy
- Eye Movements
41Controlling a trajectory
- Maximum accuracy
- Arm Movements
42Controlling a trajectory
- Optimal motor control (Todorov-Jordan)
- Experts show a lot of variance in their movements
but high accuracy on end points - Indeed, there are directions in motor space that
induce no variance in end points, because of the
large number of degrees of freedom
43Controlling a trajectory
- Optimal motor control (Todorov-Jordan)
- Choose trajectory with maximum accuracy and
minimum effort
44Controlling a trajectory
Line where the constraint
is verified
x2
(X0)2
1St solution bring X to a value X such that
x1(X0)1, x2(X0)2
x1
(X0)1
45Controlling a trajectory
Additional noise
x2
(X0)2
1St solution bring X to a value X such that
x1(X0)1, x2(X0)2
x1
(X0)1
46Controlling a trajectory
x2
2nd solution To minimize effort, go to the
closest point such that x1x2X0
x1
47Controlling a trajectory
x2
2nd solution To minimize effort, go to the
closest point such that x1x2X0
x1
48Controlling a trajectory
x2
x2
x1
x1
49Controlling a trajectory
- Optimal motor control (Todorov-Jordan)
X1X2-X Task error
Less variability in solutions
More task error
X1-X2
50Open question
- How does the nervous system compute the solutions
to those optimization problems? - Is it done off line (i.e., does the CNS specify a
trajectory?) or on line? Attractor nets?