Muhammad Al-Nasser - PowerPoint PPT Presentation

About This Presentation

Title:

Muhammad Al-Nasser

Description:

King Fahd University of Petroleum and Minerals COE 584/484: Robotics Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting – PowerPoint PPT presentation

Number of Views:49

Avg rating:3.0/5.0

Slides: 33

Provided by: facultyKf

Category:

more less

Transcript and Presenter's Notes

Title: Muhammad Al-Nasser

1
Stochastic Optimization of Bipedal Walking using
Gyro Feedback and Phase Resetting
King Fahd University of Petroleum and Minerals
COE 584/484 Robotics

Muhammad Al-Nasser
Mohammad Shahab

March 2008 COE584 Robotics
2
Outline

Problem Definition
Physical Description
Humanoid Walking System
Feedback
Gyroscope
Phase Resetting
Stochastic Optimization
PGRL
Experimentation
Comments

3
Problem Definition

Authors
Felix Faber Sven Behnke, Univ. of Freinbrg,
Germany
Problem Statement
to optimize the walking pattern of a humanoid
robot for forward speed using suitable
metaheuristics

4
First Humanoid Robot!

1206 AD
Ibn Ismail Ibn al-Razzaz Al-Jazari
A boat with four programmable automatic musicians
that floated on a lake to entertain guests at
royal drinking parties!!

5
Problem Definition

Problems?

Sensor Noise Camera Gyroscope Ultrasonic Force
Inaccurate Actuators Motors
Nonlinear Dynamics i.e. complex system to
control
Environment Disturbances Unknown surface
6
Physical Description

Jupp, team NimbRo
60 cm, 2.3 kg
Pocket PC

7
Physical Description

Pitch joint to bend trunk
Each leg
3DOF hip
Knee
2DOF ankle
Each arm
2DOF shoulders
elbow

8
Humanoid Walking System

One Approach
Model-Based (Geometric Model)
Accurate Model
Solving motion equations for all joints (offline)
19 Degrees of Freedom
Nonlinear model equations
Computational complexity

Joints motor positions
Controller
Robot walks!
Leg Motion Trajectory
?s
9
Humanoid Walking System

2nd Approach

Joints motor positions
Controller
?s

Central Pattern Generators (CPG)
Sinusoid joint trajectory generated
Bio-Inspired
no need for model

10
Humanoid Walking System

Open-loop (no feedback) Gait
Mechanism
Shifting weight from one leg to the other
Shortening the leg not needed
Leg motion in forward direction

11
Humanoid Walking System

Open-loop Gait
Clock-driven, Trunk phase being central clock
Trunk Phase (with foot step frequency ? )
Right leg motion phase ?Trunk ?/2
Left leg motion phase ?Trunk - ?/2

?
?
time
-?
12
Humanoid Walking System

(continued)

?Leg
Kinematic Mapping
?Left
?Right
?
?Swing
?Foot
Human-Like Walking using Toes Joint and Straight
Stance Leg by Behnke
? Is leg extension
?Swing is leg swing amplitude
r Roll p Pitch y Yaw
13
Feedback

Overall Control System

Joints motor positions
Mapping
?s
Controller

Gyroscope ?Gyro Inclination (Balance) Angular
Velocity
Force Sensing Resistors foot touch ground
trigger (High or Low)

14
Feedback

Gyroscope
device for measuring orientation, based on the
principles of conservation of angular momentum
Remember Physics 101!

15
Feedback

P-Control
?Gyro increase robot fall
Proportional Control
reactive action proportionate to error (Error
sensor value desired value)
Desired values zero (i.e. no inclination)
Other Proportional-Integral Control
action proportionate to error and proportionate
to accumulation of error

Joints motor positions
?s
?Gyro
16
Feedback

Overall System

Joints motor positions
Mapping
?s
P-Control
17
Feedback

Overall System

Joints motor positions
Controller
?s
Online Adaptation (Stochastic Optimization)

Adaptive Control
Online tuning of parameters of the controller

18
Stochastic Optimization Approach

Goal
Adjust parameters to achieve faster and more
stable walk.
Fitness function (cost function) is used
to express optimization goals (i.e. speed
robustness)
f (.) RN---gtR
N number of parameters of interest

19
Stochastic Optimization Approach

The parameters are

Kinematic Mapping (Behnke paper)
20
Stochastic Optimization Approach

We evaluate f in a given set of parameters
x x1 , x2 , ... , xN (Table 1)
Now, how to find the values of the parameters
that will result in the highest fitness value?
use a metaheuristic method called PGRL

?
1
d ltdexp
21
Policy Gradient Reinforcement Learning (PGRL)

An optimization method to maximize the walking
speed
It automatically searches a set of possible
parameters aiming to find the fastest walk that
can be achieved

22
Policy Gradient Reinforcement Learning