Top Banner
Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum and Minerals March 2008 COE584: Robotics COE 584/484: Robotics
32

Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Jan 02, 2016

Download

Documents

Diana Morgan
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Muhammad Al-NasserMohammad Shahab

Stochastic Optimization of Bipedal Walking using Gyro Feedback and

Phase Resetting

King Fahd University of Petroleum and Minerals

March 2008 COE584: Robotics

COE 584/484: Robotics

Page 2: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Outline1. Problem Definition2. Physical Description3. Humanoid Walking System4. Feedback

1. Gyroscope2. Phase Resetting

5. Stochastic Optimization1. PGRL

6. Experimentation7. Comments

Page 3: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Problem Definition

Authors Felix Faber & Sven Behnke, Univ. of

Freinbrg, Germany

Problem Statement: “to optimize the walking pattern of a

humanoid robot for forward speed using suitable metaheuristics”

Page 4: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

First Humanoid Robot!

• 1206 AD

• Ibn Ismail Ibn al-Razzaz Al-Jazari

• A boat with four programmable automatic musicians that floated on a lake to entertain guests at royal drinking parties!!

Page 5: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Problem Definition

• Problems?

Nonlinear Dynamics: i.e. complex system to control

Sensor Noise:CameraGyroscopeUltrasonicForce…

Environment Disturbances:Unknown surface…

Inaccurate Actuators:Motors…

Page 6: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Physical Description

• Jupp, team NimbRo

• 60 cm, 2.3 kg

• Pocket PC

Page 7: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Physical Description

• Pitch joint to bend trunk

• Each leg• 3DOF hip• Knee• 2DOF ankle

• Each arm• 2DOF shoulders• elbow

Page 8: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Humanoid Walking System• One Approach

• Model-Based (Geometric Model)• Accurate Model• Solving motion equations for all joints (offline)• 19 Degrees of Freedom• Nonlinear model equations• Computational complexity

ControllerLeg Motion

Trajectory

Joints motor positions

’s

Robot walks!

Page 9: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Humanoid Walking System• 2nd Approach

Controller

Joints motor positions

’s

• Central Pattern Generators (CPG)• Sinusoid joint trajectory generated• Bio-Inspired• no need for model

Page 10: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Humanoid Walking System• Open-loop (no feedback) Gait

• Mechanism1. Shifting weight from one leg to the other2. Shortening the leg not needed3. Leg motion in forward direction

Page 11: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Humanoid Walking SystemOpen-loop GaitClock-driven, Trunk phase being central

clockTrunk Phase (with ‘foot step frequency’ )

Right leg motion phase = Trunk + /2Left leg motion phase = Trunk - /2

time

-

Page 12: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Humanoid Walking System

• (continued)

Kinematic Mapping

Left

Right

Leg

Foot

yLeg

pLeg

rLeg

Leg

pFoot

rFoot

Foot

r: Rollp: Pitchy: Yaw

“Human-Like Walking using Toes Joint and Straight Stance Leg” by Behnke

Swing

Swing is leg swing amplitude

Is leg extension

Page 13: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Feedback•Overall Control System

Joints motor positions

’s

Mapping

Controller

1. Gyroscope: Gyro = Inclination (Balance) Angular Velocity

2. Force Sensing Resistors: foot touch ground trigger (‘High’ or ‘Low’)

Page 14: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Feedback•Gyroscope

– device for measuring orientation, based on the principles of conservation of angular momentum

– Remember Physics 101!

Page 15: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Feedback P-Control

Gyro increase = robot fall

• Proportional Control • reactive action proportionate to ‘error’ (Error = sensor value –

desired value)• Desired values = zero (i.e. no inclination)

• Other: Proportional-Integral Control• action proportionate to ‘error’ and proportionate to

accumulation of ‘error’

Joints motor positions

’s

Gyro

pGyro

p

rGyro

r

FootOldFootNew K

K

Page 16: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Feedback• Overall System

Joints motor positions

’s

Mapping

P-Control

Page 17: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Feedback• Overall System

Controller

Joints motor positions

’s

Online Adaptation(Stochastic Optimization)

• Adaptive Control• Online tuning of ‘parameters’ of the

controller

Page 18: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Stochastic Optimization Approach

• Goal:– Adjust parameters to achieve faster and

more stable walk.

• Fitness function (cost function) is used to express optimization goals (i.e. speed & robustness)

f (.): RN--->RN: number of parameters of interest

)(xf

Page 19: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Stochastic Optimization Approach

• The parameters are

Kinematic Mapping

(Behnke paper)

Page 20: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Stochastic Optimization Approach

• We evaluate f in a given set of parameters• x = [x1 , x2 , ... , xN] (Table 1)

• Now, how to find the values of the parameters that will result in the highest fitness value?– use a metaheuristic method called PGRL

?+1

d <dexp

Page 21: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Policy Gradient Reinforcement Learning (PGRL)

• An optimization method to maximize the walking speed

• It automatically searches a set of possible parameters aiming to find the fastest walk that can be achieved

Page 22: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Policy Gradient Reinforcement Learning

• How dose PGRL work?1st : generates randomly B test polices {x1, x2,…,

xB} • around an initially given set of parameter vector

• (where x = [x1 , x2 , … , xN])

– Each parameter in a given test policy xi is randomly set to

• where 1≤i ≤B and 1 ≤j ≤N• ε is a small constant value

jjj xorxx ,

Page 23: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Policy Gradient Reinforcement Learning

• 2nd: – the test policy is evaluated by ‘fitness

function’.

• For each parameter j is grouped into 3 categories

• Which are• depending on where the jth parameter is

modified by –ε, 0, +ε

jjj SorSS 0,

Page 24: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Policy Gradient Reinforcement Learning

• Next 3rd , construct vector a=[a1, a2, …, aN]

• As are average of each category

Page 25: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Policy Gradient Reinforcement Learning

• Then 4th (finally), adjust xπ as follows

where η is a scalar step size

Page 26: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Extension to PRLG

• Adaptive step sizeafter g steps:

where s: the number of fitness functions

evaluationsS: maximum allowed number of s

Page 27: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Overall

• Overall System

Controller

Joints motor positions

’s

PGRLxπ

Page 28: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Experiment

Page 29: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Results

Page 30: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Results

• speed is 21.3 cm/s

• fitness is 1.36

• Speed is 34.0 cm/s

• Fitness is 1.52

After 1000 iteration

Initial

60%

Page 31: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Parameters

Page 32: Muhammad Al-Nasser Mohammad Shahab Stochastic Optimization of Bipedal Walking using Gyro Feedback and Phase Resetting King Fahd University of Petroleum.

Glossary

• Stance leg: – the leg which is on the floor during the walk.

• Swing leg:– the leg which moving during the walk.

• Single support:– The case where robot is touching the floor with one

leg.

• Double support:– The case where robot is touching the floor with both

legs.