Constraining movement imitation with reflexive behavior: Robot squatting

Constraining Movement Imitation With ReflexiveBehavior: Robot Squatting

Andrej Gams, Tadej Petric, Jan Babic, Leon Zlajpah, Ales UdeJozef Stefan InstituteLjubljana, Slovenia

Email: {andrej.gams, tadej.petric, jan.babic, leon.zlajpah, ales.ude}@ijs.si

Abstract—Direct imitation of human movement with a hu-manoid robot, which has a similar kinematic structure, doesnot guarantee a successful completion of the task because ofdifferent dynamical properties. Our research starts by showinghow to apply a generalization algorithm to extract the desiredmovement primitives from multiple human demonstrations. Theemphasis of the paper is on a method that constrains theextracted movement primitives when mapping them to a robot,taking into account a critical criterion of the task. As a practicalexample we study the stability of a robot, which is determinedthrough a normalized zero-moment-point. Our approach is basedon prioritized task control and allows direct movement transferas long as the selected criterion is met. It only constrains themovement when the criterion approaches a critical condition. Thecritical condition thus triggers a reflexive, subconscious behavior,which has higher priority than the desired, conscious movement.We demonstrate the properties of the algorithm on a real, human-inspired leg robot developed in our laboratory.

I. INTRODUCTION

Movement imitation is one of the approaches of transferringhuman movement to robotic mechanisms [1]. In the paperwe show how we can constrain the recorded movement whenmapping it to a robot, in order to maintain a chosen criterion.The criterion is in our case the stability of the robot of ahuman-inspired leg robot.

Different kinematic and dynamic properties of humans androbotic mechanisms do not allow direct transfer or mapping ofmovement from one to the other [2]. The resulting movementof the robot will most likely not accomplish the same task. Forexample, recorded joint movement of humans when squattingwill, if directly copied to a humanoid robot, result in therobot tipping over. The movement has to be somehow changedor constrained to account for the difference in the kinematicstructure and the dynamic properties.

One of the approaches is to transfer the task space move-ment, for example, the movement of the tip of the arm,thus completely ignoring the joint space movements of thedemonstrator. In such case the joint movement is under thecontrol of an inverse kinematics algorithm. Null-space motionand prioritized task control can be used to, for example, adddesired joint movement as a secondary task. While the primarytask is end-effector movement in task space, the secondarytask, if possible, maintains similar poses as the demonstrator[3]. Another common use of the secondary task is to im-plement obstacle avoidance [4]. Such prioritized task spacecontrol is also referred to as Operational space control [5].

In our approach we change the formulation of the primaryand the secondary task, so that the desired movement of therobot is in fact a secondary task. The primary task is onlyobserved if we approach a pre-defined threshold. While farfrom the threshold, the algorithm allows direct control ofseparate joints. Upon approaching the threshold, the primarytask smoothly takes over and only allows joint control in itsnull-space. The algorithm allows smooth transition in bothways – between observing the primary task with the secondaryin its null space, or just the unconstrained secondary task.This allows unconstrained joint movement while not closeto the threshold. As such, it mimics the human reflexivemechanism of retaining the stability of the posture. Similarlyto the proposed algorithm, the proprioceptive and vestibularsystems of the human body trigger the reflexes that ensurepostural stability during an arbitrary motion [6], [7].

We demonstrate the approach on a stability task, where weperform squatting movements on a planar leg robot, which hassimilar kinematical properties as a human in the sagittal plane[8]. We use a normalized zero-moment point (ZMP) locationas the criterion of the stability. Control of the stability criteriononly takes over when it approaches the threshold, and eventhen the transition is smooth. Otherwise, we use direct jointcontrol of the mechanism. The demonstrated trajectories, apartfrom the offsets, are not modified in advance.

A similar manner of robot control with two differenttasks, where their desired joint velocities blend together, waspresented by Sugiura et al. [9]. Contrary to the proposedmethod the authors used a blending coefficient that mixestwo ”primary” tasks, each with its own null-space task. Ourproposed method, on the other hand, allows continuous tran-sition between a primary and a secondary task. The work byMansard et al. [10] provides a general solution for integrationof unilateral constraints. The authors use an activation matrixwhich modifies the Jacobian of the task.

To acquire the demonstrated trajectories of movement forthe desired task we recorded several executions of chosenmovements, for example squatting. As human movement cannever be completely reproduced in timing and in movementitself, representative movement or movement primitives haveto be somehow extracted. We use a generalization algorithm[11] based on a regression technique to acquire joint movementprimitives for the demonstrated tasks. We modified the algo-rithm to evenly weight separate trajectories and thus extract

https://www.researchgate.net/publication/224446040_A_Unified_Approach_to_Integrate_Unilateral_Constraints_in_the_Stack_of_Tasks?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/221063360_Modeling_and_Control_of_Multi-Contact_Centers_of_Pressure_and_Internal_Forces_in_Humanoid_Robots?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/4297024_Real-time_collision_avoidance_with_whole_body_motion_control_for_humanoid_robots?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/224175918_Task-Specific_Generalization_of_Discrete_and_Periodic_Dynamic_Movement_Primitives?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/3972380_Kinematic_Control_Algorithms_for_On-Line_Obstacle_Avoidance_for_Redundant_Manipulators?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/22010174_Locomotion_in_vertebrates_central_mechanisms_and_reflex_interaction_Physiol_Rev?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/222536159_Programming_full-body_movements_for_humanoid_robots_by_observation?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/12943499_Is_imitation_learning_a_route_to_humanoid_robots?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/245030862_Human_sensorimotor_learning_for_humanoid_robot_skill_synthesis?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/245513646_A_Biarticulated_Robotic_Leg_for_Jumping_Movements_Theory_and_Experiments?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

primitives of movement, encoded in the dynamic movementprimitive (DMP) framework [12], [13].

The rest of the paper is organized as follows. In section twowe first show how we extract and encode motion primitivesfrom the demonstrations trough generalization. In sectionthree we present our approach on how to constrain the jointmovement of the robot with reflexive behavior and show howto calculate of the ZMP criterion. In section four we presentour leg robot and give results of real-world experiments.Conclusions and discussion are given in section five.

II. MOTION PRIMITIVES

A. Data acquisitionUsing motion capture we measured the movement of several

subjects performing different tasks. We used 3D investigatormotion capture equipment from NDI. Our measuring set-upconsisted of three position sensors, where each of these sensorsconsists of three infrared cameras. We used active markers.Fig. 1 shows the leg robot, developed at our laboratory, and asilhouette of a person, side by side in the sagittal plane. We cansee that the kinematical structure of the planar mechanism anda human are similar in this plane. A dynamical model fromSimMechanics is presented in the right. The placement of the

Fig. 1. Similarity in kinematical structures of the leg robot and a person. Thedots on the silhouette show the location of the markers for motion capture. Adynamical model from SimMechanics with marked centers of mass (COM)for the body, thigh and shank, is shown on the right. Most stable point (MSP)and the least stable points (LSP) are marked as well.

markers for motion capture is conditioned by the kinematicalstructure of the robot. In our case we could acquire jointmovements of the hip, the knee and the ankle using 5 markers,as shown in Fig. 1. These were attached to the demonstratorat these joints, plus a marker on the foot and one on the body.Using these 5 markers we can calculate the joint angles by

Φ = arccosaTb

‖a‖‖b‖, (1)

where Φ is the joint angle and a,b are vectors defined byadjacent markers.

B. Movement Primitives

We extract movement primitives for separate joints from aset of recordings, and encode them in the dynamic movementprimitive (DMP) framework [12], [13]. A set of equations

τ z = αz(βz(g − y)− z) + f(x), (2)τ y = z, τ x = −αxx, (3)

where

f(x) =

∑Ni=1 wiΨi(x)∑Ni=1 Ψi(x)

x, (4)

Ψi(x) = exp(−hi (x− ci)2

), (5)

defines what is known as a discrete DMP. In (2) – (5),αz = 4βz > 0, g is the goal of the movement, x is thephase of the movement, N is the number of kernel functions,Ψ are the kernel functions, wi are the weights of the kernelfunctions, hi is their width and ci is equally distributed onx. The details on the derivation and the use of DMPs are in[12], [13], [14]. Once we have collected movement data, wehave sets of movements for each task. One set of movementsconsists of several recordings of the same movement.

The recordings are of different length, because humans can-not reproduce exactly the same movement for every executionof the task. Movement of different subjects also varies andbesides that also depends on the marker placement. To extractthe primitives of separate movements we use a generalizationalgorithm [11]. The algorithm can be used to generalize fromdifferent demonstrated trajectories, for example from trajecto-ries of reaching to different points or throwing to differenttargets [11]. For the algorithm to work, the demonstratedtrajectories have to have similar characteristics, for example,all reaching trajectories have to be straight (or all have to becurved, etc.). The result of the generalization algorithm is forsuch a case a straight trajectory to a given target location -query point.

For the case of generalizing amongst trajectories that alldescribe the same movement, we had to change the algorithmto put the same weight on all demonstration trajectories. Addi-tionally, the target for the generalization must be representativeof the movement. We chose the average final position of thejoints. This final position is set as the goal g of the DMP, whichencodes the generalized motion primitive. The timing of themovement primitive is also a result of the generalization.

The generalization algorithm adjusts the weights of theDMP, i.e. wi from (3), to best fit the measured data.We need triplets of position, velocity and acceleration{yd(tj), yd(tj), yd(tj)} , j = 1, ..., T , where tj are the sam-pling times. In our case the data was obtained by motioncapture and is in joint space. Equations (2) and (3) can berewritten in a single equation by replacing z with τ y to get

τ2y + αzτ y − αzβz(g − y) = f, (6)

where f is defined as in (4) and (5). We thus get

F (tj) = τ2yd(tj) + αzτ yd(tj)− αzβz(g − yd(tj)), (7)



https://www.researchgate.net/publication/226942004_On-line_learning_and_modulation_of_periodic_movements_with_nonlinear_dynamical_systems?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/5920301_Dynamics_systems_vs_optimal_control_-_a_unifying_view?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==



https://www.researchgate.net/publication/3955685_Movement_imitation_with_nonlinear_dynamical_systems_in_humanoid_robots?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==



f =

F (t1)...

F (tT )

,w =

w1

...wN

, (8)

and obtain the following set of linear equations

Xw = f , (9)

which needs to be solved to estimate the DMP describing thedesired motion. For discrete movements, as is our case, thematrix X is

X=

Ψ1(x1)N∑

i=1

Ψ1(x1)

x1 ... ΨN (x1)N∑

i=1

ΨN (x1)

x1

... ... ...Ψ1(xT )

N∑i=1

Ψ1(xT )

xT ... ΨN (xT )N∑

i=1

ΨN (xT )

xT

. (10)

The parameters w can be calculated with the above systemof differential equations in a least-squares sense. Such batchapproach is common for discrete movements.

All sampled trajectory points included in the library ofdemonstrated movements can be used. The optimal parametersw can be calculated from the available training data by solvingthe following regression problem:

C(w) =

M∑k=1

||Xkw − fk||2 (11)

with respect to w.Figure 2 shows motion capture and generalization results

for squatting. Thin lines show the collected data. The gen-eralized trajectories are presented with dashed lines. Markerplacement has proved quite an issue, because the recordedtrajectories have too much of an offset at initial conditions. Weonly present the results of measurements with similar initialconditions in all the trials. The generalization algorithm suc-cessfully deals with different lengths of separate demonstrationmovements. It is important to note that the generalization canonly be successful for similar movements, so all squattingmovements have to be performed in the same manner. In ourcase the heel had to be on the ground all the time.

III. CONSTRAINING MOVEMENT WITH REFLEXIVEBEHAVIOR

In this section we show how we can constrain the recordedmovement when mapping it to the robotic mechanism, in orderto maintain a given criteria. In our case the criteria is thelocation of a simplified Zero-Moment-Point (ZMP).

A. Calculating the zero moment point

When humans do squats with the feet together, or when do-ing vertical jumps, large forces act on a narrow (short) supportplane, thus reducing the area of stability. Stability is one of themajor problems in performing movements of humanoid robots.In the case of our leg robot we are dealing with stability inthe sagittal plane (forward-backward). Stability in the lateralplane (left-right) is in our case ensured by a wide foot and therobot is planar and cannot move in the lateral plane.

0 0.5 1 1.5 2 2.5 3 3.5 40

1

2

3

hip

[rad]

0 0.5 1 1.5 2 2.5 3 3.5 40

1

2

3

4

knee

[rad]

0 0.5 1 1.5 2 2.5 3 3.5 4

1.4

1.6

1.8

2

t [s]

ankle

[rad]

Fig. 2. Motion capture results for squatting. Recorded data is presented withsolid lines and the generalized primitives are presented with dashed lines.

Zero-Moment Point (ZMP) was defined in robotics as astability criterion by Vukobratovic and Borovac [15]. In short,it is defined as the point on the ground about which the sumof all the moments of the active forces equals zero [16].

The model of our leg robot, as shown in Fig. 1, can berepresented as a triple inverted pendulum, with most of themass in the upper segment - the body. Such structure isinherently unstable. The robot is stable when the forces onthe foot are in the center of the support polygon - the foot.We denote this position as the MSP (Most Stable Point) asshown in Fig. 1. Contrary to that, the Least Stable Point (LSP)represents the point when the ZMP is on the very edge of thesupport polygon - either on the heels or on the tip of the toes,also depicted in Fig. 1. The more the ZMP approaches a LSP,the less stable is the robot. If the ZMP reaches any of theLSPs, the robot tips over.

To calculate the position of the ZMP of the leg robot,we need the dynamical model. We denote the i-th link withthe mass mi at the mass center point ri = [xi, yi, zi]

T

(relative to the inertial frame), inertial tensor Ii and with theangular velocity ωi. External forces and torques are denotedby Fi,k and Mi,j , where index k tracks all forces and indexj tracks all torques acting on i-th link. The overall rotationaland translational equation of the system in an arbitrary pointp = [xp yp 0]

T on the plain z = 0 is

∑i

{mi (ri − p)× (ri + g) + Iiωi + ωi × Iiωi}+C = Mp

(12)where

C = −∑j

Mj −∑k

(sk − p)× Fk. (13)

Here sk is the vector that points towards the position wherethe external force Fk acts on the robot, g is the gravity

https://www.researchgate.net/publication/220065796_Zero-Moment_Point_-_Thirty_Five_Years_of_its_Life?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/3299134_Planning_walking_pattern_for_a_biped_robot?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

acceleration vector (horizontal supporting surface) and Mp

is the resulting torque at the observed point p.In accordance to the ZMP definition [15], only the moment

Mp = [0 0 Mz]T acts at the point pzmp = [xzmp yzmp 0]

T ,and by ignoring the inertial tensor Ii [17], and with no externalforces acting on the robot, the components are

xzmp =

∑i

mi {xi (zi + gz)− zi (xi + gx)}∑i

mi (zi + gz), (14)

and

yzmp =

∑i

mi {yi (zi + gz)− zi (yi + gy)}∑i

mi (zi + gz). (15)

The relationship between the ZMP and the joint spacevelocity q is given by

uz = Jzq, (16)

where

Jz =

[∂yzmp

∂qi

](17)

and uz is the velocity of the ZMP in general, depending onthe task. In the case of the leg robot this becomes uz = yzmp.The Jacobian Jz is a function of the actual joint angles (i.e.the joint angles set at the robot’s joints at each given moment),and it essentially maps the velocity vector q in the joint spaceto the velocity of the ZMP.

In the next section we show how the Jacobian inverse isemployed to constrain the joint space movement as reflexivemovement in order to maintain the stability of the leg robot.

B. Constraining the movement with reflexive behavior

The task of the leg robot is to do squats, where the robotjoint movement is given by the extracted joint movementprimitives for squatting, i.e. to perform the recorded jointmovements. Performing the recorded squatting movement onlymakes sense if the robot maintains stability. Maintaining sta-bility is in this case the primary task. The issue is how to mapthe demonstrated movements (the extracted joint primitives)to the robot in joint space, and only change them when therobot would lose stability.

The difference to a classical approach is in the control ofthe ZMP. The movement of the ZMP is not defined and canfreely move along the supporting polygon, as long as it stayswithin a defined boundary. While within the boundary, thedesired joint movement (from the demonstration) is completelyobserved. As the ZMP approaches the defined boundary, whichis close to the LSP, the primary task takes over and only allowsjoint movement, which would not produce instability. As thismovement takes command only when necessary, it acts sortof reflexive, higher-priority behavior.

To control the stability we define the stability index as afunction of a normalized ZMP to the power of an odd numberby

zn =

(yzmp − yMSP

|yLSP − yMSP |

)2n−1

. (18)

The stability control is defined with

y = uz = Kpez = Kp (zd − zn) = Kp(−zn), (19)

because the desired (referential) normalized ZMP locationzd = 0. Kp is the proportional gain. The commanded jointspace velocity of the leg robot is defined with

q = J†zuz +KpN′zqn (20)

where qn = qn − q is the velocity that tracks the desiredtrajectory, J†z is the generalized inverse of (17) and matrixN′z is given by

N′z = diag(Nz) + |zn|(Nz − diag(Nz)). (21)

Here Nz is the null space matrix associated with Jz , given byNz = I− J†zJz .

When |zn| approaches 1, the zero moment point approachesthe boundary of the stability region and matrix N′z takes theform N′z ≈ Nz . Hence in this case the secondary task onlygenerates movements in the null space of the stability task.We can still move any arbitrary degree of freedom, but onlyif that motion does not affect stability control, i. e. the motionof zn toward the desired value zd = 0. This is the classicnull space control. Power n in Eq. (18) controls how closeto the boundary of the support polygon we allow the ZMP tomove before triggering stability control. If n is large, then znis close to zero over a large portion of interval −1 ≤ zn ≤ 1,whereas a smaller n triggers stability control earlier.

On the other hand, while |zn| remains close to zero, wecan allow trajectory tracking even if the resulting movementinterferes with stability, since in this case the zero momentpoint is safely within the boundaries of the support poly-gon. Nevertheless, we multiply the desired velocity qn byN′z ≈ diag(Nz), which has the effect that the robot followsthe desired trajectory more accurately with the degrees offreedom that affect stability less. To prove that this is indeedthe case, we first show that coefficients of diag(Nz) are alwaysnonnegative. This is due to the properties of Moore-Penrosepseudoinverse [18], which make JzJ

†z an idempotent matrix,

i. e. (JzJ†z)2 = JzJ

†z . It follows that

N2z = (I− JzJ

†z)2 = I− JzJ

†z − JzJ

†z +

(JzJ

†z

)2=

= I− JzJ†z = Nz, (22)

thus Nz is idempotent, too. Since Nz is also symmetric, thediagonal elements of Nz , denoted here by aii, are given by

aii = a2ii +

n∑j=1,j 6=i

a2ji ≥ 0. (23)

As explained above, JzJ†z is also an idempotent, symmetric

matrix, thus its diagonal elements bii ≥ 0,∀i, too. Conse-quently, aii = 1− bii ≤ 1 and in summary, 0 ≤ aii ≤ 1.

https://www.researchgate.net/publication/220065796_Zero-Moment_Point_-_Thirty_Five_Years_of_its_Life?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

https://www.researchgate.net/publication/223078428_Design_and_construction_of_a_series_of_compact_humanoid_robots_and_development_of_biped_walk_control_strategies?el=1_x_8&enrichId=rgreq-b0499adc-4264-4dec-9814-cde56f417a2b&enrichSource=Y292ZXJQYWdlOzIyMDcxODU1MjtBUzoxNTU2ODk3Nzk0NzAzMzZAMTQxNDEzMDc0NDY3OA==

If aii = 0, then it follows from Eq. (23) that aji = 0,∀j. Inthis case we have Nzei = 0, where ei is the i-th coordinatevector, i. e. ei = [0, . . . , 0, 1, 0, . . . , 0]T . This means that eiis orthogonal to the null space of Jz , thus any motion in thei-th degree of freedom directs the robot straight out of thestability polygon. For this reason we suppress the movementin the i-th degree of freedom if the i-th diagonal element of Nz

becomes small. Note that the suppression is only temporary;if the difference between the desired movement and the actualexecution becomes large, then the desired change qn = qn −q in (20) becomes large and the commanded motion startstracking the desired motion regardless as long as the ZMPremains within the stability polygon.

On the other hand, if aii = 1, then it follows from Eq. (23)that aji = 0, i 6= j. In this case we have Nzei = ei, whichmeans that ei belongs to the null space of Jz . Consequently,motion in the i-th degree of freedom does not affect thestability of the robot at all. It therefore makes sense to allowthe robot to freely follow the trajectory for such degreesof freedom. In summary, the multiplication by diag(Nz) isequivalent to weighting the degrees of freedom with a factorthat is inversely proportional to the amount to which themotion in each degree of freedom interferes with the primarytask. Since the elements of diag(Nz) are non-negative, theresulting controller always causes the robot to move in theright direction, but slows down the motion if it significantlyaffects stability control.

IV. EXPERIMENTAL EVALUATION

A. Leg robot

We developed a leg robot, which was inspired by theanatomic properties or the human body. The robot is presentedin Fig. 1. It is a planar robot composed of four segmentswhich represent the foot, shank, thigh and trunk of a human.The segments are connected together by rotational hinge jointswhose axes are perpendicular to the sagittal plane. Each jointis activated by a servo drive mounted inside the proximalsegment with regard to the joint. We use Maxon RE 40 DCservo drives. The gear ratio we chose is 1 : 8, which allowshigh torques, but squatting overloads the motors, so only afew squats can be performed at a time. The largest part of therobot weight is in the trunk segment where are the motor thatactivates the hip joint, the computer, motion controller and allpower amplifiers. Altogether, the robot weights ∼4.6 kg. Therobot is connected to an external power supply.

B. Squatting movement

We performed squatting movement with our leg robot. Thedesired joint movement is given with movement primitives,extracted from human demonstrations. One-to-one mappingof the movement to the robot, accounting for the offsets tocompensate the initial position of the robot, results in therobot tipping over. This is clearly shown in Fig. 4, where wecan see that unconstrained movement (red), moves the ZMPout of the area of stability. Reflexive movement (blue), onthe other hand, successfully maintains stability. Fig. 3 shows

3 3.5 4 4.5 5 5.5 6 6.5 7

−2

−1.5

−1

−0.5

0

t [s]

jointmovement[m

]

anklekneehip

Fig. 3. Joint movement during successful (solid) and unsuccessful (dashed)squatting. The difference in movement is small, but critical. In comparison tothe extracted primitives in Fig. 2, we had to add offsets to account for initialposition of the leg robot.

0 2 4 6 8 10 12

−0.02

0

0.02

0.04

0.06

t [s]

y zmp[m

]

standfall

Fig. 4. ZMP location, calculated from the model, during successful (blue)and unsuccessful (red) squatting. The dashed lines give the LSPs.

constrained (solid) and unconstrained (dashed) joint movementof the robot. We can see that the trajectories are very similar.Fig. 5 shows a sequence for successful squatting with reflexivebehavior in the top row, and unsuccessful squatting fromsimple one-to-one mapping in the bottom row.

Fig. 6 shows the comparison of the ZMP movement, cal-culated from the model, and the projection of the center ofgravity on the floor, measured with a Kistler force place, forthree consecutive squats. We can see that the model providesan accurate stability criterion.

V. CONCLUSION

In the paper we have shown how we can use a modifiedprioritized task control to implement reflexive movement ofa robot. Higher priority movement only takes over when thedesired movement approaches a given threshold, and thus actsas reflexive movement. We have shown how to apply thealgorithm to a leg robot performing squatting. The algorithmcan constrain the demonstrated movement to maintain stability.In the future we would like to implement the algorithm inthe lowest level of control of our mechanism, and try similarreflexive behavior in other scenarios. We have also shown howwe can effectively apply a generalization algorithm to extractmovement primitives from sets of recorded movements.

ACKNOWLEDGMENT

This paper was partially funded by Slovenian ResearchAgency grant Z2-3672. It has also received funding fromthe European Communitys Seventh Framework ProgrammeFP7/2007-2013 (Specific Programme Cooperation, Theme 3,Information and Communication Technologies) under grantagreement no. 269959, IntellAct.

Fig. 5. A sequence of still photos showing squatting movement. The top row show successful execution of the movement using reflexive movement. Thebottom row show execution of unconstrained demonstrated movement.

0 5 10 15 20 25 30−0.02

0

0.02

0.04

0.06

0.08

0.1

t [s]

y zmp[m

]

model measurement

Fig. 6. Movement of the ZMP, calculated from the model (blue), andsmoothed measured projection of the center of gravity on the floor, measuredwith a Kistler force plate. We can see resemblance in the shape of the twosignals.

REFERENCES

[1] S. Schaal, “Is imitation learning the route to humanoid robots?” Trendsin cognitive sciences, vol. 6, pp. 233–242, 1999.

[2] A. Ude, C. Atkeson, and R. M., “Programming full-body movements forhumanoid robots by observation,” Robotics and Autonomous Systems,vol. 47, no. 2-3, pp. 93–108, 2004.

[3] L. Sentis, J. Park, and O. Khatib, “Modeling and control of multi-contactcenters of pressure and internal forces in humanoid robots,” in Intelli-gent Robots and Systems, 2009. IROS 2009. IEEE/RSJ InternationalConference on, oct. 2009, pp. 453 –460.

[4] L. Zlajpah and B. Nemec, “Kinematic control algorithms for on-lineobstacle avoidance for redundant manipulators,” in Intelligent Robotsand Systems, 2002. IEEE/RSJ International Conference on, vol. 2, 2002,pp. 1898 – 1903 vol.2.

[5] O. Khatib, “A unified approach for motion and force control of robotmanipulators: The operational space formulation,” Robotics and Automa-tion, IEEE Journal of, vol. 3, no. 1, pp. 43 –53, february 1987.

[6] S. Grillner, “Locomotion in vertebrates: central mechanisms and reflexinteraction,” Physiol Rev, vol. 55, pp. 247 – 304, 1975.

[7] J. Babic, J. G. Hale, and E. Oztop., “Human sensorimotor learning forhumanoid robot skill synthesis,” Adaptive Behavior, 2011.

[8] J. Babic, L. Bokman, D. Omrcen, J. Lenarcic, and F. Park, “A biartic-ulated robotic leg for jumping movements : theory and experiments,”Journal of mechanisms and robotics, vol. 1, no. 1, 2009.

[9] H. Sugiura, M. Gienger, H. Janssen, and C. Goerick, “Real-time collisionavoidance with whole body motion control for humanoid robots,” in In-telligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ InternationalConference on, 29 2007-nov. 2 2007, pp. 2053 –2058.

[10] N. Mansard, O. Khatib, and A. Kheddar, “A unified approach to integrateunilateral constraints in the stack of tasks,” Robotics, IEEE Transactionson, vol. 25, no. 3, pp. 670 –685, june 2009.

[11] A. Ude, A. Gams, T. Asfour, and J. Morimoto, “Task-specific generaliza-tion of discrete and periodic dynamic movement primitives,” Robotics,IEEE Transactions on, vol. 26, no. 5, pp. 800 –815, oct. 2010.

[12] A. J. Ijspeert, J. Nakanishi, and S. Schaal, “Movement imitation withnonlinear dynamical systems in humanoid robots,” in Proc. IEEE Int.Conf. Robotics and Automation, Washington, DC, 2002, pp. 1398–1403.

[13] S. Schaal, P. Mohajerian, and I. A., “Dynamics systems vs. optimalcontrol – a unifying view,” Progress in Brain Research, vol. 165, no. 6,pp. 425–445, 2007.

[14] A. Gams, A. Ijspeert, S. Schaal, and J. Lenarcic, “On-line learning andmodulation of periodic movements with nonlinear dynamical systems,”Auton. Robots, vol. 27, no. 1, pp. 3–23, 2009.

[15] M. Vukobratovic and B. Borovac, “Zero-moment point - thirty five yearsof its life,” I. J. Humanoid Robotics, vol. 1, no. 1, pp. 157–173, 2004.

[16] Q. Huang, K. Yokoi, S. Kajita, K. Kaneko, H. Arai, N. Koyachi, andK. Tanie, “Planning walking patterns for a biped robot,” Robotics andAutomation, IEEE Transactions on, vol. 17, no. 3, pp. 280 –289, jun2001.

[17] T. Furuta, T. Tawara, Y. Okumura, M. Shimizu, and K. Tomiyama,“Design and construction of a series of compact humanoid robots anddevelopment of biped walk control strategies,” Robotics and AutonomousSystems, vol. 37, pp. 81–100(20), 30 November 2001.

[18] A. Ben-Israel and T. N. E. Greville, Generalized Inverses: Theory andApplications. New York: John Wiley & Sons, 1974.

Constraining movement imitation with reflexive behavior: Robot squatting

Documents