Abstract
This paper investigates two fundamental structures for biped robots and a control strategy to achieve stable biped running. The first biped structure contains straight legs with telescopic springs, and the second one contains knees with compliant elements in parallel with the motors. With both configurations we can use a standard linear discrete-time state-feedback control strategy to achieve an active periodic stable biped running gait, using the Poincare map of one complete step to produce the discrete-time model. In this case, the Poincare map describes an open-loop system with an unstable equilibrium, requiring a closed-loop control for stabilization. The discretization contains a stance phase, a flight phase and a touch-down. In the first approach, the control signals remain constant during each phase, while in the second approach both phases are discretized into a number of constant-torque intervals, so that its formulation can be applied easily to stabilize any active biped running gait. Simulation results with both the straight-legged and the kneed biped model demonstrate stable gaits on both horizontal and inclined surfaces.
Introduction
Many researchers have investigated biped robot walking in both theory and practice (see survey in [1]), yet many open problems remain. A mechanical leg lacks the complexity of actuation and sensing of a human leg; thus, the physical design remains an area of investigation. A useful mechanism should be simple, robust and energy efficient. The design of an appropriate control system provides another challenge; the problem is highly nonlinear, open-loop unstable, and constrained by limited actuator power. Moreover, compliant elements that store and release energy (possibly essential for energy efficiency) result in complex dynamics requiring non-trivial control strategies. Contributions to this work include proposing a mechanism - a simple, efficient, and compliant biped - as well as demonstrating the applicability of linear control strategies.
Our understanding of efficient biped walking mechanics and dynamics has improved tremendously through the investigation of passive walkers ([2],[3],[4]) and recently the idea has been successfully extended to passive biped running [5]. Although passive robots can teach us a lot, practical robots need actuation. Two of the most important fully actuated robots are the new generation of Asimo humanoid that can run at 9 km/h [6] and the HRP-2LR biped that can walk and run on level ground, with the running pattern obtained using resolved the momentum control method and the ZMP concept [8]. However, these running motions include very short flight phases and do not come near the efficiency of human running [7] - perhaps because they lack (the efficient use of) compliant elements. The simplest compliant biped model - a spring loaded inverted pendulum - consists of a point mass and two massless springy legs [9], which produces similar ground reaction forces to those found in human walking and running [10]. It has been shown through both simulation and experiments that a compliant leg structure (with linear springs representing biarticular muscle tendons), with simple open-loop sinusoidal motor inputs, can produce natural ground reaction forces for a one-legged hopper [11] and a biped running robot [12]. The elastic coupling of limbs may make gaits faster and more human-like [13]. In order to achieve human-muscle characteristics, some have proposed using artificial pneumatic muscles [14], [15] and others have suggested series elastic actuators [16], [17].
Underactuated biped robots exhibit faster and more natural walking and running gaits [18]. McGeer [19] realized stable and unstable passive running gaits for his biped model with telescoping springy legs and massless arced feet, and stabilized unstable passive gaits with event-based LQR control on its linearized Poincare map. While biped models with arced feet have some stable passive running gaits, no multi-body biped model with point feet has been found to have a stable passive running gait. Hyon et al. [22] proposed a controller based upon an energy preservation strategy for a one legged hopping robot and then applied it to a planar biped robot with springy legs and a torso. Hu et al. [23] investigated unstable passive running gaits for a telescoping springy biped model with massless point feet, and stabilized them using event-based LQR control similar to McGeer [19]. These works were confined to biped models with massless springy legs on horizontal terrain that have passive running gaits, and could not be applied to real biped robot models that do not have a passive limit cycle. Hybrid zero dynamics (HZD) have been used to prove the existence and stability of active bipedal walking [20] and running [21] gaits for a five link four actuator kneed model. A reduced-order hybrid subsystem is used to design the controller.
Our original contributions include making the Poincare map event-based control more general by developing active (rather than just passive) maps of running gaits, which has allowed us to extend the applications to different types of robots running on horizontal and sloped terrains. Also, we generalize this control strategy for active running gaits with variable discretized control commands. Inspired by [23], [12] and [21], we choose two types of biped models to generate running gaits and stabilize them with the proposed control strategy for active gaits. The first system is a straight-legged telescoping springs biped with one active revolute joint at the hip while a second one has three joints at the hip and knees with motors parallel to a torsional spring in each revolute joint. The proposed multi-body biped models can generate running gaits even with simple constant control commands during each phase, which could be very interesting for this complicated task.
Dynamic Modelling
Straight-legged biped
We first investigate a biped with a hip and two prismatic spring legs (illustrated in a with parameters given in Table 1). We assume that a lower piston fits into an upper cylinder on each leg (with a spring in between). Points A and F are the robot's point feet (massless), C and D are the centres of mass (CoMs) of the cylinders, B and E are the CoMs of the respective pistons, and H is the hip point (with a point mass). We assume that the terrain slopes at a constant angle ϕ and our Cartesian X,Y coordinates describe absolute horizontal and vertical directions (relative to the Earth). We look only at running gaits, consisting of a stance phase (with one stance leg touching the ground and one swing leg) and a flight phase. A rotational motor in the hip provides actuation torque u. The prismatic joints are assumed to have actuators that can only lock or unlock the prismatic motion – locked just after take-off (once the leg has reached its nominal length) and unlocked just before, or at, touch-down. Thus, the prismatic joints are passive.
Nomenclature of the prismatic leg biped robot
Nomenclature of the prismatic leg biped robot
The generalized coordinates of the stance phase
The Lagrange method results in dynamic equations:
in which the Lagrangian function
The kinetic and potential energies follow from the geometric relationships between the positions of the robot points and their symbolic time derivatives. The governing equations of the stance phase, derived by symbolic software, are:
in which,
in which:
More details about the dynamic equations can be found in [29].
A take-off event consists of an instantaneous transition from stance to flight, occurring when the ground reaction force reaches zero. This event is detected when the spring of the stance leg reaches its free length or the ground reaction force reaches zero. The generalized coordinates
where subscripts s and f represent stance and flight respectively, and superscripts – and + indicate time instants just before and after the event, respectively, which results in:
The generalized coordinates of the flight phase
Assuming that the lengths of the legs are constant during flight, the dynamic equations of flight become:
with
To detect a touch-down event during numerical simulation, we have to find the first intersection of the trajectory of foot point D and the line y = xtanϕ. At touch-down, the velocities change instantaneously because of an inelastic impact. The angles do not have any instantaneous change, but our coordinate notation does change due to the other leg becoming the stance leg (b):
We have to utilize the conservation of angular and linear momentum in order to describe the impact map. We assume leg AH has a fixed length during the impact, whereas leg FH is free to compact (compressing the spring). Assuming a fully plastic contact, point F becomes an ideal pivot after contact.
Foot F receives an impulse force, one component of which is transferred along the axis of the prismatic joint to the hip. Using angular momentum conservation for the whole robot around F, linear momentum conservation of AHD in the direction of leg FH, and angular momentum conservation of leg AH around H, then the expressions below are conserved in terms of
where L denotes the linear momentum and H denotes the angular momentum. The three equations (7–9) have three unknowns
Equations (2, 4, 5, 6–9) constitute the overall hybrid dynamic model of running.
Studies on the energetics and kinematics of a spring-mass model show that compliant elements in the legs of a biped robot take an important role in both walking and running [12]. Although human legs have very complicated muscle-tendon neural control systems, they exhibit simple spring-like behaviour in running and some walking speeds [24]. A one-legged hopper with a springy passive knee achieves a stable hopping motion using a harmonic input for the hip motor [11]. To model a human-like compliant leg, we can use Hill-type muscles in the biped model, but it makes the model unnecessarily complicated. In order to simulate muscle compliance with a minimalistic model, we propose a biped model in which each joint comes equipped with a rotational spring parallel to a torque motor.
This model (Figure 2) consists of a point mass in the hip and two legs with a thigh and a shank, both of which have mass and moment of inertia. The model's lengths and masses model a typical human (). Inspired by the fact that human muscles change their stiffness during running [25], we discovered that changing the knee stiffness between stance and flight produces more efficient running gaits – the gait requires more torque in stance than in flight. We define the free angles of the torsion springs in B and D as q2 = −π/4, q4 = −π/4, and the free angle of the hip torsion spring as q3 – q1 = 0. The procedure in section 2–4 remains valid for this model.

(a) Stance phase generalized coordinates and (b) Touch-down parameters for the straight leg biped

Stance phase generalized coordinates and (b) Flight phase generalized coordinates for the kneed biped
The stance phase's generalized coordinates comprises q1 (the absolute angle of thigh BH), q2 (the angle of link AB relative to BH), q3 (the absolute angle of thigh DH) and q4 (the angle of link CD relative to DH), as shown in Figure 2a. The positive direction of the angles is counter-clockwise. By utilizing the Lagrange method, the stance phase dynamic equations become:
in which,
By defining the stance phase state vector as
We use Lagrange's impact model to find the collision map in touch-down:
in which D is the inertia matrix:
The principle of virtual work provides generalized impact modelling. In the following equations, qi denotes the i-th component of
and virtual work is denoted:
So, the generalized impacts become:
After touch-down, we change the notation for labelling the feet, which gives us 8 equations in terms of pre-contact and post-contact coordinates and velocities.
Assuming fully plastic contact, the post-contact position of the hip produced by differentiating the post-contact stance coordinates is:
Equations (12,18) and the 8 equations contain 16 equations with 16 unknowns
Straight-legged biped
For simplicity and to compare the results with [23], we assume massless feet and place the leg masses at points C and D. The only event that should dissipate (significant) energy during a running cycle is touch-down. However, massless feet make it possible to touch down without energy losses, resulting in passive periodic orbits. Thus, gait generation will consist of finding a set of initial conditions and control commands that can produce a periodic orbit for running. A Poincare map of one complete running step will serve as a convenient method of describing a periodic orbit. A complete running step includes stance phase, take-off, a flight phase and touch-down. We choose the post-contact state vector as the Poincare section, which is also the initial condition of the stance phase. Thus, the state vector at the beginning of a stance phase gives the Poincare map its input, and it outputs the next state vector at the beginning of the next stance phase (the next step). Although
We begin by finding a passive solution, where the torque at the hip remains zero at all times. The Poincare map for a passive gait is:
in which k is the number of steps. Any fixed point of the Poincare map indicates a periodic orbit of the overall dynamic model and provides a valid initial condition for a passive periodic running gait. To find the fixed point of the Poincare map, find the zero of the function:
Specifically, numerical optimization methods can find a solution that minimizes the 2-norm of (11). The fixed point of the map is:
At least one fixed point exists for any given running speed. We measure the running speed as the horizontal velocity of the CoM at the touch-down instance, and then add its difference from the desired velocity to the norm of the error function in the optimization routine.
A gait designed for an uphill run will require torques (i.e., an active gait). Moreover, we shall see that active gaits may also provide a better starting point (linearization) for our closed-loop control strategy. To find such active running gaits, we find a fixed point of the active Poincare map:
Vector
The active fixed point is:
Because the feet have mass in this model, no passive running gait exists on horizontal terrain. Again, the post contact state vector becomes a Poincare section. An active fixed point of the Poincare map:
will help generate gaits on horizontal or sloped terrains. Three motors provide torque - one in the hip and two in the knees. Again, the motor torques remain constant during each continuous time phase and, therefore, vector
Countless fixed points exist, each corresponding to a running speed. We find a fixed point that minimizes energy expenditure subject to the constraint of the swing leg remaining clear of the ground. The energy expenditure index will be introduced in the next section.
Assuming the 100% efficiency of the motors, the energy expenditure in one step is calculated:
in which u i denotes the motor torque of each joint, θ i is the angular velocity of the corresponding joint in the direction of u i , and tstep defines the time interval of one step of the gait.
The cost of transport (COT) – i.e., the consumed energy per total weight per distance travelled - provides a useful measure for energy expenditure comparisons:
in which L s and h s are the horizontal and vertical components of the stride, respectively [26].
The passive and active open-loop gaits found in the previous section are unstable; even tiny disturbances, like those due to truncated numerical calculations in simulations, will cause the joint trajectories to drift and result in the robot falling after a few steps. To investigate gait stability, consider the linearized Poincare map around the fixed point (26) or (27) as:
in which [
Now, instead of a hybrid nonlinear dynamic system (2, 4, 5, 6–9), we have a linear digital system (31) with an equilibrium point at the origin. This system will be stable, if all of the eigenvalues of matrix A are located inside the unit circle. A state feedback control:
stabilizes system (31). Linear control techniques can produce an appropriate control gain matrix
The nonlinear optimization needed to minimize (22) presents some practical difficulties. Since this complicated function contains the hybrid dynamic model with continuous and discontinuous time phases, in practice the optimization algorithm will often settle in a local minimum. To solve this problem, after finding each solution we re-initialize the optimization algorithm by rounding the last result, and repeat this procedure until reaching the desired tolerance. The presence of events in the dynamic model causes challenges in choosing an integration step size; we use a relatively large time step, with a maximum step size of 10 ms for the continuous-time phases, but a much smaller time step around the events, with the maximum step size in the order of 0.1 ms.
The phase diagram of leg AH for five steps of open loop passive running with a speed of 0.85 m/s is shown in Figure 3, in which point P is the start point, curve PQ corresponds to the stance leg, QR to flight, RS to the swing leg and SP to flight. Qualitatively speaking, its overall shape appears to be in agreement with the results in [23] for a simple biped model with only three point masses. Note that this passive limit cycle is unstable; the phase diagram diverges due to the (disturbance) error accumulation in numerical calculations, causing the biped model to fall down after running five steps (the red line exiting to the left in Figure 4b). Considering that points P and R correspond to the touch-downs in the limit cycle of Figure 3, one can see that there is no discontinuity in the velocities; therefore, in this gait touch-down occurs with no energy losses. However, in the last step before failure, point P changes to a vertical line with an instantaneous velocity change in touch-down, which causes energy dissipation. This small vertical line is recognizable in the figure by a blue point just beneath the red point in P. An energy-conserving touch-down occurs for biped models with massless springy feet when the velocity of the touch-down foot is in the direction of the touch-down leg [23]. This condition has been satisfied automatically because the calculated fixed point is passive and energy conserving. However, energy dissipation is inevitable at touch-down with biped robots with foot mass. The thick lines in Figure 5 representing this gait show that the total mechanical energy of the robot remains constant during the gait, serving to verify the validity of our dynamic equations and their solutions.

Phase diagram of leg AH of the straight leg biped model with an unstable passive running gait on horizontal terrain, starting from the fixed point

Stick diagram of one step of running on 5° sloped terrain with a velocity of 0.85 m/s and 30 ms time intervals between snapshots

Total mechanical energy of the robot for two steps of running (thick line for horizontal and thin line for sloped terrain)
The stick diagram of the resulting actuated (non-passive) gait running up a slope angle of φ = 5° and a horizontal velocity of 0.85 m/s is shown in Figure 4. For points P and R in this figure, the superscript ‘−’ represents the pre-contact state and ‘+’ represents post-contact. Note the discontinuities in velocities with resulting energy losses at touch-down. Due to the uphill motion, the total mechanical energy of the robot increases with each step (Figure 5, thin lines). Figure 6 shows the control effort for the generated gait, remaining constant in each phase. The COT for running gaits increases with the terrain incline in a linear manner (Figure 8).

Control effort for a two-step of running gait on 5° sloped terrain which is constant during each phase

Cost of transport vs. terrain slope for the straight leg robot

Phase diagram of leg AH, starting from an initial state deviated from the fixed point
The maximum eigenvalue of matrix A corresponding to the passive gait on horizontal terrain is 13.8, and for the active gait on 5° sloped terrain is 5.8+2.7i. So, although both of these gaits are unstable, the magnitudes of the eigenvalues are small enough that stabilization using discrete-time state feedback control constitutes a viable strategy.
By choosing a large constant of q = 4000 and Q = q I, along with a small constant r = 1 and R=r I, we design feedback controls for the two generated gaits. For the passive gait, the maximum closed-loop pole became 0.49 and for the active gait 0.59. This controller stabilizes the fixed points and the robot model displays a steady and hybrid dynamic model of both of the gaits around their stable running motion with the designed velocity.
Figure 8 and 10 show convergence with the limit cycles using control law (30), and Figure 9 and 11 show their control efforts, which approach the fixed point values of the gait and reach a steady state after about 15 steps. The shape of the passive limit cycle for running on horizontal terrain in Figure 8 is similar to the limit cycle of a biped model with three point masses and without legs inertias in [23], because both of the models have massless springy feet. Moreover, for both models the closed-loop control effort converges on zero as the system converges on a passive gait (Figure 9). The active running gait on sloped terrain has a different shaped limit cycle (Figure 10), which also has energy dissipation at touch-down (green line), and the control effort converges on the non-zero torque values of the active fixed point (Figure 11).

Control effort for 20 steps of closed-loop running on horizontal terrain

Phase diagram of leg AH for 20 steps, starting from an initial state deviated from the fixed point

Control effort for 20 steps of closed-loop running on 5° sloped terrain
It is desirable that there are small deviations of motor torques from the fixed point values. In these simulations, each component of the initial condition state vector is 1% deviated from the fixed point vector for the gait on horizontal terrain and 3% for sloped terrain. More deviations lead to divergence from the limit cycle and failure.
To generate an efficient running gait, the springs' rates should be adjusted for that speed. We found a set of values by trial and error that can produce a natural-looking gait with a speed of 10 m/s when compared to the fastest human running speed of 12.4 m/s [28]. A stick diagram of the generated gait on sloped terrain with an angle of 5° appears in Figure 12. The red curves indicate the trajectory of the CoM during flight, demonstrating a considerable range in flight. Although unstable, these gaits can continue open-loop running for about nine steps before falling down, appearing to have better stability characteristics than the telescoping leg biped model. The linearized Poincare map (31) for the gait on horizontal terrain has a maximum eigenvalue of 2.92 and for sloped terrain 3.63, quite suitable for the application of a linear state feedback controller (compared to 13.8 for the prismatic springy leg model). The use of the DLQR method produces matrix [K]6×8 and the control law (32) places the closed-loop maximum poles of the systems at 0.4034 and 0.333 + 0.182i for horizontal and sloped terrains respectively. This control law stabilizes the nonlinear hybrid system and generates a stable periodic running motion. When the initial condition is coincident with the fixed point of the Poincare map, the phase diagram of the closed-loop system remains on the limit cycle. Perturbing each element of the starting state by 2% from the fixed point, we observe initial deviations of the phase from the limit cycle with ultimate convergence within five steps (Figure 14 and Figure 16 for the horizontal and sloped terrains respectively). If we perturb only one component of the initial state vector by 10%, the closed-loop system will converge on the limit cycle as well (larger perturbations cause divergence in both cases). The figures demonstrate a local basin of attraction. The vertical green lines P and R correspond to touch-down with instantaneous velocity changes. Notice that the gait on the slope has a greater velocity discontinuity and larger energy losses than the horizontal gait. Furthermore, Figure 15 and Figure 17 show that bigger torque magnitudes are needed on the sloped terrain. Due to the disturbance in the initial conditions, the control torques have some fluctuations and converge on

Stick diagram of one step of running with a velocity of 10 m/s and 10 ms time intervals between snapshots, (a) on horizontal terrain (b) on 5° sloped terrain
Nomenclature of the kneed biped robot
In Figures 7 and 13, the dots show the values of the computed COT for the generated gaits on different terrain slopes, and a line is fitted to them. The generated running gait on level ground results in a cost of transport of 1.98 and increases with the slope of the terrain. Comparing the two figures, we conclude that the kneed biped model requires much larger COTs than the straight leg model – due to the fact that the straight leg biped model had conceptual massless springy feet and its gait was passivity-based. However, our kneed model presents a more realistic model, similar to a human leg structure. In the next section we will show that the cost of transport can be reduced with variable control torques during phases. In addition, note that the obtained COT points do not lie on a straight line in Figure 13, as in Figure 7; the straight leg model has only one motor in the hip and assuming a constant torque for it causes a unique gait for each slope and each velocity, whereas the kneed biped with three motors does not have a unique gait for each condition (we find that with minimum energy consumption).

Cost of transport vs. terrain slope for the kneed leg robot

Phase diagram of leg BH, starting from an initial state deviated from the fixed point

Control effort for 20 steps of closed-loop running on horizontal terrain

Phase diagram of leg BH for ten steps, starting from an initial state deviated from the fixed point

Control effort for 20 steps of closed-loop stable running on 5° sloped terrain
In the controller (30) for the linear system (29) investigated in the previous sections, the torques were constant during each stance and each flight phase; reducing the number of calculations needed yet restricting the energy efficiency of the gait. In this section, the control method is generalized for running gaits with variable motor torques to show the generality of the control strategy. Here, the motor torques are discretized into smaller time steps, but remain constant during each step. Due to the variation of motor torques, the stance and flight times will vary, requiring us to choose a number of steps that cover more time than the entire phase. Looking at the stance time interval 0.056 s and the flight time interval 0.104 s from the previous section, we choose a stance time step size of 0.02 s and a flight time step size of 0.04 s, with 4 time steps for each phase. The control vector is defined as:
such that it contains all the motor torque values for both the stance and the flight phase of one step. In this formula, s and f indicate the stance and flight phases, 3 is the number of motors of the robot and n and m are the number of the discretization of the stance and flight phases, respectively. Thus, the active fixed point similar to (27) has [
Cost of transport of different running biped models

(a) Control effort of one step of running on horizontal terrain with constant torques, COT = 1.98, (b) variable torques with 0.02 s and 0.04 s time steps for stance and flight, COT = 1.47, and (c) variable torques with 0.01 s and 0.02 s time steps for stance and flight, COT = 1.31.

Cost of transport vs. time step size in a variable torque gait

Stick diagram of one step of running with variable motor torques on horizontal terrain
To test the control algorithm we use a gait with 0.01 s stance time step size. Again the Poincare Map is linearized around the fixed point and with the control law of (32) for the linear discrete system (31), and matrix [K]3(m+n)x8 is found using DLQR method. Controller (32), when applied to the main system, generates control command for one step using the feedback of stance state in the post contact instance. Starting from a state 6% disturbed from the fixed point, the phase diagram of one leg of closed-loop running for 20 steps is shown in Figure 21 which shows convergence on a steady stable gait, and the control effort for ten steps is depicted in Figure 22.

Phase diagram of leg BH for 20 steps of closed-loop stable running on horizontal terrain with variable torques, starting from a point deviated 6% from the fixed point

Control effort for ten steps of closed-loop running on horizontal terrain
The maximum deviation of the initial conditions from the fixed point for which the controller of constant torque gait converges on the limit cycle is 2% while it is 6% for the gait with variable torque. So, the variable torque has the advantages of lower COT and better controllability, while its drawback is that it needs a greater number of calculations to find the fixed point and it can be more difficult to implement on a real robot than the event-based controller.
The dynamic equations of biped robot running were formulated and a numerical framework was proposed to derive and solve the equations for various biped models. It was shown that the proposed compliant kneed multi-body biped model can produce active running limit cycles with either constant or variable motor torques during each phase. This model has a variable stiffness – changing between flight and stance – for greater energy efficiency. A fixed point of a single-step Poincare map provided the basis to generate periodic running gaits. A discrete-time linear state feedback controller stabilized the linearized Poincare map around the fixed point. Simulations demonstrated that the controller can stabilize both telescoping-leg and kneed-leg models, on both horizontal and inclined surfaces. The limit cycles and the control command for the telescoping leg biped are qualitatively in agreement with the passive limit cycles in [23]. Our running gaits are not limited to passive gaits like in [19], [22] and [23], and our controller, being able to stabilize both passive and active running gaits, is much simpler than HZD [21] for active gaits. The HZD controller is a hybrid, with continuous-time feedback signals applied in stance and flight phases, and with discrete updates of controller parameters performed at the transitions between phases, while our controller is event-based, with just updates at the instance of post-touch-down. The HZD controllers are designed separately for the stance and flight phases so that the restricted Poincare map is stable, but our controller is calculated for the entire step using the linearized Poincare map around its active fixed point. After applying the controller to running gaits with constant motor torques during each (stance and flight) phase, in section 7 the motor torques were discretized using smaller intervals to show the generality of this controller. The controller formulation in section 7 should be able to stabilize any active biped running gait with variable discretized motor torques. Using more discretizations requires significantly more calculations to find the fixed point, but the controller consumes less energy.
Footnotes
9. Acknowledgments
We thank Mr S. J. Hasaneini for his useful comments.
