Abstract
This article concentrates on the problem of walking pattern generation and online control for humanoid robot. However, it is challenging and thus still remains open so far in the field of bipedal locomotion control. In this article, we solve this problem by proposing a bivariate-stability-margin-based control scheme, in which a random vector function-link neural networks mechanism is additionally contained. By utilizing opposition-based learning algorithm to generate walking patterns and designing random vector function-link neural networks for compensating the combination of zero-moment point error and modeling error, the new walking controller exhibits good performance. Moreover, a bivariate-stability-margin-based fuzzy logic system is proposed to assign a weight to each training sample according to locomotion stability. With these results, a walking control system is successfully established and experiments validate the proposed control scheme.
Keywords
Introduction
Research of walking control for humanoid robot has collected a great deal of attention over the past decades.1–8 Roughly, these approaches can be classified into two groups, namely, model-based method and stability-criterion-based method. The former considers the reconstruction of robot configuration while the latter is directly concentrated on the locomotion stability criteria. Kajita et al. 4 defined an inverted pendulum model and proposed a walking control method for biped robot with an energy conserving orbit. Ha and Choi 5 presented a walking pattern generation method such that the deviations from the desired trajectories can be eliminated with the use of the virtual height inverted pendulum model. In Kajita et al., 6 a biped walking pattern generation approach was developed based on cart-table model using preview control. Moreover, Sun et al. 7 further proposed an extended cart-table model, and an excellent locomotion stability was guaranteed. Note that the methods based on inverted pendulum model ignore the inertia moment of robot and suffer from an inaccurate dynamics model. Bessonnet et al. 8 raised a parametric walking approach based on connective-link model with the use of spline functions. In addition, Fu et al. directly established a gait planning system so that stable gaits were achieved.
To improve locomotion stability, stability-criteria-based methods were proposed. Zero-moment point (ZMP) is one of the most popular locomotion stability criteria, which was first proposed by Vukobratović and Borovac. 9 Some interesting results have been reported in the literature.10–12 In Lee et al., 12 a modified walking pattern generation algorithm was developed by allowing a variation of ZMP over the convex hull of foot sole. Subsequently, by modifying the foot landing position, Nishiwaki and Kagami 13 developed a walking control system to reinforce the control performance. In Erbatur and Kurt, 14 a ZMP-criterion-based approach was proposed to generate natural gaits with Fourier series approximation technique. In Shin and Kim, 15 an online gait synthesis algorithm was proposed using the allowable ZMP region, which extended traditional fixed ZMP concept and exhibited a good performance. Moreover, Hu et al. 16 have fused iterative learning control with k-NN regression algorithm to enhance the robustness performance. By combining polynomial regression technique with principal component analysis, an online walking pattern generation algorithm with a low number of parameters was further proposed in Gasparri et al. 17 Despite these contributions, the methods established in the literature13–15 have a common assumption that yaw moment can be ignored during robot walks. However, this assumption is not easy to be met in practical application. As pointed out in the literature,18–20 yaw moment would inevitably exist due to the fact that the components of robot don’t move in the same plane. Usually, the moment equilibrium in vertical direction holds on for the reason that yaw moment can be counteracted by the ground reaction force when the walking speed is slow enough. Yet, slippage may happen when the ground reaction moment failed to compensate the yaw moment.
To cope with the yaw moment compensation problem of humanoid robot, some pioneering works have been reported in the literature.21–24 In Yu et al., 21 a waist-rotation-based method was proposed to counteract the undesired yaw moment. As an extended research, Ugurlu et al. 22 further developed a yaw moment control scheme using Eulerian ZMP resolution such that rotational inertia of robot is considered. Subsequently, Xing and Su 23 presented a motion generation method to compensate for the yaw moment, by utilizing trunk spinning and arms swinging. However, the controllers constructed in the literature21–23 commonly ignored the high energy consumption problem, which is a key difficulty in practical application and may result in the relative short battery life. Until now, it is still a challenging problem to maintain the dynamic balance in both horizontal and vertical directions with energy efficiency.
Inspired by this observation, we are dedicated to deal with the control problem of humanoid robot. The control objective is to control the multi-degrees-of-freedom (DoF) motion of biped robot such that the yaw moment can be effectively counteracted while ZMP trajectory is kept within the allowable ZMP region. The novelties of this work are characterized as follows:
Differs from those schemes in the literature,21–23 the proposed one in this article provides a systematic control scheme, in which multi-joint coordinated control approach is included and a random vector function-link neural networks (RVFLNNs) mechanism is newly designed. By approximating the nonlinear mapping function between ZMP and joints positions/velocities, ZMP error can be compensated online.
A weighted bivariate-stability-margin optimization algorithm based on the fact that walking samples with good locomotion stability and energy efficiency need to be emphasized is proposed to deal with the uncertainties of training data.
Finally, we present the constructed the walking control system for humanoid robot. The walking experiments have been performed to validate our proposed scheme.
System description and some preliminaries
Overview of the humanoid robot BRZ-4
The task of bipedal locomotion is carried out by the humanoid robot “BRZ-4,” which contains leg links connected by 10 motorized joints as displayed in Figure 1. BRZ-4 is a half-sized humanoid robot designed and constructed at Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan, China.

Three-dimensional model for BRZ-4.
To facilitate the analysis, BRZ-4 is simplified as connective-link-model. Specially, thigh, shank, and trunk are simplified as connective links. As illustrated in Figure 1, BRZ-4 is 61.5 cm in height and weights about 1.436 kg. Specifically, there are five degrees in each leg, where
The configuration of BRZ-4.
Analysis of locomotion stability and bivariate stability margin
Foot sole is the only contact surface between humanoid robot and the ground during walking. Thus, the ground reaction force is a key factor to maintain the dynamic balance. As illustrated in Figure 2,

Forces acting on the supporting foot.
To achieve stable bipedal locomotion of humanoid robot, two aspects need to be considered, which are, respectively, horizontal moment balance and vertical moment balance. To maintain the horizontal moment equilibrium, ZMP criterion has been widely used. According to the literature,9,25,26 ZMP is a point
where
where
where
where
where
Walking control system design
One of the control objectives is to achieve stable bipedal locomotion. The main difficulties in the implementation of the proposed control scheme arise from two aspects: (1) modeling uncertainties and (2) maintaining the moment equilibrium in both horizontal and vertical directions. Compared with single-layer feedforward neural networks, RVFLNN provides an excellent generalization capability with fast learning property in classification and regression. In this article, an RVFLNNs mechanism is designed to compensate the combination of ZMP error and modeling error. Moreover, a bivariate-stability-margin-based optimized approach is integrated in the proposed RVFLNNs mechanism to deal with the data uncertainties, in which the samples with better stability performance are emphasized. Visual illustration of above control design is given in Figure 3.

Block diagram of the proposed control scheme.
Gait plan and optimization
To generate smooth and stable gaits, a spline-based parametric technique is adopted. The main idea is to approximate the coordinates of locomotion utilizing spline function fitted at key walking states uniformly distributed along the whole motion. Then, parametric optimization approach is designed to improve the locomotion stability. The procedure of parametric modeling of hip trajectory is listed as follows. At first, we define several parameters to describe hip joint position at specific motion time, as illustrated in Figure 4.

Kinematic parameters.
By analyzing the relationship between hip joint and ankle joint, we have
where
Similarly, the trajectories of knee and ankle joints can be obtained by the above steps. Till now, the procedure of gait planning is finished. However, it is usually not enough to generate stable gaits. Optimization method needs to be employed to find optimal parameters. To improve the locomotion stability, the following multi-objective optimization functions are established as follows
where
This is a typical quadratic programming problem. To find the optimal solution, an opposition-based particle swarm optimization method is adopted, which was proposed in Yang et al. 27 For the limitation of space, the specific implementation is omitted. For the details, please refer to Yang et al. 27
Bivariate-stability-margin-based RVFLNNs design
Obviously, ZMP error would exist due to modeling uncertainties and disturbances from environment. Equation (2) implies that ZMP is related to center of mass (CoM) position. In other words, ZMP can be modified by adjusting CoM position. However, the mapping between ZMP and CoM is not easily linearized. In addition, both ZMP and yaw moment are related to the position, velocity, and acceleration of each joint. How to compensate ZMP error by controlling robot is a difficulty we need to tackle. From the aforementioned observation, we have
where
where
Compared with traditional single-layer feedforward neural networks, in our algorithm, the weight matrix is randomly initiated with the proper dimensions. The input data set is given as
Moreover, in supervised learning tasks, the control performance usually suffers from data uncertainties and imbalanced data distribution. To remove this obstacle, we assign a weight to each sample according to bivariate stability margin, which are ZMP stability margin and yaw moment margin. Based on expert knowledge, the weight of each sample is assigned using fuzzy logic system (FLS). Based on the given expert knowledge, the FLSs rules are given as
where x and y denote ZMP stability margin and yaw moment margin, respectively;
where
where
Experiment and discussion
Mechanical setup
We have established a test bed in our laboratory for the validation of the proposed control approach, see in Figure 5. Roughly, the whole walking control scheme is composed of three parts: biped robot BRZ-4, the ground workstation, and onboard system, respectively.

Basic components of the test bed.
To control the motion of BRZ-4, the onboard system is equipped with a control unit Arduino MEGA 2560, a gyroscope. Control command can be transmitted to the onboard system via UART interface. Specifically, the online control algorithm runs on the onboard system. To alleviate the computing burden of the onboard system, offline training are implemented on the ground workstation, which comprised an Intel Core i5 processor, 8G DDR3, and 1 TB SSD disk. The mechanical structure of BRZ-4 is designed and implemented by three-dimensional (3D) print technique, which is equipped with 10 DYNAMIXEL MX-64-T motors and rechargeable battery.
Simulation and experimental results
To generate smooth and stable gaits, gait planning is the first task to be implemented. By utilizing the opposition-based particle swarm optimization algorithm proposed in Yang et al., 27 the simulation results of walking gait reference are presented in Figure 6, where red solid lines denotes the right leg and trunk, and blue dashed lines represent the left leg. As indicated in Figure 6, it is noted that the generated gaits are smooth and stable.

Simulation results of 3D walking.
The walking experiment is performed in our laboratory. To verify the control performance under different gaits, the whole walking procedure consists of three kinds of gaits, which are start gait, normal gait, and stop gait. The test bed provides a relatively ideal environment without many external disturbances, as given in Figure 5. It is worth mentioning that the parameter

Snapshots of walking experiment.
To evaluate bipedal locomotion stability, the evolutions of both ZMP trajectory and yaw moment are recorded in Figures 8 and 9. Specifically, in these figures, both x-axis and y-axis ZMP response trajectories are plotted in solid blue line and ZMP boundaries are illustrated in dashed lines. As demonstrated in Figures 8 and 9, all the x-axis and y-axis ZMP response trajectories are observed to be within ZMP boundaries. Thus, with our proposed control algorithm, horizontal moment equilibrium is achieved. Moreover, the comparison results on the yaw moment response trajectories with and without control are given in Figure 10. From this figure, it is apparent that the yaw moment is effectively counteracted. Compared with the case of without yaw moment compensation, the peak of yaw moment is reduced by 34.1%, which corroborates the effect of the proposed algorithm. Overall, in practical application, the bipedal locomotion stability is well guaranteed by the presented control scheme, as validated by the experimental results.

x-axis trajectory.

y-axis trajectory.

Yaw moment trajectory.
Conclusion
In this article, the problems of walking pattern generation and online control for humanoid robot are investigated. To overcome the difficulty arising from modeling uncertainties and external disturbance, an RVFLNNs-based control scheme is proposed such that ZMP error can be effectively compensated. Moreover, the learning weight is assigned to each training sample by developing an FLS. Subsequently, a bivariate-stability-margin optimization approach, which is additionally integrated in RVFLNNs mechanism, makes that the presented control scheme is more efficient. Finally, the experiment results validate our proposed scheme.
Footnotes
Handling Editor: Chenguang Yang
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Natural Science Foundation of China under grants 61573108, U1613223, U1501251, and U1613223; in part, by the Natural Science Foundation of Guangdong Province under grants 2016A030313715 and 2016A030313018; in part, by the China postdoctoral Science Foundation under grant 2018M633353; in part, by the Fundamental Research Funds for the Central Universities under grant ZYGX2016J140; in part, by the Scientific and Technical Supporting Programs of Sichuan Province under grant 2016GZ0395, 2017GZ0391, and 2017GZ0392; in part, by the Science and Technology Project of Guangdong Province under grant 2016A020220003; in part, by the Science and Technology Planning Program of Zhongshan City under grant 2017SF0603 and 2017A1024.
