Abstract
BACKGROUND:
Automatic recognition of a person’s gender as well as his or her unilateral load state are issues that are often analyzed and utilized by a wide range of applications. For years, scientists have recognized human gait patterns for purposes connected to medical diagnoses, rehabilitation, sport, or biometrics.
OBJECTIVE:
The present paper makes use of ground reaction forces (GRF) generated during human gait to recognize gender or the unilateral load state of a walking person as well as the combination of both of those characteristics.
METHODS:
To solve the above-stated problem parameters calculated on the basis of all GRF components such as mean, variance, standard deviation of data, peak-to-peak amplitude, skewness, kurtosis, and Hurst exponent as well as leading classification algorithms including kNN, artificial neural networks, decision trees, and random forests, were utilized. Data were collected by means of Kistler’s force plates during a study carried out at the Bialystok University of Technology on a sample of 214 people with a total of 7,316 recorded gait cycles.
RESULTS:
The best results were obtained with the use of the kNN classifier which recognized the gender of the participant with an accuracy of 99.37%, the unilateral load state with an accuracy reaching 95.74%, and the combination of those two states with an accuracy of 95.31% which, when compared to results achieved by other authors are some of the most accurate.
CONCLUSION:
The study has shown that the given set of parameters in combination with the kNN classifying algorithm allows for an effective automatic recognition of a person’s gender as well as the presence of an asymmetrical load in the form of a hand-carried briefcase. The presented method can be used as a first stage in biometrics systems.
Introduction
Gait is one of the most complex, unconsciously performed, activities by human beings. It is a natural and common manner of getting about. For this reason, its measurement and analysis are carried out in a wide range, often completely unrelated, applications, such as medical diagnostics, rehabilitation, healthcare, human-machine interactions, or marketing [1].
Human gait may also be used in biometrics understood as the identification of a particular person. To improve security, in biometrics-connected applications recognizing gait, so-called soft biometrics like body height [2] or gender recognition are sometimes utilized. To identify a person’s gender on the basis of his/her gait signals gathered through the use of motion capture systems including dynamometric platforms [3, 4, 5], video camera [6], electromyography [1] as well as wearable sensors such as gyroscopes or accelerometers [7] are employed. In a paper [3] dealing with gender and age recognition characteristics identified through centers of pressure were used. The utilization of SVM classifiers allowed the achievement of accuracy reaching 99.65% concerning gender and 97.22% regarding age. The results were obtained from a sample consisting of 24 participants.
The recognition of the type of activity a person is engaged in is an oft-analyzed issue within literature [8, 9] and may be used in healthcare or to enhance security (e.g. in a video surveillance environment). Additionally, the problem of identifying one such everyday activity like walking with a briefcase (asymmetrical load) is addressed in several works connected with human gait recognition. This type of movement, in comparison with unencumbered walking, significantly changes a given person’s gait pattern [10]. When it comes to biometric systems, independent of physical parameters that are measured, the act of carrying a briefcase very negatively impacts the accuracy of a human gait recognition system [11]. Lv et al. [12] have shown that asymmetrical carrying of loads reduces gait symmetry and that a load carried in the right hand is characterized by a greater symmetry than when it is carried in a person’s left hand. Gait symmetry is inversely proportional to the level of loading. In [13] the impact of various ways of carrying backpacks by school-aged children on GRF and temporal characteristics was analyzed. Manners of carrying a backpack included walking without a backpack, with it hanging low on the child’s back, high on the back, and carried by the handle. When carrying backpacks children took smaller steps and walked at lower speeds with greater vertical ground reaction forces being measured than when walking without them (
The purpose of the present study was to present a method and set of parameters identified on the basis of elements of GRF that allow the recognition of the participant’s gender as well as the presence of asymmetrical loading. In addition, it was assumed that this method can be used as the first stage of human identification in biometric systems.
Materials and method
Signals
Measurements of GRFs made as part of this study were performed using two Kistler platforms with the dimensions of 60 cm
Features
To eliminate the impact of the duration of the support phase on the possibilities of comparing two gait cycles the following parameters were applied. These parameters were selected on the basis of the study by Derlatka and Borowska [15]:
Mean of the signal:
Variance of the signal:
Standard deviation of the signal:
Peak-to-peak (ptp) amplitude of the signal:
Skewness of the signal is computed as the Fisher-Pearson coefficient of skewness:
where Kurtosis of the signal:
Hurst exponent of the signal is calculated from the rescaled range and average over all the partial time series of length N:
where
It is also necessary to specify that signal features were calculated independently for each component of GRF and separately for each leg. Thus created input space consisted of a total of 42 parameters. Since the values of obtained parameters vary significantly from one another it becomes necessary to standardize them before classification using the following equation:
where:
An important role in the process of identifying a person or his/her gender as well as the activity which he/she may be doing, is played by the classifier. Within the presented solution it had been decided to test several well-known algorithms. Every classifier was trained for group of features listed in subsection 3.2 using 10 folds of cross-validation. Each time the same division of data into folds was utilized thanks to which results obtained by different classifiers were comparable. The quality of every classifier was determined by its accuracy. This represents the proportion of true positive results (both true positive as well as true negative) in the selected population.
K nearest neighbor (kNN)
The
Naive Bayes
The Naive Bayes classifier is based on the Bayes Theorem:
where:
In this classifier, it is assumed that each input variable is independent which is usually not true with respect to real data. However, despite this unrealistic assumption Bayes classifiers often produce good results.
Artificial neural networks consist of appropriately connected structures consisting of single artificial neurons. In ANNs signals provided to the input nodes are multiplied by values called weights connected to individual synaptic connections. The processing of information also occurs within the neuron itself. The training of an artificial neuron network comes down to the selection of weights in a way that an output of the ANN for a provided input signal is as close as possible to the desired value. Within the present work feedforward networks (MLPs) with no more than two hidden layers were used.
Linear discriminant analysis
Linear discriminant analysis (LDA) finds a linear combination of features that best differentiate between classes. Combinations of results are used as linear classifiers or to reduce the dimensionality of the input space. The present work made use of the regularized linear discriminant analysis described in detail in [17] where it is assumed that all classes possess the same covariance matrix:
where:
Support vector machines (SVMs) are a classification algorithm that builds a hyperplane separating two classes in such a way as to separate the classes by a maximum margin. SVMs most often use a nonlinear transformation based on a so-called kernel, which projects the original problem into a feature space with more dimensions. This new feature space makes it easier to find a solution.
Classification and regression trees
Classification and Regression Trees (CART) are binary trees with one-dimensional divisions. Within the node of the tree, a condition is created by verifying all possible divisions in points that are mid-points of segments between subsequent sorted
where:
Random Forest (RF) is a collection of many relatively simple decision trees. Random forests generally give better results than single trees. To make this advantage visible, some differentiation is used between generated trees. Differentiation can be provided by randomly selecting a training set for each tree. Such a set contains a strictly defined percentage of cases of the entire teaching set. Classification occurs similarly to other methods of combining ensemble classifiers. Majority voting is the most frequently chosen strategy.
The study group
The research was carried out at the laboratories of the Faculty of Mechanical Engineering of the Bialystok University of Technology on a sample of 214 people including 92 women and 122 men. The people taking part in the research were at ages 21.34
The research had been approved by the Bioethics Commission, Regional Medical Chamber in Bialystok, Poland and Bioethics Commission at Medical University of Bialystok.
Results and discussion
Accuracy of gender recognition of a participant, recognition of carrying a briefcase state as well as the combination of those two characteristics according to classifier
Accuracy of gender recognition of a participant, recognition of carrying a briefcase state as well as the combination of those two characteristics according to classifier
Table 1 presents the results of a given person’s gender recognition, probability of carrying a briefcase as well as a combination of those two characteristics simultaneously with values for selected classifiers. It should be stressed that several simulations permitting the discovery of optimum parameter values for each particular type of classifier were conducted. Table 1 contains the best of the obtained results. The impact of the number of neighbors and distance function has been shown in Fig. 1. The presented graphs prove that the selection of the right metric has a significantly higher influence on obtained results than the number of neighbors considered during the classification. It should be noted that in all kNN classifiers the tie-breaking algorithm uses the class with the nearest neighbor among tied groups.
Accuracy of recognition concerning the number of neighbors and distance metrics used for a) gender recognition, b) briefcase-carrying recognition, c) recognition of gender as well as briefcase carrying. Line colors for individual distances: Euclidean – red solid line (‘-’), city block – blue dashed line (‘- -’), Chebyshev – black dotted line (‘
The analysis of the results presented in Table 1 shows that the recognition of the gender of a considered person based on parameters identified through GRF is a relatively simple task. Results obtained were at over 90% of correct identification for most classifiers. The kNN classifier turned out to be the best (k
A similar issue was presented in [8] where 30 parameters were selected on the basis of such components as the vertical and anterior-posterior GRF additionally enriched with two temporal parameters. The authors had at their disposal a total of 64 parameters that were determined for the gait data of 15 people. These parameters were divided into 6 groups and the accuracy of gender recognition depending on the combination of parameter groups was analyzed. The classification utilized a feedforward neural network with two hidden layers. The most effective combination group of parameters allowed the correct recognition of gender with an accuracy reaching 94.03%.
Confusion matrix for gender recognition, kNN
Confusion matrix for gender recognition, kNN
Differentiation between people who are carrying a briefcase and those who are not turned out to be a much more difficult task (Table 1). Only two classifiers reached an accuracy of recognition exceeding 90% and all classifiers had worse results than when it came to the recognition of a person’s gender. In this respect, similarly to the previous consideration, the worst results were achieved by the Naive Bayes classifier while the best remained the domain of the kNN classifier (k
Confusion matrix for carrying briefcase activity recognition, kNN
Confusion matrix for carrying briefcase activity recognition, kNN
The difficulty of this task is strongly emphasized in the results reported by other authors. In [14] several different carrying status level instances including no carried object, objects being carried in the side middle region, in the side bottom region, in the front region, in the back region, in multiple regions, and carried objects with the position being changed from one region to another within period gait, were considered. The method proposed by the above work’s authors allowed the recognition of 76.8% of cases in which a person did not carry anything and 73% of cases where a person carried a load in a similar manner to the one being considered in the present work. It is worth mentioning that of all the stipulated carrying status levels those two were most often confused.
Video cameras were also used to record people’s walking patterns in [11]. The authors of this work were able to achieve 94.4% of correct recognition for people carrying a briefcase. This result was reached for 1,240 images of people (including 248 carrying bags) using a MLP. When it comes to the CASIA-B database the level of identification of a person reached at most 86.7%.
The simultaneous recognition of both gender and briefcase-carrying generally provided an even lower accuracy. Similarly to the results presented above the kNN (k
Confusion matrix for recognition of both gender and briefcase-carrying, kNN (W_CB – women with briefcase, W_nB – women without briefcase, M_CB – man with briefcase, M_nB- man without briefcase)
Confusion matrix for recognition of both gender and briefcase-carrying, kNN (W_CB – women with briefcase, W_nB – women without briefcase, M_CB – man with briefcase, M_nB- man without briefcase)
The analysis of the confusion matrix (Table 4) performed for the classifier that gained the best results in this task shows that misidentifications most often occur in differentiating between people who carry a briefcase and those who do not.
It should be said that the author of the present work is not aware of any other work that deals with the recognition of a unilateral load state based on the measurement of GRFs. In addition, the presented method is expected to be the first step in the biometrics systems so direct comparisons of the above results with those achieved by others should be done with a certain measure of caution.
The presented results have shown the importance of the GRF and parameters proposed for gender recognition, the recognition of the activity of carrying a briefcase by the handle, and the combination of those two characteristics. On this basis, the kNN classifier was able to achieve up to 99.37%, 95.74%, and 95.31% of correct recognitions respectively. Further work may involves a search for characteristics and classification algorithms (connected to, for example, deep learning or ensemble classifiers) that will allow the achievement of even better results. The other possibilities are connected with seeking a fusion with other measuring systems that could help to isolate unique information within considered phenomena.
Footnotes
Conflict of interest
The author does not have any conflict of interest to declare.
Funding
This work was financed by Bialystok University of Technology (project no. W/WM-IIB/2/2021).
