Sports Motion Recognition Using MCMR Features Based on Interclass Symbolic Distance

Abstract

Human motion and gesture recognition receive much concern in sports field, such as physical education and fitness for all. Although plenty of mature applications appear in sports training using photography, video camera, or professional sensing devices, they are either expensive or inconvenient to carry. MEMS devices would be a wise choice for students and ordinary body builders as they are portable and have many built-in sensors. In fact, recognition of hand gestures is discussed in many studies using inertial sensors based on similarity matching. However, this kind of solution is not accurate enough for human movement recognition and cost much time. In this paper, we discuss motion recognition in sports training using features extracted from distance estimation of different kinds of sensors. To deal with the multivariate motion sequence, we propose a solution that applies Max-Correlation and Min-Redundancy strategy to select features extracted with interclass distance similarity estimation. With this method, we are able to screen out proper features that can distinguish motions in different classes effectively. According to the results of experiment in real world application in dance practice, our solution is quite effective with fair accuracy and low time cost.

1. Introduction

Recognition of human motions and gestures is commonly studied in sports field so as to serve movement analysis, guidance, and evaluation. Since direct observation costs too much human resources, information techniques are introduced to assist the tasks. Video and photography are used to record human movement, and there are quite a lot of researches in analyzing the visual information [1]. For more precise analysis, 3D modeling technology [2, 3] is also used to build 3-dimensional model of movement of human body. These methods provide accurate report of human motions and gestures, but for ordinary students and exercisers, they may not afford such devices. Meanwhile, the application of these visual devices is not convenient due to the restriction of space. As for the development of smart mobile terminals, wearable sensors are popularized in many fields, such as medical and health services [4–6] as well as sports domain [7, 8]. Take running, for instance, we are able to detect the running speed, trajectories, and the calories we burn during the exercises. Most of these sensing data are obtained by microelectromechanical systems (MEMS), which is the technology of very small devices. It merges at the nanoscale into nanoelectromechanical systems (NEMS) and nanotechnology [9]. MEMS sensors can be embedded in smart phone, watch, wristband, and other wearable objects, even clothes and shoes. These sensors are portable and low cost, which is quite acceptable for the public. Therefore, to assist sports training of students, we collect the motion data with MEMS sensors for certain action recognition. Although quite a lot of contributions have been made in this domain, most of them focus on regular activity recognition with repeated motions. Recent studies in gestures recognition usually involve time consuming similarity comparison in testing phase that might be a problem in recognizing long time complex motions.

In this paper, we attempt to find a fast and accurate way to recognize certain sports movements with wearable sensors. Our job includes motion data collection and preprocessing with multiple motion sensors embedded in MEMS; feature extraction and selection; and class determination of the unlabeled data. In this process, we may face the following difficulties and problems: (i)

Selection and fusion of multiple sensors.

(ii)

Similarity matching between multivariant motion series.

(iii)

How to improve the efficiency for online detection.

As there is no clear guidance for sensor selection, the choices are various in different studies. Normally, activity recognition would choose more sensors and related statistics to generate different features, while recognition based on distance metric usually chooses few key sensor data obtained after dimensional reduction for the comparison. In our study, we will discuss different situations with single or multiple sensors. Since the data we obtain come from different sensors and DoF, we need to find an effective way to deal with this multidimensional data. Moreover, finding an efficient solution for online motion recognition based on distance metric that considers the mutual distance relationship information will be another important task. As a result, we propose a flexible and interpretable solution using interclass distance features from the mutual distance between testing data and different classes. With Max-Correlation and Min-Redundancy strategy, we choose the nonredundant and distinguishable features and apply them in traditional classifiers. After careful observation of related experiments, we find that DTW based on SAX would be a good trade-off between time and accuracy as the distance metric. This solution is applied in physical education using smart phone to detect and analyze basic movements in dance practice.

The rest of the paper is organized as follows: related works are discussed in Section 2; data collection and basic preprocessing are discussed in Section 3; in Section 4, we describe our solution starting from feature extraction and selection to data classification; discussion of related experiments is arranged in Section 5; and finally comes the conclusion of the study in Section 6.

2. Related Work

Wearable sensors are usually used to detect different activities, hand gestures, and specific motions. Activity recognition [10, 11] is used to distinguish different states of human, such as running, walking, going upstairs, and going downstairs. These states consisted of repeated motions, which is a special characteristic that can be used in pattern recognition. Hand gestures recognition in man-machine interaction is also a hot topic that attracts much attention [12, 13]. Similarity comparison between the obtained data and standard model is a common way to solve the problem. There are also studies that focus on specific movement detection, such as fall [14], smoking [5], and body shaking [15]. In some cases, multiple sensors on different parts of human body would improve the recognition accuracy; thus wearable sensor groups are built to detect human status [16, 17]. At present, beside the body sensor network with different types of sensors on different part of the body that can be used in medical field, cluster of the same type of sensors on different part of human body is mainly discussed in activity and motion analysis. Professional training may prefer high accuracy, and multiple sensors would be a better choice. As for ordinary practice, sensors on a single part of the body would be wiser, considering the trade-off between accuracy, cost, and convenience. Most studies choose acceleration information to judge human state [18], and some of them also take gyroscope and magnetometer into consideration [19]. Sensor fusion is discussed in activity recognition based on feature extraction in time and frequency domains [20]. According to the observation of related experiments, sensor selection and fusion have close relations with the discussed activities, placing position of the sensor, type of the classifier, and the extracted features. It is difficult to make the choice beforehand. Therefore, how to make full use of information from different types of sensors efficiently without redundancy is a significant job to do.

As for motion recognition, there are generally two ways to handle the data. The first one is to extract features in time and frequency domains in each fragment divided by sliding window. And then, perform dimensional reduction on these features. After that, ordinary classifiers can be used to predict motion categories. This method is commonly used in activity recognition that contains repeated motions [21]. In such cases, the probability and statistics information is more efficient and useful. However, in recognizing specific motions such certain sports gestures and movements, the differences between various motion types might exist in shape of certain location. And the statistical features might fail in reflecting these differences. Another solution of motion recognition is based on distance metric. Distances between the testing data and training data can be used to build a k-NN (1-NN + DTW is supposed to be quite effective) classifier for further prediction [22, 23]. But the comparison in the testing phase might take too much time. For some high time cost distance measurements such as DTW, this may affect the efficiency, especially for online motion recognition. Therefore, study in [24] proposed an application of Nearest Centroid Classifier using average series based on DTW [25] as centroids. Also, template matching is quite popular. By directly comparing the testing motion data with a few prototype template generated from different classes called motifs, we are able to tell the type of motion the testing data belongs to. The main idea of the previous related researches focus on finding the optimized samples that can better represent the features of the categories they belong to [26–31]. Some studies also consider the separability between different classes [32].

Motion data can be treated as time series, and there are many related studies around the topic of time series similarity estimation. First of all, we can divide the existing methods into two categories: estimations based on global similarity and local similarity. Solutions belonging to the first category use the global matching cost/distance to be the similarity measurement, while, for data with noise, local similarity may be more accurate. The structure of related works are shown below: (i)

Global matching similarity: (1)

Similarity based on different distance metric: (a)

Dynamic time warping (DTW) [33, 34]: (I)

DTW with constraints [35, 36].

(II)

Dynamic Manifold Warping (DMW) [37].

(III)

TWED [38].

(b)

Traditional distance: (I)

Manhattan distance.

(II)

Euclidean distance.

(III)

Mahalanobis distance.

(2)

Symbolic based measurement: (a)

Edit distance.

(b)

Symbolic Aggregate approximation (SAX) [39].

(ii)

Local matching similarity: (1)

Longest Common Subsequence (LCS) [40].

(2)

Similarity based on shapelet [41].

Dynamic time warping (DTW) is most commonly used in real-time generated time series like motion and voice signal. Unlike Euclidean distance, this method tolerates offsets and time shifting during the comparison, which is quite suitable for our situation. But the time consumption of DTW is quite high, and it is one of the challenges we need to deal with. There are many attempts in improving DTW in different aspects. For example, different kinds of constraints are set for comparison, so as to reduce the time consumption [42]. Symbolic comparison is another common way to measure the similarities between time series, such as Edit distance and LCS. These methods are more suitable for symbolic comparison like string matching. As for continuous time series, they need to combine with other methods. For example, TWED use Edit distance to replace Euclidean distance in the matching process of DTW. Before the direct matching, transformation of raw data or related operation should be defined. SAX is a simple method that converts continuous time series into discrete symbolic series. We can also use SAX to compute the lower bound distance efficiently. Meanwhile, the process of discrete symbolic representation is also a special way of dimensional reduction. Direct comparison with one labelled sample from each class is fast and easy, but its accuracy cannot compare with classifiers. In some circumstances, the shape of the time series with noise might be quite similar globally, while local features are various according to different categories. Shapelet was proposed to solve such problem. Shapelet is a subsequence taken from one of the time series, whose distance to different time series can be used to distinguish time series from different categories. In order to work with commonly used classifiers, the solution called shapelet transform [43, 44] was brought up. The distances between a time series and all shapelets are treated as the selected features of the time series that can be handled in all kinds of classifiers.

Most of the existing distance based methods determine the categories of the unlabeled data according to their closest motifs or samples selected by different mechanisms. The guidance information of the classification is limited in the distance between testing data and the nearest model, no matter how superior these prototypes are. Information of the mutual distance relationship between the testing data and different classes is left out. Thus, how to make rational use of the mutual distance between different classes is the key problem we need to solve. In our solution, we consider the information of mutual distance between samples and different classes and use them as features for classification. In this way, we can speed up the recognition while maintaining high accuracy.

3. Data Collection

According to the situation and requirement of motion practice in physical education, we choose sensors embedded in smart phone to obtain the data of movements and gestures. Nowadays a smart phone carries more and more types of sensors, including accelerometer, gyroscope, gravity sensor, and magnetometer. These sensors reflect the status of the phone from different aspects. 3D accelerometer measures the acceleration on three mutual perpendicular directions. This acceleration is affected by gravity, so we need to separate the linear acceleration from the original data. 3D gyroscope obtains the rotation rate on 3 coordinates. Based on the readings of the sensors, we are able to obtain the attitude information of the smart phone. Magnetometer measures magnetic density and direction, so that we can detect the orientation of the smart phone.

In our case, we use iPhone 6 to sense human motion. The system editions after iOS 4 provide us with Core Motion Framework that can read the data of all kinds of sensors and conduct essential preprocessing. Specifically, with the class called CMDeviceMotion, we are able to get information about movement and attitude of the smart phone. This class mainly consists of four parts: attitude information calculated with reading of the embedded sensors; gravity data obtained by accessing gravity sensor; acceleration that has removed the influence of gravity and finished the filtering; rotation rate that has removed the bias of gyroscope. By calculating the reading of the different sensors, attitude of the smart phone is described as a quaternion and three-dimensional Euler angles: yaw, roll, and pitch.

After careful observation and analysis, we decided to wear a packet on the frontal side of the waist and put the smart phone inside the packet with its screen towards the front. We set 20 Hz as the sampling rate. We show some of the data we obtain from motion A in Figure 1: stepping forward and backward in 8 beats.

Figure 1

Some of the data provided by Core Motion Framework: including acceleration after filtering and gravity removal, rotation rate without bias, attitude information such as Euler angles and quaternion obtained from different sensors, and data received by gravity sensor and magnetometer. The sampling rate of the data is 20 Hz.

Since the direction of the phone changes constantly, it is hard to determine the instant state of the phone. One common way to eliminate such influence is to add a dimension: magnitude. As for acceleration in three dimensions, x, y, and z, magnitude is defined in the following:

\begin{matrix} magnitude = \sqrt{x^{2} + y^{2} + z^{2}} . \end{matrix}

(1)

With magnitude, we can ignore the variation of direction at a certain degree, but we fail to see the status on each dimension separately. Thus, we transform the linear acceleration and gyroscope data from coordinates of the phone to Earth coordinates. It is calculated with the equations below

lin_acc_earth = R \cdot (acc - grav),

(2a)

gyro_earth = R \cdot gyro .

(2b)

Here, R is the rotation matrix [45].

With data from the readings of each sensor, fusion of different sensors, and coordinates transformation, we finally obtain a 31-dimensional time series that can be arranged in 9 groups. The details are shown in Table 1.

Table 1

Sensor data collection.

Name	Coordinates	Description
accelerometerAcceleration	X, Y, Z	Reading of accelerometer
gyroRotation	X, Y, Z	Reading of gyroscope
Euler Angles	Yaw, roll, and pitch	Rotation attitude
motionRotationRate	X, Y, Z	Reading of gyroscope without bias
motionUserAcceleration	X, Y, Z	Acceleration without gravity
motionQuaternion	X, Y, Z, W	Spatial orientation and attitude
motionGravity	X, Y, Z	Output of gravity sensor
motionMagneticField	X, Y, Z	Magnetometer data
lin_acc_earth	X, Y, Z	Linear acceleration relative to Earth
gyro_earth	X, Y, Z	Gyroscope data relative to Earth

4. Motion Recognition Based on MCMR Interclass Distance Features

The sensor data of each motion is a fragment of unrepeated multivariant time series. For this kind of data, we usually apply methods based on distance similarity. Here, in our study, we are facing the multidimensional data from sensor fusion and the problem of matching efficiency. Inspired by shapelet transform in time series recognition, we use distances to samples from different classes to determine the class label of the testing data. Shapelet transform uses Euclidean distances to be the candidates of features for further classification. However, in our cases, this kind of measurement faces the challenge of data offsets and shifting as well as high data dimension and large time consumption. Therefore, we design a fast DTW measurement based on symbolic representation. To eliminate the influence of noise while speeding up the calculation, we use SAX to transform the original time series into shorter symbolic series before distance estimation. We are not sure what kind of sensor and which dimension is more significant in distinguishing different motions, so we use Max-Correlation and Min-Redundancy strategy to select proper candidates to participate in classification of the next step. The chosen features can be used in all kinds of classifiers. It is flexible enough for us to choose suitable classifiers with good performance according to actual situation. The procedure of the recognition solution is demonstrated in Figure 2.

Figure 2

The procedure of our motion recognition solution: firstly, the motion series are preprocessed by Z-normalization and symbolic representation; secondly, representative sequences are selected from training data as feature candidates using distance transformation; thirdly, features with high distinguish ability and low redundancy are chosen; finally, the selected features can be used to build all kinds classifiers for motion recognition.

Motion time series recognition based on distance metric uses the distances between time series as a kind of dissimilarity measurement and works in many similar situations, such as the NN-embedded distance and similarity measurement between testing samples and selected motifs. Unlike these methods, we try to use distances between testing motion series and sequences from different classes as values of the classification features and apply in different kinds of classifiers. Since various divergences appear in time and range of the same movement performed by different people as well as the same people in different time, we use DTW based method as the distance estimation. Meanwhile, considering the high calculation time, we reduce the dimension of the data using symbolic representation. The details of this solution will be described in the rest of this section.

Before further discussion, we summarize the notations throughout the paper in Abbreviations section.

With the data obtained in Section 3, we have the following definition of our research object.

Definition 1 (motion time series based on sensor fusion).

$S = {s_{1}, s_{2}, \dots, s_{m}}$ is a multidimensional time series that represents a single movement with m-dimensional sensor data. $s_{i} = {s d_{1}, s d_{2}, \dots, s d_{n_{i}}}$ indicates the data in ith $(1 \leq i \leq m)$ dimension with length $n_{i}$ .

In our case, m is 31, and n varies for different motions.

4.1. Data Preprocessing and Representation

Since the same motion with similar shapes might be far away from each other due to the difference of the action range. Data normalization is needed to eliminate this divergence. For the time series in ith dimension $s_{i}$ , its Z- $n o r m a l i z a t i o n$ is shown in the following:

\begin{matrix} s_{i_n o r m} = \frac{s_{i} - μ_{s_{i}}}{σ_{s_{i}}} . \end{matrix}

(3)

Here, the mean $μ_{s}$ and standard deviation $σ_{s}$ are defined as follows:

\begin{matrix} μ_{s_{i}} = \frac{\sum s d_{i}}{n}, \\ σ_{s_{i}} = \sqrt{\sum_{j = 1}^{n} \frac{{(s d_{i_{j}} - μ_{s_{i}})}^{2}}{n}} . \end{matrix}

(4)

In order to reduce the time consumption while eliminating the influence of noises, we transform the continuous time series into discrete symbolic series using SAX. The time series is divided into fragments of the same size using PAA. The breakpoints $β (β_{1}, β_{2}, \dots, β_{k})$ divide the data range into $k + 1$ sections with the same occurrence probability under $N (0,1)$ Gaussian distribution. All the fragments will then be transformed into corresponding symbols according to the sections they fall into. For a time series $s_{i}$ , if we set k as the number of breakpoints and f as the number of fragments, we are able to obtain a symbolic series $s t_{i} {s t_{i_{1}}, s t_{i_{2}}, \dots, s t_{i_{f}}}$ where $f < n_{i}$ and, for each element in $s t_{i_{j}}$ , we have $s t_{i_{j}} \in O_{1}, O_{2}, \dots, O_{k + 1}, 1 \leq j \leq f$ . O is used to represent a kind of symbol such as letters, integers, or binary codeh according to different application. This operation is described in Figure 3.

Figure 3

Mapping from original series to SAX symbolic representation: the vertical lines in graph divide the time series into pieces with equal length, and the horizontal lines divide the numerical range of different pieces into 8 sections according to Gaussian distribution.

Through this process, we are able to perform dimensional reduction and discretization. Meanwhile, as we need to compare the DTW distances, variation of the time series length may affect the results of the comparison. With the use of SAX, we can unify the length of the series obtained from different motion.

4.2. Feature Extraction Based on Distance Metric

The distance metrics that are commonly used in similarity estimation of time series include symbolic matching distances, different types of traditional distances, and DTW. Generalization of Euclidean distance in SAX representation provides us a lower bound distance based on the discrete symbolic data it generates. Assuming we have two symbolic series $s t_{x} {x_{1}, x_{2}, \dots, x_{i}, \dots, x_{low_dim}}$ and $s t_{y} {y_{1}, y_{2}, \dots, y_{j}, \dots, y_{low_dim}}$ , their lower bound distance can be calculated with (5). It is fast and effective in many application scenarios of time series mining, but it is only used in one-dimensional time series:

\begin{matrix} SAX_MIN_DIST (s t_{x}, s t_{y}) = \sqrt{\frac{n_{x} + n_{y}}{2 \times low_dim}} \sqrt{\sum_{i = 1}^{low_dim} min_dist {(x_{i}, y_{i})}^{2}} . \end{matrix}

(5)

$m i n_d i s t$ is calculated with

\begin{matrix} min_dist (x_{i}, y_{i}) = \{\begin{cases} 0, & if |x_{i} - y_{i}| \leq 1 \\ β_{\max (x_{i}, y_{i}) - 1} - β_{\max (x_{i}, y_{i})}, & otherwise. \end{cases} \end{matrix}

(6)

For multidimensional data, traditional distances such as Manhattan distance, Euclidean distance, and Mahalanobis distance are able to provide simple and fast similarity measurement. But these distances sequentially align two time series and cannot tolerate the offsets and time shifting. DTW is a proper way to solve these problems, but it costs too much time. Dynamic time warping is a pattern matching algorithm that can measure the similarity between two time sequences. It is first used in speech recognition and then spreads to other areas such as data mining, gesture recognition, and robotics. To speed up the calculation, we use Sakoe-Chiba band to restrict the matching range. For symbolic series $s t_{x} {x_{1}, x_{2}, \dots, x_{i}, \dots, x_{low_dim}}$ and $s t_{y} {y_{1}, y_{2}, \dots, y_{j}, \dots, y_{low_dim}}$ , this distance can be calculated using (7) as follows:

\begin{matrix} D (i, j) = \{\begin{cases} d (i, j) + m i n (D (i - 1, j - 1), D (i, j - 1), D (i - 1, j)); & i > 0, j > 0, i - j < b w \\ d (i, j) + D (i, j - 1); & i = 0, 0 < j < b w \\ d (i, j) + D (i - 1, j); & j = 0, 0 < i < b w \\ 0; & i = 0, j = 0 . \end{cases} \end{matrix}

(7)

b w

is the band width of the constraint. Normally, Euclidean distances are used to measure the similarity between data points

x_{i}

and

y_{j}

during the matching process as follows:

\begin{matrix} d (i, j) = \sqrt{\sum_{k = 1}^{\dim} {(x_{i k} - y_{j k})}^{2}} . \end{matrix}

(8)

d i m

is the dimension of each data point and

x_{i k}

and

y_{j k}

indicate data in kth dimension of

x_{i}

and

y_{j}

. The matching process is shown in Figure 4.

Figure 4

Demonstration of matching process of DTW: for two time series X and Y, their matching path is restricted in the diagonal belt area on the right, and the matching outside of this area will be ignored. This is a common way to speed up the calculation of this algorithm.

For dim-dimensional time series, we can either treat the data point at each time stamp as a dim-dimensional data and do the matching once or consider the matching on each dimension separately. These two methods may receive quite different results. Some studies [46] define the former one as $D T W_{D}$ which would not allow different alignment on the same time stamp and the latter one as $D T W_{I}$ that would accept the independent matching. The choice should be based on the correlation between different dimensions. In our case, we need to perform the matching on the symbolic series that does not support multidimensional situation. Therefore, we deal with the information from different DoF of various sensors independently. As a matter of fact, $D T W_{I}$ is proved to be more effective than $D T W_{D}$ in the existing studies of motion sensing. However, we do not sum up the distances on different DoFs as $D T W_{I}$ . Instead, we consider each of them individually as one of the feature candidates. We will find out the most significant candidates with small correlation between each other in the next part.

4.3. Feature Selection Based on MCMR Strategy

As the selected series are treated as feature candidates, we decide to apply feature selection strategy in finding the most representative ones [47, 48]. In this section, Max-Correlation and Min-Redundancy (MCMR) strategy is proposed to screen out the features that best distinguish different classes and be independent from each other. It mainly consists of two steps: first of all, we find out the feature candidates with strong ability to distinguish samples from different classes; secondly, we need to eliminate the redundant candidates from the chosen ones. In the first step, we introduce the feature selection algorithm ReliefF to estimate the distinguished ability of each feature. By setting a certain threshold η for the number of candidates, we are able to filter out the features with strong correlations to the categories. In the next step, the candidates with weaker class correlation than their similar candidates are abandoned. Therefore, we need to sort the candidates obtained in the former step in descending order and then successively select the candidates with low correlation to the previous chosen ones until the termination condition is satisfied.

Definition 2 (RF weight: feature weight based on ReliefF).

RF weight is a value of a feature candidate that indicates its ability to distinguish different categories from each other. We can compute this weight with RelieF.

ReliefF [49] is a feature selection algorithm that can be applied in multiclass problem. It is the extended version of Relief which suits two-class problem only. In this algorithm, the weight of each feature will be determined after a few iterations. Specifically, in each iteration, ReliefF chooses one sample R from labelled data set randomly and then extracts k nearest neighbors (near hits) from the same category and k nearest samples (near misses) in every other category individually. After that, the weights of the features are updated based on the following:

\begin{matrix} w (F_{i}) = w (F_{i}) - \sum_{j = 1}^{k} \frac{d i f f (F_{i}, R, H_{j})}{m k} + \sum_{c \notin c l a s s (R)} \frac{[(p (C) / (1 - p (c l a s s (R)))) \sum_{j = 1}^{k} d i f f (F_{i}, R, M_{j} (C))]}{m k} . \end{matrix}

(9)

The $d i f f$ function is defined in

\begin{matrix} d i f f (F_{i}, R_{1}, R_{2}) = \{\begin{cases} \frac{|R_{1} (F_{i}) - R_{2} (F_{i})|}{m a x (F_{i}) - m i n (F_{i})}, & if F_{i} is  continuous \\ 0, & i f F i s d i s c r e t e a n d R_{1} (F_{i}) = R_{2} (F_{i}) \\ 1, & i f F i s d i s c r e t e a n d R_{1} (F_{i}) \neq R_{2} (F_{i}) . \end{cases} \end{matrix}

(10)

F is a collection of feature candidates chosen from labelled data of different classes.

w (F_{i})

is the RF weight of the ith feature candidate;

H_{j}

is the jth nearest hits;

M_{j} (C)

is the jth nearest misses in class C.

\max (F_{i})

and

\min (F_{i})

represent the maximum and minimum values of

F_{i}

After a certain time of iterative computations, features that can better distinguish samples from different classes will obtain higher weights. However, in the process above, feature candidates are supposed to be independent from each other, and they do not consider the correlation and redundancy between features. Therefore, we need to eliminate the redundant candidates. Here, we first sort the features in descending order based on their RF weights and remove the candidates with weights less than a certain threshold. Then, for the rest of the candidates, we should check whether they are correlated with the chosen ones, successively. If one candidate is supposed to have close relationship with one of the chosen features, it should be abandoned. Otherwise, it can be kept as one of the chosen features. This process will continue until we have collected enough number of features.

In previous studies, various measurements are used to estimate feature redundancy, such as information gain, mutual information, and correlation coefficient. We notice that redundant candidates usually have similar class separability, which means their RF weights have higher linear correlation with each other. Here we use Pearson correlation coefficients based on the RF weights to indicate the correlation between the candidates, which is indicated in the following:

\begin{matrix} ρ_{a, b} = \frac{\sum w_{a} w_{b} - \sum w_{a} \sum w_{b} / N}{\sqrt{(\sum w_{a}^{2} - {(\sum w_{a})}^{2} / N) (\sum w_{b}^{2} - {(\sum w_{b})}^{2} / N)}} . \end{matrix}

(11)

We use $w_{a}$ to represent the RF weight matrix of a chosen feature and $w_{b}$ indicates the RF weight matrix of a candidate. $ρ_{a, b}$ is the Pearson correlation coefficient between a feature that has already been chosen and a candidate. If this correlation coefficient is higher than the threshold, then we should eliminate this candidate.

Pearson correlation coefficient estimates the linear correlation between two variables. Its value is between +1 and −1 inclusive, where 1 is total positive correlation, 0 means no correlation, and −1 is total negative correlation. The actual correlation situation is described in Table 2. According to these corresponding relations, the threshold is easy to define. And this is why we choose this method.

Table 2

Correlation situation represented by Pearson correlation coefficients.

Correlation coefficients range	Correlation situation
1.0∼0.8	Extremely strong relation
0.8∼0.6	Strong relation
0.6∼0.4	Medium correlation
0.4∼0.2	Weak correlation
0.2∼0.0	Extremely Weak correlation/no relation
0.0∼−1.0	Negative correlation

The pseudocode of this process is shown in Algorithm 1.

Algorithm 1: MCMR_FeatureSelection(Dis, sn, k, cft, $η$ ).

Input:

$D i s$ as distance vector

$s n$ as sampling times in ReliefF

k as numbers of nearest hits and misses in ReliefF

$c f t$ as cut off threshold of max class correlation candidates

$η$ as the threshold of correlation coefficient;

Output:

$b e s t f$ as a group of Max-Correlation and Min-Redundancy features

(1) W $\leftarrow$ ReliefF( $D i s$ , $s n$ , k); {calculate the RF weight: W}

(2) $s i d$ $\leftarrow$ sort(W); {find the descending order of W}

(3) i $\leftarrow$ cutoff( $s w$ , $c f t$ ); {find the turing position of cft}

(4) $b e s t f \leftarrow ⌀$ ;

(5) $b e s t$ .add( $s i d$ (1));

(6) $j \leftarrow 2$ ;

(7) $n u m \leftarrow 1$ ;

(8) while $j < i$ do

(9) $k \leftarrow 1$ ;

(10) while $k < b e s t f$ .size( ) do

(11) if $ρ_{s i d (j), b e s t f (k)} > η$ then

(12) break;

(13) else

(14) $k \leftarrow k + 1$ ;

(15) end if

(16) end while

(17) if $k = = n u m + 1$ then

(18) $n u m = n u m + 1$ ;

(19) $b e s t f$ .add( $s i d (j)$ );

(20) end if

(21) $j = j + 1$ ;

(22) end while

With this procedure, we are able to screen out features with high class correlation and low redundancy. These features can be used to build various kinds of classifiers for motion recognition.

4.4. Motion Classification

In this stage, we can use the selected features to train different kinds of classifiers, such as 1-Nearest Neighbor [50], Bayes Net [51], and Random Forest [52]. We may obtain different results from various classifiers, and we should choose the proper one with better performance in actual application after a few trials. In our study, we apply this solution in dance action practice. The classification effect will be discussed in detail in the next section.

5. Experiments

To verify the efficiency of our solution, we discuss its application in dance practice. The motion data is collected with smart phone and recognize different movements and gestures performed by the wearers. In fact, we found large amount of such requirements as detecting human movements during the practice session in physical education. In our case, we invite 5 students to perform 6 basic practice movements. Each movement is performed 10 times by each person. We perform 10-fold cross-validation on these data to check the classification results in MATLAB.

In this section, we are going to discuss the following problems: (i)

Feasibility and superiority of the solution.

(ii)

Necessity of sensor fusion.

(iii)

Parameters selection and adjustment.

More details can be found in the rest of this section.

5.1. Feasibility and Superiority Verification

In order to check the feasibility and superiority of our solution in multivariant motion series recognition, we compare its classification accuracy with the commonly used methods. For multidimensional time series, there are actually two ways to measure the distance based on DTW. We can either calculate the matching cost on each dimension separately or match the multidimensional data points all together in one time. According to [46], this decision should be based on the correlation between these dimensions. As a matter of fact, these two methods have both been applied in related studies. In our solution, we consider the dimensions independently and leave the correlation problem to feature selection process. To check the superiority of our solution, we also try the other ways, including using LB instead of DTW, three kinds of DTW matching without symbolization. In this comparison, we set lower dimension as 50, k as 5, and η as 0.5. The accuracies in different classifiers are shown in Figure 5.

Figure 5

Classification accuracies in various classifiers with different solution: our solution uses DTW on time series represented by SAX and obtains the best and stable results with different classifiers most of the time. SAX_LB also performs quite good with its high efficiency. DTW on original time series does not reveal outstanding performance with exact matching. Specifically, $D T W_{I}$ performs better than $D T W_{D}$ .

At the same time, we should also take a look at the time cost of each solution in training and testing in Figure 6.

Figure 6

Time consumption of different solutions: SAX_LB is the fastest with no doubt as it only performs synchronous matching. $D T W$ takes much time as expected, especially for $D T W_{I}$ . By aligning with symbolic series, we speed up our solution and get the second place among these similarity measurements.

As we can see from the results above, measuring DTW distance on different dimensions separately performs better than multidimensional comparison. If we consider the dimensional fusion on each sensor, it will take too much time with ordinary performance. If we take the whole series together with all dimensions to calculate the distance, the result will be disappointing, though it does not take much time.

It is quite obvious that the symbolic representation would not reduce the classification accuracy while reducing calculation time. In contrast, symbolic representation performs a little bit better than original series using DTW, most of the time. To overcome the time shifting and offsets of the motion data, we apply DTW on the symbolic series. As we expected, this solution obtains the best classification accuracy and takes less time than DTW comparison on the original data. As a matter of fact, lower bound distance of SAX takes the least time in training and testing. And its accuracy is acceptable. Thus, it is also a good trade-off between time and accuracy.

5.2. Discussion of Sensor Fusion

Most of the existing studies would use a single accelerometer to measure the motion state, while some may also consider the readings of gyroscope or magnetometer. Here in our situation, we are able to obtain information from various sensors. How to choose proper amount of data to achieve better performance has become one of our research goal. In one hand, we check the performances with the use of only one kind of sensor data in Figure 7.

Figure 7

Classification accuracies using data of each sensor separately: it is quite obvious that if we use different types of sensor information independently, accelerometer and its transformation win in most of the time. However, the results do not seem to be improved after the transformation into Earth coordinates.

From these results, we confirm that accelerometer is the most effective sensor among the existing sensing groups. Most sensors behave better using 1-NN, and a little worse with decision tree. If we observe the results independently, we discover that the original reading from accelerometer containing the influence of gravity performs a little better than pure linear acceleration alone. For gyroscope, the results are improved after the preprocessing such as eliminating the bias. The transformation into Euler angles obtains better performance, but the quaternion does not work so well alone. Gravity and magnetometer obtain mediocre results separately. Meanwhile the transformation into Earth coordinates does not seem to improve much. Among all these results, the best performance comes from the linear acceleration in Earth coordinates, which obtains 97% classification accuracy using 1-NN.

On the other hand, we are going to find out the situation in sensor fusion. Whether it is necessary to use data from different sensors in our situation is one question we need to answer. Therefore, we discuss the performance using the fusion data in Figure 8.

Figure 8

Classification accuracies with sensor fusion: we get better results with more information. In fact, there are slight differences between different kinds of fusion. And it is unnecessary to use all the data in actual practice.

Obviously, the results of sensor fusion are much higher than single sensing data. From the overall performance viewpoint, fusion of all sensor data behaves the best, when fusion of CMDeviceMotion (CMDM) data performs better than fusion of four key sensors: accelerometer, gyroscope, gravity sensor, and magnetometer $(A c + G y + G r + M a)$ in CMDeviceMotion. And the performance of four key sensors in CMDeviceMotion is better than fusion of their raw data in most of the time. The results of accelerometer and gyroscope fusion are relatively worse than the other fusions. Among all these results, we have the best performance coming from fusion of all with 99.33% classification accuracy using 1-NN.

Apparently, with our solution, sensor fusion improves the performance. With more types of sensor data, we would have more information from different points of view. But, at the meantime, we should also consider the time consumption. Our final choice should be based on the resources we have, and the trade-off between time and accuracy might be the best decision.

5.3. Parameter Setting

To obtain better performance, we need to choose proper parameters. And one of the common ways to achieve this goal is to decide through experiments. In our solution, we will discuss three important factors. Firstly, we should mind the choice of target dimension of discrete segmentation. If this size is too small, it will affect the accuracy. If it is too large, it will take too much time. Here, we compare the accuracy in different classifiers with different size of the target dimensions in Figure 9. And the average of the accuracies of different classifiers is shown in Figure 10, which indicates the overall trend of performance with different size of the target dimensions.

Figure 9

Classification accuracies of different classifiers with different size of the target dimensions: most of the classifiers achieve high accuracies above 90%. With the growth of low_dim in symbolic representation, we have longer series and obtain better results. Thus the trend of classification accuracies generally goes up.

Figure 10

Average classification accuracies with different size of the target dimension: the overall trend of classification accuracies grows as the increase of the length of the symbolic representation and becomes steady after 80. As the compression ratios are lower with high target dimensions, the new series reserve more information for classification.

Meanwhile, we should also take a look at the time cost of these situations. And the results are shown in Figure 11.

Figure 11

Time consumption with different size of the target dimension: there is a clear growth of time consumption along with the increase of low_dim, as with longer series we need more matching and calculation.

From these results, we can see that for most classifiers, 50 would be a better choice as it takes less time while obtaining fairly well accuracy. Secondly, the threshold of Pearson correlation coefficient η is another parameter we need to observe. According to our intuition, candidates with high correlations to the chosen features might be redundant and should be eliminated. Thus, we should find a threshold to filter out those candidates. As we have mentioned above, this correlation coefficient is defined in a small range, and we can find out the correlation situation based on the interval this value falls in. Now, let us see where we should put this threshold. Since the range of this value is quite narrow, we can easily cover the whole positive range. The performance in different situations is shown in Figures 12 and 13.

Figure 12

Classification accuracies with different η: this is a discussion of the correlations between chosen features. If η is too small, the results of most classifiers are lower, as many useful features are abandoned. Meanwhile, if it is too big, we would reserve too many features, including the redundant ones, which may also affect the result.

Figure 13

Average accuracies of different classifiers with different η: the overall trend of the accuracies on different classifiers goes up until it pass 0.5 and drops a little afterwards. This result accords with the nature of relationship between the number of features and the classification results.

According to this result, we learn that medium correlations around 0.5 work for our case. This fact actually accords with logic and our expectation. Finally, we take a look at the choice of k in RF weight calculation, which represents the number of nearest candidates we need to consider in different classes. It is commonly believed that the nearest few samples play a significant role in estimating class discrimination ability as they are easier to be confused. So, we check the performance with different choices of k, and the results are shown in Figures 14 and 15.

Figure 14

Classification accuracies of different classifiers with different k: no matter what kind of classifiers we use, the results do not change much with different number of nearest samples chosen from various categories for feature estimation.

Figure 15

Average accuracies of different classifiers with different k: it does not show significant changes in performance with only slight decrease in average accuracies, using different number of nearest samples to estimate the quality of the feature candidates.

As we can see from the results, no significant change appears in situations with different k, only a slight decline in the trend of average accuracy. That means the performance is insensitive to k in this range. It takes less energy in setting this parameter.

6. Conclusion and Future Work

In this paper, we propose a flexible and efficient solution for motion recognition. This solution measures the human movements and gestures using smart phone embedded sensors and applies Max-Correlation and Min-Redundancy strategy to select proper features based on interclass symbolic distance metric. In order to decide what kinds of sensors are needed, we test the effect of different sensors individually as well as combinations of various sensors. The final choice should be a trade-off between time and accuracy based on the requirement and resources. If we pay more attention to efficiency, we can choose less sensor data with accuracy around 94% in average, whereas we can obtain about 98% accuracy using more sensor information. In general, fusion of a few key sensors would be a better option in most cases. In our situation, we believe fusion of accelerometer and gyroscope with 6 dimensions reaching accuracies around 95.67% in average and 98.33% at best would be a better trade-off between time and accuracy. In data recognition phase, the symbolic representation is able to reduce the dimension of the time series that speeds up the calculation while eliminating the noise and random disturbances. Dynamic time warping on this low dimensional discrete data overcomes the influences of data offsets and time shifting and, in the meantime, improves the classification accuracy. To find out the significant distance measurements and use them as classification features, we define RF weights based on ReliefF algorithm to estimate their class correlation and remove the redundant ones based on Pearson correlation coefficient. This solution is proved to be quite effective in real world application of motion detection in dance practice. In fact, the methods we used in the structure of our solution can be replaced with other related methods based on actual needs, which is quite flexible.

This study is mainly a preliminary exploration of the feasibility of the method. Our future job is to serve more applications by training our solution for more types of movements in dance classes as well as other disciplines such as martial art and gymnastics.

Footnotes

Abbreviations

Competing Interests

The authors declare that there are no competing interests regarding the publication of this paper.

Acknowledgments

This research is sponsored by National Natural Science Foundation of China (nos. 61171014, 61371185, 61401029, 61472044, 61472403, and 61571049) and the Fundamental Research Funds for the Central Universities (nos. 2014KJJCB32 and 2013NT57) and by SRF for ROCS, SEM, and by the Youth Talents Project of Beijing (YETP1711).

References

S.-R.

Thuc

H. L. U.

Lee

Y.-J.

Hwang

J.-N.

Yoo

J.-H.

Choi

K.-H.

A review on video-based human activity recognition

Computers 2013 2 2 88 131

10.3390/computers2020088

Pham

H.-T.

Kim

J.-J.

Nguyen

T. L.

Won

3D motion matching algorithm using signature feature descriptor

Multimedia Tools and Applications 2015 74 3 1125 1136

Slama

Wannous

Daoudi

3D human motion analysis framework for shape similarity and retrieval

Image and Vision Computing 2014 32 2 131 154

10.1016/j.imavis.2013.12.011

2-s2.0-84893440745

Keerthika

Ganesan

Pervasive health care system for monitoring oxygen saturation using pulse oximeter sensor

Proceedings of the IEEE Conference on Information & Communication Technologies (ICT '13)

April 2013

IEEE

819 823

10.1109/cict.2013.6558207

2-s2.0-84881624498

Parate

Chiu

M.-C.

Chadowitz

Ganesan

Kalogerakis

RisQ: recognizing smoking gestures with inertial sensors on a wristband

Proceedings of the 12th Annual International Conference on Mobile Systems, Applications, and Services (MobiSys '14)

June 2014

Bretton Woods, NH, USA

ACM

149 161

10.1145/2594368.2594379

2-s2.0-84903220206

Tasoulis

S. K.

Doukas

C. N.

Plagianakos

V. P.

Maglogiannis

Statistical data mining of streaming motion data for activity and fall recognition in assistive environments

Neurocomputing 2013 107 87 96

10.1016/j.neucom.2012.08.036

2-s2.0-84875095274

Ahmadi

Mitchell

Destelle

Gowing

O'Connor

N. E.

Richter

Moran

Automatic activity classification and movement assessment during a sports training session using wearable inertial sensors

Proceedings of the 11th International Conference on Wearable and Implantable Body Sensor Networks (BSN '14)

June 2014

Zürich, Switzerland

IEEE

98 103

10.1109/bsn.2014.29

2-s2.0-84905969861

Chardonnens

Favre

Cuendet

Gremion

Aminian

Measurement of the dynamics in ski jumping using a wearable inertial sensor-based system

Journal of Sports Sciences 2014 32 6 591 600

10.1080/02640414.2013.845679

2-s2.0-84896735865

https://en.wikipedia.org/wiki/Microelectromechanical_systems

10.

Althloothi

Mahoor

M. H.

Zhang

Voyles

R. M.

Human activity recognition using multi-features and multiple kernel learning

Pattern Recognition 2014 47 5 1800 1812

10.1016/j.patcog.2013.11.032

2-s2.0-84893703288

11.

Lara

Ó. D.

Labrador

M. A.

A survey on human activity recognition using wearable sensors

IEEE Communications Surveys and Tutorials 2013 15 3 1192 1209

10.1109/SURV.2012.110112.00192

2-s2.0-84881311778

12.

Liu

M.-C.

Yuan

Hand gesture detection

US Patent 8,792,722, 2014

13.

Park

Lee

Hwang

Yoo

Nachman

Song

E-gesture: a collaborative architecture for energy-efficient gesture recognition with hand-worn sensor and mobile devices

Proceedings of the 9th ACM Conference on Embedded Networked Sensor Systems

November 2011

Seattle, Wash, USA

ACM

260 273

14.

Mubashir

Shao

Seed

A survey on fall detection: principles and approaches

Neurocomputing 2013 100 144 152

10.1016/j.neucom.2011.09.037

2-s2.0-84868627131

15.

Niazmand

Tonn

Kalaras

Kammermeier

Boetzel

Mehrkens

J.-H.

Lueth

T. C.

A measurement device for motion analysis of patients with parkinson's disease using sensor based smart clothes

Proceedings of the 5th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth '11)

May 2011

Dublin, Ireland

9 16

16.

Field

Stirling

Pan

Ros

Naghdy

Recognizing human motions through mixture modeling of inertial data

Pattern Recognition 2015 48 8 2394 2406

10.1016/j.patcog.2015.03.004

2-s2.0-84928277549

17.

Yurtman

Barshan

Automated evaluation of physical therapy exercises using multi-template dynamic time warping on wearable sensor signals

Computer Methods and Programs in Biomedicine 2014 117 2 189 207

10.1016/j.cmpb.2014.07.003

2-s2.0-84907970332

18.

Fuentes

Gonzalez-Abril

Angulo

Ortega

J. A.

Online motion recognition using an accelerometer in a mobile device

Expert Systems with Applications 2012 39 3 2461 2465

10.1016/j.eswa.2011.08.098

2-s2.0-80255138194

19.

Hsu

Y.-L.

Chu

C.-L.

Tsai

Y.-J.

Wang

J.-S.

An inertial pen with dynamic time warping recognizer for handwriting and gesture recognition

IEEE Sensors Journal 2015 15 1 154 163

10.1109/JSEN.2014.2339843

2-s2.0-84910093810

20.

Shoaib

Bosch

Durmaz Incel

Scholten

Havinga

P. J. M.

Fusion of smartphone motion sensors for physical activity recognition

Sensors 2014 14 6 10146 10176

10.3390/s140610146

2-s2.0-84902254076

21.

Kwapisz

J. R.

Weiss

G. M.

Moore

S. A.

Activity recognition using cell phone accelerometers

ACM SIGKDD Explorations Newsletter 2011 12 2 74 82

10.1145/1964897.1964918

22.

Shao

Integral invariants for space motion trajectory matching and recognition

Pattern Recognition 2015 48 8 2418 2432

10.1016/j.patcog.2015.02.029

2-s2.0-84928292001

23.

Shan

Liu

Feature recognition of body dance motion in sports dancing

Metallurgical & Mining Industry 2015 7 7 290 297

2-s2.0-84942328956

24.

Petitjean

Forestier

Webb

G. I.

Nicholson

A. E.

Chen

Keogh

Dynamic time warping averaging of time series allows faster and more accurate classification

Proceedings of the 14th IEEE International Conference on Data Mining (ICDM '14)

December 2014

Shenzhen, China

470 479

10.1109/icdm.2014.27

2-s2.0-84936941169

25.

Petitjean

Ketterlin

Gançarski

A global averaging method for dynamic time warping, with applications to clustering

Pattern Recognition 2011 44 3 678 693

10.1016/j.patcog.2010.09.013

ZBL1209.68477

2-s2.0-78649324794

26.

Fuchs

Gruber

Nitschke

Sick

On-line motif detection in time series with SwiftMotif

Pattern Recognition 2009 42 11 3015 3031

10.1016/j.patcog.2009.05.004

ZBL1175.68321

2-s2.0-67649404577

27.

Yang

Cai

Lin

A model-free and stable gene selection in microarray data analysis

Proceedings of the 5th IEEE Symposium on Bioinformatics and Bioengineering (BIBE '05)

October 2005

Minneapolis, Minn, USA

3 10

10.1109/bibe.2005.4

2-s2.0-33751189368

28.

Hartmann

Schwab

Link

Prototype optimization for temporarily and spatially distorted time series

Proceedings of the AAAI Spring Symposium Series

March 2010

Palo Alto, Calif, USA

29.

Yang

Cai

Lin

A stable gene selection in microarray data analysis

BMC Bioinformatics 2006 7, article 228

10.1186/1471-2105-7-228

2-s2.0-33746677949

30.

Miao

Wang

Lin

Optimized recognition with few instances based on semantic distance

The Visual Computer 2015 31 4 367 375

10.1007/s00371-014-0931-8

2-s2.0-84924852370

31.

Miao

Wang

Chen

Zhou

Image completion with multi-image based on entropy reduction

Neurocomputing 2015 159 1 157 171

10.1016/j.neucom.2014.12.088

2-s2.0-84933279576

32.

Hartmann

Link

Gesture recognition with inertial sensors and optimized DTW prototypes

Proceedings of the IEEE International Conference on Systems, Man and Cybernetics (SMC '10)

October 2010

Istanbul, Turkey

2102 2109

10.1109/icsmc.2010.5641703

2-s2.0-78751530702

33.

Berndt

D. J.

Clifford

Using dynamic time warping to find patterns in time series

10, no. 16

Proceedings of the Knowledge Discovery in Databases (KDD '94)

1994

Seattle, Wash, USA

359 370

34.

Sakoe

Chiba

A dynamic programming approach to continuous speech recognition

Proceedings of the 7th International Congress on Acoustics

August 1971

Budapest, Hungary

65 69

35.

Itakura

Minimum prediction residual principle applied to speech recognition

IEEE Transactions on Acoustics, Speech and Signal Processing 1975 23 1 67 72

10.1109/tassp.1975.1162641

2-s2.0-0016467604

36.

Sakoe

Chiba

Dynamic programming algorithm optimization for spoken word recognition

IEEE Transactions on Acoustics, Speech, and Signal Processing 1978 26 1 43 49

10.1109/tassp.1978.1163055

2-s2.0-0017930815

37.

Gong

Medioni

Zhao

Structured time series analysis for human action segmentation and recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence 2014 36 7 1414 1427

10.1109/TPAMI.2013.244

2-s2.0-84903175597

38.

Marteau

P.-F.

Time warp edit distance with stiffness adjustment for time series matching

IEEE Transactions on Pattern Analysis and Machine Intelligence 2009 31 2 306 318

10.1109/TPAMI.2008.76

2-s2.0-62249218289

39.

Lin

Keogh

Lonardi

Chiu

A symbolic representation of time series, with implications for streaming algorithms

Proceedings of the 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD '03)

2003

ACM

2 11

10.1145/882082.882086

40.

Abdulla-Al-Maruf

Huang

H.-H.

Kawagoe

Time series classification method based on longest common subsequence and textual approximation

Proceedings of the 7th International Conference on Digital Information Management (ICDIM '12)

August 2012

IEEE

130 137

41.

Keogh

Time series shapelets: a new primitive for data mining

Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '09)

July 2009

ACM

947 956

10.1145/1557019.1557122

2-s2.0-70350660908

42.

Keogh

Ratanamahatana

C. A.

Exact indexing of dynamic time warping

Knowledge and Information Systems 2005 7 3 358 386

10.1007/s10115-004-0154-9

2-s2.0-14844285758

43.

Lines

Davis

L. M.

Hills

Bagnall

A shapelet transform for time series classification

Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '12)

August 2012

Beijing, China

289 297

10.1145/2339530.2339579

2-s2.0-84866029906

44.

Hills

Lines

Baranauskas

Mapp

Bagnall

Classification of time series by shapelet transformation

Data Mining and Knowledge Discovery 2014 28 4 851 881

10.1007/s10618-013-0322-1

MR3176926

ZBL1298.62098

2-s2.0-84896489839

45.

http://developer.android.com/reference/packages.html

46.

Shokoohi-Yekta

Wang

Keogh

On the non-trivial generalization of dynamic time warping to the multi-dimensional case

Proceedings of the SIAM International Conference on Data Mining

April 2015

Vancouver, Canada

39 48

47.

Cai

Goebel

Salavatipour

M. R.

Lin

Selecting dissimilar genes for multi-class classification, an application in cancer subtyping

BMC Bioinformatics 2007 8, article 206

10.1186/1471-2105-8-206

48.

Lin

Cai

Wan

X.-F.

Goebel

Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping

BMC Bioinformatics 2008 9 1, article 279

10.1186/1471-2105-9-279

2-s2.0-46049105726

49.

Robnik-Šikonja

Kononenko

Theoretical and empirical analysis of relieff and rrelieff

Machine Learning 2003 53 1-2 23 69

10.1023/a:1025667309714

2-s2.0-0141990695

50.

Altman

N. S.

An introduction to kernel and nearest-neighbor nonparametric regression

The American Statistician 1992 46 3 175 185

10.2307/2685209

MR1183070

51.

Lee

Shimoji

Bayesnet: bayesian classification network based on biased random competition using gaussian kernels

Proceedings of the IEEE International Conference on Neural Networks

April 1993

San Francisco, Calif, USA

1354 1359

10.1109/ICNN.1993.298754

52.

Breiman

Random forests

Machine Learning 2001 45 1 5 32

10.1023/A:1010933404324

ZBL1007.68152

2-s2.0-0035478854

Sports Motion Recognition Using MCMR Features Based on Interclass Symbolic Distance

Abstract

1. Introduction

2. Related Work

3. Data Collection

4. Motion Recognition Based on MCMR Interclass Distance Features

Definition 1 (motion time series based on sensor fusion).

4.1. Data Preprocessing and Representation

4.2. Feature Extraction Based on Distance Metric

4.3. Feature Selection Based on MCMR Strategy

Definition 2 (RF weight: feature weight based on ReliefF).

Algorithm 1: MCMR_FeatureSelection(Dis, sn, k, cft, η ).

4.4. Motion Classification

5. Experiments

5.1. Feasibility and Superiority Verification

5.2. Discussion of Sensor Fusion

5.3. Parameter Setting

6. Conclusion and Future Work

Footnotes

Abbreviations

Competing Interests

Acknowledgments

References

Algorithm 1: MCMR_FeatureSelection(Dis, sn, k, cft, $η$ ).