Tracking and climbing behavior recognition of heavy-duty trucks on roadways

Abstract

The tracking and behavior recognition of heavy-duty trucks on roadways are keys for the development of automated heavy-duty trucks and an advanced driver assistance system. The spatiotemporal information of trucks from trajectory tracking and motions learnt from behavior analysis can be employed to predict possible driving risks and generate safe motion to avoid roadway accidents. This article presents a unified tracking and behavior recognition algorithm that can model the mobility of heavy-duty trucks on long inclined roadways. Random noise within the sampled elevation data is addressed by time-based segmentation to extract time-continuous samples at geographical locations. A Kalman filter is first used to distinguish error offsets from random noise and to estimate the distribution of truck elevations for different time intervals. A Markov chain Monte Carlo model is then applied to classify truck behaviors based on the change in elevation between two geographical locations. A heavy-duty truck mobility (HVMove) model is constructed based on the map information to apply the roadway geometry to the tracking and behavior recognition algorithm. We develop an extended Metropolis–Hastings algorithm to tune the parameters of the HVMove model. The proposed model is verified and evaluated through extensive experiments based on a real-world trajectory dataset covering sections of an expressway and national and provincial highways. From the experimental results, we conclude that the HVMove model provides sufficient accuracy and efficiency for automated heavy-duty trucks and advanced driver assistance system applications. In addition, HVMove can generate maps with the elevation information marked automatically.

Keywords

Heavy-duty truck, mobility modeling, spatiotemporal trajectory, elevation

Introduction

Coasting and braking on long inclined roadways are among the primary causes of traffic accidents involving heavy-duty trucks.^1–3 To improve truck climbing, we have to track and accurately predict the trucks’ movements while driving uphill and downhill. Ensuring the safety of drivers, trucks, and goods on roadways as well as the development of automated heavy-duty trucks can be facilitated by detailed analyses of truck motion and behavior. Moreover, such analyses also represent a primary foundation for evaluating environmental impacts,^4,5 detecting air quality,^6,7 reducing fuel consumption,⁸ and optimizing the transportation sector.⁹ The specific motion of heavy-duty trucks depends on controllable tractive and braking forces as well as external forces arising from conditions such as road slope.¹⁰ Analysis of the dynamic features of trucks on continuous long upgrades is of particular importance because this facilitates the modeling of their mobility, thereby establishing the safety margin for surrounding moving trucks, particularly for trucks following in the rear.¹¹ Tracking the trajectory of trucks and climbing behavior recognition are two important functions that facilitate the accurate modeling of truck mobility.¹² Here, trajectory tracking is employed to estimate the dynamic features of trucks based on filtering approaches, while behavior recognition is a machine learning approach employed to recognize the specific motions of trucks based on the tracking results.

Many previous studies have focused on the tracking of vehicle trajectories on roadways. These tracking systems normally adopt algorithms that depend solely on the use of on-board sensors, such as video cameras, LIDAR, and radar for vehicle detection. However, these algorithms possess performance constraints based on the limitations of the sensors due to various factors, such as environmental impacts, the timeliness of multiple vehicle detection, and vibration.¹³ Tracking of heavy-duty trucks is further constrained when trucks travel through regions offering a narrow field of view and unexpected obstacles. However, these tracking constraints can be overcome by applying filters to constrain the detected motion of heavy-duty trucks.^14–16 Therefore, it is possible to smooth the random noise introduced by motion-detection sensors and improve the accuracy of truck tracking and subsequent motion behavior recognition. In addition, trucks commonly travel along the same routes and make routine stops, such as for cargo loading and unloading, which leads to low-sampling-rate trajectories where the average time interval between consecutive sample points is greater than 10 s. As a result, the raw trajectories of trucks following highly structured routes in urban areas have fewer sample points than those of other vehicles such as taxies. Moreover, the quality of these truck trajectories cannot be substantially improved by the simple segmentation of each sample based on the spatial proximity between them.

Machine learning algorithms, such as the Bayes classifier, decision tree, and support vector machine, are used to infer the motion behavior of heavy-duty trucks on roadways. A truck motion model based on a deep neural network was developed by Wang et al.¹⁷ Although the deep learning approach enhanced the accuracy of the resulting truck mobility modeling, the model suffered from a lack of generality.¹⁸ For example, the complete driving operations of trucks could not be predicted when the trajectory data were incomplete. Thus, the application of deep learning to truck behavior recognition has been restricted by data sparsity. In addition, the precision of the deep learning approach depends on the number of kilometers traveled by a truck, which makes it difficult to recognize the long-distance mobility of trucks. An improved multi-mode hybrid automaton (MOHA) model was developed by Lin et al.¹⁹ for truck tracking purposes. The MOHA model extracts and clusters common state sequences from the temporal information of actual trajectory data based on a discrete event model and thereby identifies different truck behaviors. However, the MOHA model performs more poorly than the existing methods^20–22 due to high time complexity. In addition, a previous study²³ introduced a unified framework for the tracking and behavior recognition of heavy-duty trucks under highway or constrained roadway driving conditions, but the algorithm did not consider the upgrade and downgrade motions of heavy-duty trucks. Furthermore, many previous studies have failed to consider the characteristics of heavy-duty trucks explicitly, such as long and fixed transportation routes and restricted traveling speeds and regions.²⁴ These characteristics of heavy-duty trucks are distinct from, for example, the characteristics of taxies.

To address these issues, this article presents an algorithm for conducting the tracking and behavior recognition of moving heavy-duty trucks along long inclined roadways. A large-scale spatiotemporal trajectory dataset is used for measurements in the algorithm. The elevation information is extracted from the sampled trajectories. Based on the elevation information, a model, denoted as the HVMove model, is constructed, which adopts a logistic regression approach and produces the probability of truck motion based on the Markov chain Monte Carlo (MCMC) simulation. Tracking filters and behavior classification provide many benefits for conducting behavior recognition. Therefore, a Kalman-based filter serves as the basis of the HVMove model for generating the probability distribution of the elevation for different time intervals. The performance of the HVMove model is evaluated using real-world data.

The main contributions of this study are described as follows:

The proposed algorithm provides unified tracking and climbing behavior recognition of moving heavy-duty trucks;

The probability distribution of the climbing motions of heavy-duty trucks is modeled using a logistic distribution;

Maps with the elevation information are generated automatically from large-scale real-world trajectories.

The remainder of this article is organized as follows. Section “Data” presents the heavy-duty truck trajectory data used by the proposed method. Section “Labeling elevations in truck trajectories” presents the spatiotemporal trajectory model based on the Kalman filter, and section “MCMC-based truck motion model” describes the HVMove model based on two elevation features. Section “Performance analysis” presents the verification results, and section “Conclusion” concludes the article.

Data

The dataset employed in this work specifically includes sections of the Beijing–Kunming Expressway, G108 National Highway, and S311 Provincial Highway in Shaanxi and Shanxi Provinces, China. General information regarding the time-stamped global navigation satellite system (GNSS) data collected within the 9-day period is summarized in Table 1. In our study, data acquisition and positioning rely on GPS and Beidou dual-mode navigation. The number of connected satellites is not less than four. The positioning error is 2.5 m, which fulfills the requirements for high-sampling-rate trajectories. Each record of this dataset contains the geographical location in the form of latitude and longitude, elevation, velocity, and time at each instance of heavy-duty truck activity, which includes traveling both up and down the slopes.

Table 1.

Summary of the GNSS data and geographic information of the areas analyzed in this study.

Data	Datatype	Sample
Timestamp of received data	DateTime	20180402112300
GNSS longitude	long int	108950656
GNSS latitude	long int	34382758
GNSS elevation (m)	short int	346
GNSS speed (km/h)	short int	69
GNSS positioning mark	char	1
GNSS southern/northern latitude mark	char	1 or 0
GNSS eastern/western longitude mark	char	0 or 1

GNSS: global navigation satellite system. The total number of data records during the time period was 57,698.

Truck coordinates are estimated from the records by on-board equipment based on a standard triangulation algorithm that provides an average coordinate error of 10 m. The spatiotemporal truck coordinate data represent the motions of heavy-duty trucks in connection with time and roadway. For example, a record from 10:25 a.m. to 11:47 a.m. on 1 April indicates a heavy-duty truck moving at low speed (<60 km/h) on a roadway with an inclined slope (elevation increasing from 344 to 542 m) for a long period of time (82 min). The data are intrinsically heterogeneous because the discrete approximate representations of geographical locations and elevations are derived using different sampling rates (e.g. every 500 m or 20 s). This work aims to provide a model that can support both the generation of origin-destination trips related to elevation and the identification of probability distributions for the motions of a heavy-duty truck for a particular period of a day based on the passive data employed.

Labeling elevations in truck trajectories

This section explains how the time-stamped GNSS records can be converted into individual trajectories with labeled elevations, which are then used to generate trip types for each heavy-duty truck. Due to the distinct sampling rates and poor data quality of the trajectories,²⁵ we first perform the following data filtering: (1) data records with the same receiving times and geographic coordinates are removed; (2) a median filter is applied to adjust data records with similar coordinates, but with significantly different elevations; (3) lost elevation data points (56 of the 57,698 records) are estimated using linear interpolation between consecutive spatiotemporal trajectory samplings; and (4) records that do not follow a strict time sequence due to factors such as variations in the geographical environment or signal quality are reordered into their proper sequences according to their sampling times.
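The four filtering steps can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the authors' implementation: the record layout (dicts with keys `t`, `lat`, `lon`, `h`), the three-point median window, and the `jump` threshold of 30 m are all hypothetical, and the sorting step is performed first because interpolation in time requires ordered records.

```python
from statistics import median


def preprocess(records, jump=30.0):
    """Clean records given as dicts with keys 't', 'lat', 'lon', 'h' (h may be None)."""
    # Step (4): restore chronological order (done first so interpolation is valid).
    ordered = sorted(records, key=lambda r: r["t"])

    # Step (1): drop records that repeat the same time and coordinates.
    seen, recs = set(), []
    for r in ordered:
        key = (r["t"], r["lat"], r["lon"])
        if key not in seen:
            seen.add(key)
            recs.append(dict(r))

    known = [i for i, r in enumerate(recs) if r["h"] is not None]
    if not known:
        return recs  # nothing to interpolate from

    # Step (3): fill missing elevations by linear interpolation in time.
    for i, r in enumerate(recs):
        if r["h"] is not None:
            continue
        before = [j for j in known if j < i]
        after = [j for j in known if j > i]
        if before and after:
            a, b = recs[before[-1]], recs[after[0]]
            span = b["t"] - a["t"]
            w = (r["t"] - a["t"]) / span if span else 0.0
            r["h"] = a["h"] + w * (b["h"] - a["h"])
        elif before:
            r["h"] = recs[before[-1]]["h"]
        else:
            r["h"] = recs[after[0]]["h"]

    # Step (2): replace elevations that deviate strongly from the local median.
    heights = [r["h"] for r in recs]
    for i in range(1, len(recs) - 1):
        m = median(heights[i - 1:i + 2])
        if abs(heights[i] - m) > jump:
            recs[i]["h"] = m
    return recs
```

The function returns new dictionaries and leaves the input list untouched, so the raw records remain available for comparison with the filtered ones.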

Although the pretreated data records can describe the routes of heavy-duty trucks on an actual map, the random noise in the elevation data based on the GNSS trajectory segments should be reduced, which is discussed as follows.

Spatiotemporal trajectory segmentation and analysis

Trajectory segmentation is a significant step prior to engaging in further trajectory filtering. Here, segmentation can be conducted using the following three methods²⁶: (1) trajectory segmentation based on time intervals²⁷—here, if the time interval between two consecutive sampling points is greater than a given threshold, the trajectory is divided into two segments between the two points; (2) trajectory segmentation based on the spatial shape of the trajectory¹¹—in this method, the trajectory is segmented at the key points that maintain the trajectory shape; and (3) trajectory segmentation based on the semantics of a trajectory point¹⁰—with this method, the trajectory is divided according to points such as a stationary or transitory point.

We combine the time-interval-based and semantics-based segmentation methods, where the semantics of the points are defined according to changes in the elevation of each truck (i.e. a truck traveling on an inclined roadway or on a declined roadway). Figure 1 presents a schematic diagram illustrating the conversion of daily trajectory records to daily changes in elevation. Here, we first partition the trajectories according to every 24-h interval, detect locations where the truck is stationary (i.e. where the truck speed is 0 m/s), and then detect trips that occur between these stationary locations. Truck motions are generated over a specific time interval by first labeling all locations according to the relative changes in the elevation. These changes in elevation can be counted to determine the probabilities with which trips in each time interval involve increasing or decreasing elevation.

Figure 1.

Schematic diagram illustrating the conversion of trajectory records of a heavy-duty truck to daily changes in elevation and trip types. The changes in elevation are observed based on stationary locations, and daily trips are measured according to the time of day between these stationary locations: (a)–(d) the probabilities of driving uphill or downhill over a specified period of time.

For example, a single truck in Figure 1(a) and (b) generates trips over the 2 single-day periods of observation, and these include 60 trips with an increase in elevation (indicated in blue), 76 trips with a decrease in elevation (marked in orange), while the remaining trips occur at a constant elevation (in white). These trips would be distributed across a single-day period based on the observation time of the stationary locations and the corresponding elevations, as shown in Figure 1(c) and (d). For example, the orange circle in Figure 1(c) indicates that 13% and 19% of all trips generated by heavy-duty trucks, respectively, exhibited an increase and decrease in elevations over the period of 6:00 a.m. to approximately 10:00 a.m. on a Saturday. Based on these visualizations from the prepared digital map, we can observe that the trips of heavy-duty trucks within the suburban and mountainous areas are more concentrated during the peak hours (6:00 to 8:00 a.m.) on Saturday and the late night hours (8:00 to 12:00 p.m.) on Monday due to their larger trip distances and less traffic. This procedure aims to generate a representative sample of trips to account for the travel choices of heavy-duty trucks within the suburban and mountainous areas of the region, as well as to label the elevation information on an actual map. Then, the elevation data noise was analyzed by detecting the short-term stationary points (SSPs) of trucks as follows.

Definition 1 (SSP)

The SSP represents a segment of geographic data where a heavy-duty truck traveled at a speed of 0 m/s over a specified time interval. The extraction of an SSP depends on two scale parameters denoted as the time threshold $(τ)$ and the distance threshold $(δ)$ . Formally, a single SSP can be obtained from a spatiotemporal trajectory characterized by points $(x, y)_{i} \to \dots \to (x, y)_{j}$ that satisfy the conditions $\forall k \in [i, j), Dist ((x, y)_{k}, (x, y)_{k + 1}) < δ, Int ((x, y)_{i}, (x, y)_{j}) < τ$ , where $Dist (,)$ denotes the geospatial network-based distance between two points, $Int (,)$ is the time interval between two points, and x and y are the latitude and longitude, respectively.
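Definition 1 can be illustrated with a short Python sketch. It is written under assumptions that are not part of the source: points are tuples `(lat, lon, t, speed)` with coordinates in degrees and time in seconds, the great-circle (haversine) distance stands in for the network-based distance $Dist (,)$, and an SSP must contain at least two points. The function names are hypothetical.

```python
from math import asin, cos, radians, sin, sqrt


def haversine_m(p, q):
    """Great-circle distance in metres between two (lat, lon) pairs in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (p[0], p[1], q[0], q[1]))
    a = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371000.0 * asin(sqrt(a))


def extract_ssps(points, tau, delta):
    """Return inclusive (start, end) index pairs of SSPs in a list of (lat, lon, t, speed)."""
    ssps, start = [], None
    for k, (lat, lon, t, speed) in enumerate(points):
        # The current run is extended only while all conditions of Definition 1 hold.
        extend = (
            start is not None
            and speed == 0
            and haversine_m(points[k - 1][:2], (lat, lon)) < delta
            and t - points[start][2] < tau
        )
        if extend:
            continue
        if start is not None and k - 1 > start:
            ssps.append((start, k - 1))
        # A new candidate run can only begin at a stationary point.
        start = k if speed == 0 else None
    if start is not None and len(points) - 1 > start:
        ssps.append((start, len(points) - 1))
    return ssps
```

Returning index pairs instead of copied points keeps the elevation samples of each SSP addressable in the original trajectory, which is convenient for the subsequent denoising step.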

Example SSPs extracted from pretreated trajectory data are shown in Figure 2. The distribution of random noise in the elevation data is more clearly observable from the expanded data presented in the inset of Figure 2. Therefore, a Kalman filter is applied to smooth out short-term fluctuations in the time series data and thereby highlight the long-term trends of truck motions.

Figure 2.

Illustration of short-term stationary points (SSPs) with random noise.

Denoising by Kalman filter

A Kalman filter is particularly advantageous for processing continuous noisy data points compared with median or mean filtering.²⁸ With a Kalman filter, we define the state model for predicting the elevation of trucks as

H (k) = Ψ H (k - 1) + ϒ W (k)

(1)

where for the kth state at a single point in time, $H (k)$ is the elevation matrix, $W (k)$ is the noise matrix, and $Ψ = [1]$ and $ϒ = [1]$ are the state transformation matrices. The observation model used to obtain the elevation data from GNSS trajectories is then given for the kth state as

Y (k) = NG (k) + D (k)

(2)

where $Y (k)$ is the observed elevation matrix based on GNSS data, $N = [1]$ is the observation matrix, $G (k)$ is the true elevation matrix, and $D (k)$ is the observed noise matrix. State and observed values can be connected. Therefore, the elevation in the kth state is estimated using equations (1) and (2) based on the last elevation value. The covariance matrix of the (k− 1)th elevation is then updated

P (k | k - 1) = Ψ P (k - 1 | k - 1) Ψ^{T} + Q (k - 1)

(3)

where the parameters are determined using the Kalman filtering, and $Q$ is the covariance matrix of $W$ . The Kalman gain $Kg$ is constructed for allocating the weights of the predicted and observed values as follows

Kg (k) = P (k | k - 1) N^{T} [N P (k | k - 1) N^{T} + R (k - 1)]^{- 1}

(4)

where $R (k)$ is the covariance matrix of $D (k)$ . According to the predicted value in equation (1) and the observed value in equation (2), the elevation of the kth state is obtained based on the weight calculated by equations (3) and (4). The elevation estimate and the matrix $P$ for the corresponding state are then updated as follows

H (k | k) = H (k | k - 1) + Kg (k) [Y (k) - H (k | k - 1)]

P (k | k) = (I - Kg (k) N) P (k | k - 1)

(5)

where I is a unit matrix. Thus, the elevation data of each sampled point are obtained using equations (3)–(5).
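Because $Ψ$ , $ϒ$ , and $N$ are all $[1]$ , the filter reduces to a scalar recursion. The following Python sketch illustrates that recursion; the process noise variance `q`, the observation noise variance `r`, and the initial covariance `p0` are hypothetical values chosen for illustration and are not taken from the source.

```python
def kalman_1d(observations, q=0.01, r=4.0, p0=1.0):
    """Smooth a sequence of elevation observations with a scalar Kalman filter."""
    h = observations[0]  # initial state estimate
    p = p0               # initial estimate covariance
    smoothed = []
    for y in observations:
        # Prediction, equations (1) and (3) with Psi = 1.
        h_pred = h
        p_pred = p + q
        # Kalman gain, equation (4) with N = 1.
        kg = p_pred / (p_pred + r)
        # Update of the estimate and its covariance, equation (5).
        h = h_pred + kg * (y - h_pred)
        p = (1.0 - kg) * p_pred
        smoothed.append(h)
    return smoothed
```

A larger ratio `r / q` trusts the prediction more and yields a smoother elevation profile; a smaller ratio follows the raw GNSS elevations more closely.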

Elevation-labeled trajectory generation

We define the trajectory model labeled by the elevation data of a heavy-duty truck in the present work as follows.

Definition 2 (elevation-labeled trajectory)

An elevation-labeled trajectory (E-Tra) is a sequence of time-stamped points $p_{0} \to p_{1} \to \dots \to p_{k}$ , where $p_{i} = (lat, lon, t, h, s), (i = 0, 1, \dots, k)$ , in which lat and lon are the latitude and longitude, respectively, t is the timestamp, h is the elevation, s is the speed, and the timestamps are strictly increasing, that is, $\forall 0 \leq i < k$ , $p_{i + 1}.t > p_{i}.t$ .
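As a minimal illustration, the trajectory point and the ordering condition of Definition 2 can be expressed as a small data structure. The class and function names, as well as the units noted in the comments, are assumptions made for this sketch.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ETraPoint:
    lat: float  # latitude (degrees, assumed)
    lon: float  # longitude (degrees, assumed)
    t: float    # timestamp (seconds, assumed)
    h: float    # elevation (m)
    s: float    # speed


def is_valid_etra(points: List[ETraPoint]) -> bool:
    """Check that timestamps are strictly increasing along the trajectory."""
    return all(b.t > a.t for a, b in zip(points, points[1:]))
```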

The E-Tra of heavy-duty trucks on an actual map is illustrated in Figure 3, in which each trajectory point is marked by the obtained elevation. The transport routes of the trucks and road conditions (such as gradient and length of slope) can be extracted from the map using the E-Tra model.

Figure 3.

Trajectories marked by elevation on an actual map.

MCMC-based truck motion model

The elevation data optimized by Kalman filtering form the basis for extracting the features and for modeling the probability distribution of heavy-duty trucks moving along long inclined roadways. The MCMC simulation was used in conjunction with the E-Tra model to determine the distribution of the features.

Feature extraction

Both the relative elevation difference (rED) and the sum of continuous elevation differences (i.e. the elevation difference summation—EDS) are used to track heavy-duty truck motions along an inclined slope. Therefore, they can be used as representations of truck behaviors. To extract the rED and EDS, we first model changes in the elevation between two consecutive trajectory points and obtain the duration for which a truck travels with the same type of motion, such as where the truck is continuously ascending or descending over a given time interval. Algorithm 1 was developed for extracting these features from the trajectories. Here, if the ith and (i + 1)th rED values are both positive or both negative, the ith rED value is accumulated in the EDS.

Algorithm 1. Elevation characteristics extraction
Input: Sequence of rED
Output: Sequence of EDS
1. Begin
2. $EDS_{k}, EDS \leftarrow \emptyset$
3. While $(hd_{i} \in rED)$ do:
4. If $hd_{i} \times hd_{i + 1} > 0$ do:
5. $EDS_{k} \leftarrow hd_{i}$
6. Else do:
7. $Sum (EDS_{k})$
8. k++
9. end while
10. $EDS \leftarrow EDS_{k}$
11. return EDS
12. end
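One possible reading of Algorithm 1 is sketched below in Python. It is an interpretation and not the authors' code: each maximal run of consecutive rED values with the same sign is summed into one EDS value, the last element of a run is included in its sum, and a zero difference closes the current run. The function names are hypothetical.

```python
def relative_elevation_differences(elevations):
    """rED: difference between consecutive elevation samples."""
    return [b - a for a, b in zip(elevations, elevations[1:])]


def eds_from_red(red):
    """EDS: sum of each maximal run of same-sign rED values."""
    eds, run = [], 0.0
    for i, hd in enumerate(red):
        run += hd
        is_last = i == len(red) - 1
        # A sign change (or a zero difference) ends the current run.
        if is_last or hd * red[i + 1] <= 0:
            eds.append(run)
            run = 0.0
    return eds
```

For an elevation profile that climbs, descends, and climbs again, the output contains one positive, one negative, and one positive value, each equal to the total elevation change of the corresponding stretch.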

HVMove model

We considered the climbing behavior of heavy-duty trucks as a movement from a level roadway to an inclined roadway. The distribution of this behavior is consistent with the shape of a logistic distribution. The aim of our proposed method is to model the probability distribution of truck climbing behavior denoted by M under a given EDS. To achieve this goal, we utilize logistic regression to express the probability of $M \in \{m_{i}, m_{j}\}$ as follows

p (M = m_{i} | EDS) = \frac{1}{1 + e^{β (EDS)}}

(6)

where $m_{i} = 0$ denotes the state of a truck traveling on a level roadway, and $m_{j} = 1$ denotes the truck state of moving along an inclined roadway. We can estimate the parameter $β$ by extending the MCMC simulation and thereby improve the HVMove model to fit well with the actual observation data, as follows

p (M = m_{i} | EDS) = \frac{1}{1 + e^{[β (EDS) + α]}}

(7)

The parameter $β$ determines the gradient of the model, while the tuning parameter $α$ alters the relative position of the distribution. The effects of the values of $β$ (for $α = 0$ ) and $α$ (with different values of $β$ ) on the distribution of the HVMove model are illustrated in Figure 4(a) and (b), respectively. Because the derivative of equation (7) with respect to EDS has the opposite sign to $β$ , the probability as written decreases with EDS when $β > 0$ and increases when $β < 0$ (Figure 4(a)). The curve passes through 0.5 at $EDS = - α / β$ , so changing $α$ offsets the distribution to the left or to the right, with the direction depending on the sign of $β$ (Figure 4(b)). We note that the model reduces to equation (6) when $α = 0$ .
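The effect of $α$ and $β$ can be checked numerically by evaluating equation (7) directly. The following Python sketch does so; the function name and the overflow guard are additions made for illustration only.

```python
from math import exp


def hvmove_probability(eds, beta, alpha=0.0):
    """Evaluate equation (7): 1 / (1 + exp(beta * EDS + alpha))."""
    z = beta * eds + alpha
    if z > 700.0:  # exp(z) would overflow; the probability is numerically zero
        return 0.0
    return 1.0 / (1.0 + exp(z))
```

Evaluating the function on a grid of EDS values for several $(α, β)$ pairs reproduces the kind of curve family described for Figure 4. Algebraically, the output equals 0.5 exactly where $β (EDS) + α = 0$ .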

Figure 4.

Influences of parameters $α$ and $β$ on the probability distribution of the HVMove model: (a) varying $β$ and (b) varying $α$ and $β$ .

Parameter tuning

To find an appropriate model for describing the joint distribution of parameters $α$ and $β$ , we study the parameters under a two-dimensional Gaussian distribution. Accordingly, a Gaussian distribution with a mean value $μ = 0$ and a standard deviation $σ = 0.05$ was established, as shown in Figure 5. We generated 100 points between 0 and 5 at random by employing this Gaussian distribution and visualized them as a heat map. Random sampling was then performed in the high-density (hot) regions to find an approximate solution for the HVMove model.

Figure 5.

Coefficient space with a Gaussian distribution taken as the a priori probability distribution.

We then vary the values of $μ$ and $σ$ in the Gaussian distribution to estimate the optimal parameters $α$ and $β$ based on the MCMC simulation. To estimate the probability distribution of truck motion for given parameters, we consider the HVMove model as a 0–1 Bernoulli-variable-based representation, where the Bernoulli variable value of 0 denotes traveling on a level roadway and 1 denotes traveling on an inclined roadway. This is expressed as follows

p (M = m_{i} | EDS) = Ber (\frac{1}{1 + e^{[β (EDS) + α]}})

(8)

We next show how to tune the parameters in the HVMove model step by step. We first use the Metropolis–Hastings algorithm to produce sample states of truck motion and evaluate the associated transition probabilities between two states with a generated Markov chain. The Markov chain structure of the HVMove model using the MCMC simulation is illustrated in Figure 6. Algorithm 2 describes the parameter tuning process, where an a priori Gaussian distribution $N (μ, σ^{2})$ is used to establish the Markov chain of the distribution $p (M | EDS)$ . In Algorithm 2, a new state y is obtained from the Gaussian distribution, and the acceptance rate, a, of this state is then calculated. In addition, comparing a with a random variable u from a uniform distribution U generates a new state and associated Markov chain.

Figure 6.

A Markov chain according to the distribution $p (M | EDS)$ . The acceptance rate $a$ in the state $M_{k - 1}$ is calculated, and its value is compared with an independent random variable $u$ selected from a uniform distribution U. The proposed state $y$ is accepted when $a > u$ , and the chain moves from the current state $x$ to $y$ ; otherwise, $y$ is rejected and the chain remains at $x$ .

Algorithm 2. Generating a Markov chain
Input: Gaussian distribution $N (μ, σ^{2})$ , current state $x$ , time k, uniform distribution U~[0,1]
Output: $p (M)$
1. Begin
2. for each $k$ do:
3. $M_{k} \leftarrow x$
4. $y \leftarrow N$
5. calculate acceptance rate a
6. $u \leftarrow U (x)$
7. if $u < a$ do:
8. $p (M) \leftarrow y$
9. else do:
10. $p (M) \leftarrow x$
11. end if
12. end for
13. return $p (M)$
14. end

After identifying the transition probability, we then apply the MCMC simulation to estimate $α$ and $β$ according to the last state in each iteration. If the parameters fit with the actual data distribution, the current state is accepted; otherwise, the current state is rejected. Therefore, the sample set of each parameter can be obtained. By maximizing the likelihood of all samples, the optimal parameters can be learned, further building the HVMove model.
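The sampling loop can be illustrated with a basic random-walk Metropolis–Hastings sampler for $(α, β)$ . The sketch below rests on several assumptions that go beyond the source: the Bernoulli success probability of equation (8) is taken as the probability of label 1, the proposal is a symmetric Gaussian step centered on the current state, a zero-mean Gaussian prior with standard deviation `prior_sigma` is placed on both parameters, and all function names and default values are hypothetical. It is not the extended algorithm developed in this article.

```python
import random
from math import exp, log


def log_likelihood(alpha, beta, eds, labels):
    """Bernoulli log-likelihood with p(label = 1) = 1 / (1 + exp(beta * x + alpha))."""
    total = 0.0
    for x, m in zip(eds, labels):
        z = beta * x + alpha
        # log(1 + e^z), computed without overflow for large |z|.
        softplus = z + log(1.0 + exp(-z)) if z > 0 else log(1.0 + exp(z))
        total += -softplus if m == 1 else z - softplus
    return total


def metropolis_hastings(eds, labels, n_iter=5000, step=0.5, prior_sigma=10.0, seed=0):
    """Return a list of (alpha, beta) samples forming the Markov chain."""
    rng = random.Random(seed)

    def log_posterior(alpha, beta):
        log_prior = -(alpha ** 2 + beta ** 2) / (2.0 * prior_sigma ** 2)
        return log_prior + log_likelihood(alpha, beta, eds, labels)

    x = (0.0, 0.0)            # current state
    lp_x = log_posterior(*x)
    chain = []
    for _ in range(n_iter):
        # Propose a new state y near the current state x.
        y = (x[0] + rng.gauss(0.0, step), x[1] + rng.gauss(0.0, step))
        lp_y = log_posterior(*y)
        # Acceptance rate a = min(1, posterior(y) / posterior(x)).
        a = exp(min(0.0, lp_y - lp_x))
        u = rng.random()
        if u < a:
            x, lp_x = y, lp_y  # accept y; otherwise the chain stays at x
        chain.append(x)
    return chain
```

Working with log-probabilities avoids numerical underflow when many labeled samples are multiplied together, and the fixed `seed` makes a run reproducible.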

Behavior recognition

The proposed behavior recognition algorithm is given in Algorithm 3. First, SSPs are extracted. The elevation data are optimized by Kalman filtering. Then, the EDS is obtained using Algorithm 1. The Markov chain of the distribution is established using Algorithm 2. Parameters $α$ and $β$ are obtained using the MCMC to build the HVMove model, which conducts heavy-duty truck motion behavior recognition.

Algorithm 3. Ramp-climbing behavior recognition
Input: Dataset N, dataTime T, elevation h, longitude lon, latitude lat, speed s, a set of SSPs denoted as $SSH$ , time span $τ$ , search radius $δ$
Output: Recognition probability p
1. Begin
2. for each $i$ in N do:
3. if $Int (N [0], N [i + 1]) < τ$ AND $Dist ((N [i] . lon, N [i] . lat), (N [i + 1] . lon, N [i + 1] . lat)) < δ$ AND $N [i] . speed = 0$ , put $N [i] . h$ in $SSH$
4. end if
5. end for
6. $Kalman (SSH)$
7. $LinearRegression (SSH)$
8. Calculate rED on SSH
9. while rED ≥ 0 do:
10. Calculate EDS
11. end while
12. $MCMC (EDS)$
13. Generate the HVMove model with $LogisticRegression (α, β)$
14. Calculate p using the HVMove model
15. return p
16. end

Performance analysis

Experimental setup

We used the GNSS trace dataset of heavy-duty trucks presented in section “Data.” A total of 70 trajectory sequences were obtained. Sequences with fewer than 100 trajectory points were removed, leaving 48 sequences for fitting. Of these, the longest and shortest sequences comprised 2526 and 100 points, respectively. We then performed operations such as data cleansing and normalization to obtain the required elevation trajectories. Figure 7 illustrates the effect of data processing by comparing the original SSP data of heavy-duty trucks with the processed data. The short-term fluctuations in the data segment declined significantly after Kalman filtering.

Figure 7.

Elevation trajectories preprocessed using a Kalman filter.

We randomly split the above 48 sequences into training and test sequences according to the ratio 7:3, that is, 70% of the data were used for training and the remaining 30% were used for testing. Measurement errors were identified using two different models. In the first model, we analyzed the joint distribution of the time and the elevation for each truck. To find an appropriate model to describe this two-dimensional joint distribution, we assume that a linear relationship exists between the time sample $T (t_{1}, t_{2}, \dots, t_{n})$ and the elevation sample $H (h_{1}, h_{2}, \dots, h_{n})$ , that is, $T$ and $H$ follow a joint distribution $P (t, h)$ . Supervised learning was used to determine the joint time and elevation distribution model, which is given as follows

h_{n} (t, θ) = θ_{0} + θ_{1} t

(9)

In the second model, the joint distribution of time and elevation was modeled based on a polynomial, as follows

h_{n} (t, θ) = θ_{0} + \sum_{j = 1}^{N} θ_{j} t^{j}

(10)
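Equations (9) and (10) can both be fitted by ordinary least squares, with equation (9) being the special case of degree 1. The following Python sketch solves the normal equations with Gauss-Jordan elimination and evaluates the MAE; it is an illustration under the assumption of a plain least-squares fit, and the function names are hypothetical. At least `degree + 1` distinct time values are required, otherwise the system is singular.

```python
def polyfit(t, h, degree):
    """Least-squares coefficients theta_0..theta_degree for h ~ sum_j theta_j * t**j."""
    n = degree + 1
    # Normal equations A theta = b with A[r][c] = sum t^(r+c) and b[r] = sum h * t^r.
    a = [[sum(x ** (r + c) for x in t) for c in range(n)] for r in range(n)]
    b = [sum(y * x ** r for x, y in zip(t, h)) for r in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(n):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [ar - f * ac for ar, ac in zip(a[r], a[col])]
                b[r] -= f * b[col]
    return [b[i] / a[i][i] for i in range(n)]


def predict(theta, t):
    """Evaluate the fitted polynomial at time t."""
    return sum(c * t ** j for j, c in enumerate(theta))


def mae(theta, t, h):
    """Mean absolute error between predictions and observed elevations."""
    return sum(abs(predict(theta, x) - y) for x, y in zip(t, h)) / len(t)
```

Comparing `mae(polyfit(t, h, 1), t_test, h_test)` with the value for a higher degree on held-out data gives a simple way to contrast the two models.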

The testing set was used to evaluate the tracking performance of the two models by comparing the estimation error with the measurement error. Figure 8(a) presents the residuals of the two models, demonstrating that these models are suitable for defining the joint distribution of the elevation and time. Figure 8(b) illustrates the differences between the observations and predictions produced by the two models. The differences are generated using the mean absolute error (MAE) function. Because a straight line is applied to fit the data points, the slopes of the two lines represent the elevation estimation errors, where the larger the slopes, the smaller the errors. Figure 9 illustrates the uncertainties of the two models for four segments of SSPs based on the following expression

uA = \sqrt{\frac{\sum_{i = 1}^{n} {(σ_{i} - \bar{σ})}^{2}}{(n - 1) n}}

(11)

where n is the segment length. We can observe that the uncertainty of the linear model is less than that of the polynomial model when considering each segment with a given elevation range. Thus, we analyzed the errors of all data points from the trajectories using the linear regression method.
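The uncertainty of a segment can be computed in a few lines. The sketch below assumes the usual form of the Type A uncertainty, that is, the standard error of the mean with squared deviations, and treats its input as the per-point error values $σ_{i}$ of one segment; the function name is hypothetical.

```python
from math import sqrt


def type_a_uncertainty(values):
    """Standard uncertainty of the mean of a segment with n >= 2 values."""
    n = len(values)
    mean = sum(values) / n
    return sqrt(sum((v - mean) ** 2 for v in values) / (n * (n - 1)))
```

Because of the factor $n (n - 1)$ in the denominator, longer segments with the same spread yield a smaller uncertainty.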

Figure 8.

Estimation errors of the elevation for heavy-duty trucks using two models: (a) illustrates the residuals of the two models when modeling the joint distribution of the time and the elevation and (b) shows the differences between the observations and predictions by plotting the straight lines.

Figure 9.

Uncertainties for four segments of SSPs with elevation ranges of 300–390, 400–460, 570–600, and 650–700 m.

After calibration of the trajectory by offsetting the error using linear regression, we can label the ground-truth data in the dataset. Data with elevation values greater than or equal to a threshold were treated as positive samples (i.e. an increasing elevation), while the remaining data were treated as negative samples. To optimize the threshold for the ground-truth labels, we performed 10-fold cross validation and identified the optimal threshold as 6.06 by minimizing the MAE of each sequence in all 48 sequences. We obtained 7119 sequences without SSPs. We consider these as our dataset and label their ground truth, that is, $\{EDS (h_{i + 2^{i}} \cdot h_{2^{i}}), L_{m}\}_{m = 1}^{7119}$ , where EDS(·) defines the EDS for a set of continuous trajectory points if and only if $h_{2^{i}} - h_{i + 2^{i}} > 0$ as i increases. The label is denoted as $L_{m} \in \{0, 1\}$ . The labeled sequences are then input into the HVMove model for recognizing the climbing behaviors of heavy-duty trucks.

Parameter study

Figure 10(a) presents the distributions of $α$ (top) and $β$ (bottom) with 5000 samples during HVMove model training. Considering that a greater number of iterations would increase the estimation accuracy in MCMC, we selected 500 $α$ and $β$ values in the posterior section for calculating their probability distributions.

Figure 10.

(a) Distribution of $α$ and $β$ . (b) Correlation analysis of $α$ and $β$ .

The recognition uncertainty increases as the parameters become more widely distributed, and the overlap between the sampled trajectories of heavy-duty trucks traveling on level and on inclined roadways has been investigated. Therefore, $α$ and $β$ were averaged to determine the posterior distribution of the motion behavior of trucks. The autocorrelations of the $α$ and $β$ samples are shown in Figure 10(b), from which it can be seen that the convergent coefficients of $α$ and $β$ maintained a decreasing trend and finally converged as the number of iterations reached about 20. Values of $α = - 5.069$ and $β = - 11.507$ were then determined by taking the mean of all the parameter values in the steady state.
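The two diagnostics used here, the sample autocorrelation of a chain and the mean over its steady-state portion, can be sketched in plain Python as follows. The function names, the choice to keep a fixed number of trailing samples, and the toy chain are assumptions made for illustration; they are not the sampler output of this study.

```python
def autocorrelation(samples, lag):
    """Sample autocorrelation of a chain at a non-negative lag."""
    n = len(samples)
    if lag < 0 or lag >= n:
        raise ValueError("lag must satisfy 0 <= lag < len(samples)")
    mean = sum(samples) / n
    variance_sum = sum((s - mean) ** 2 for s in samples)
    if variance_sum == 0.0:
        raise ValueError("autocorrelation is undefined for a constant chain")
    covariance_sum = sum(
        (samples[i] - mean) * (samples[i + lag] - mean) for i in range(n - lag)
    )
    return covariance_sum / variance_sum


def steady_state_mean(samples, keep_last):
    """Mean of the last keep_last samples, discarding the earlier ones."""
    if keep_last <= 0 or keep_last > len(samples):
        raise ValueError("keep_last must be between 1 and len(samples)")
    kept = samples[-keep_last:]
    return sum(kept) / len(kept)


# Toy chain: three transient values followed by a settled portion
chain = [0.0, -2.0, -4.0, -5.0, -5.2, -4.8, -5.0]
alpha_estimate = steady_state_mean(chain, keep_last=4)
rho_0 = autocorrelation(chain, 0)
rho_1 = autocorrelation(chain, 1)
print(alpha_estimate, rho_0, rho_1)
```

In practice, the lag at which the autocorrelation settles near zero gives a rough indication of how many iterations separate nearly independent samples.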

Performance of truck motion identification

The motion behavior recognition results for trucks moving on level and inclined roadways are shown in Figure 11. The behavior recognition algorithm based on the HVMove model classifies motion behavior as either climbing (CL) or flat road (FR) motion. However, we note from the figure that the HVMove model classifies the truck behavior in some regions as both CL and FR. This is because the elevation variance is very small and the target truck changes its speed continuously. Therefore, the FR behavior probabilities are dominant compared to those of CL behavior. For scenarios in which the elevation changes, the behavior recognition algorithm mostly classifies the behavior as CL.
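A probability-based CL/FR decision of this kind can be sketched as follows. The logistic form $p = 1 / (1 + e^{β x + α})$, the 0.5 cutoff, the function names, and the parameter values in the usage lines are all assumptions chosen for illustration; the exact link function and input scaling of the HVMove model are not restated in this section, so the fitted values of $α$ and $β$ are deliberately not plugged in here.

```python
import math


def climbing_probability(x, alpha, beta):
    """Probability of CL under an assumed logistic form.

    Assumed form: p = 1 / (1 + exp(beta * x + alpha)), where x is an
    elevation-change feature (scaling is an assumption of this sketch).
    """
    z = beta * x + alpha
    if z >= 0:
        # Rewrite to avoid overflow in exp for large positive z
        e = math.exp(-z)
        return e / (1.0 + e)
    return 1.0 / (1.0 + math.exp(z))


def classify(x, alpha, beta, cutoff=0.5):
    """Return 'CL' if the CL probability reaches the cutoff, else 'FR'."""
    return "CL" if climbing_probability(x, alpha, beta) >= cutoff else "FR"


# Hypothetical parameters, not the fitted values reported in this article
alpha, beta = 2.0, -1.0
print(classify(0.0, alpha, beta))  # small elevation change
print(classify(5.0, alpha, beta))  # large elevation change
```

Under this assumed form, a negative $β$ makes the CL probability increase with the elevation change, and inputs near the point where $β x + α = 0$ yield probabilities close to 0.5, which is one way that regions with very small elevation variance can end up labeled ambiguously.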

Figure 11.

Behavior reasoning results for the CL scenario.

Conclusion

This study developed the HVMove model using spatiotemporal characteristics and pattern learning extracted from large-scale GNSS trajectory data to effectively model and predict the ramp-climbing behavior of heavy-duty commercial trucks. First, an elevation-labeled trajectory model, called the E-Tra model, was established based on sampled trajectory data on a real map. The model provides the positioning, time, altitude, and instantaneous speed of the truck, followed by the temporal segmentation of the trajectory data and optimization using Kalman filtering. The characteristics of the processed data, for instance, the state transition, were extracted and represented by logistic regression, and the characteristic distribution of the ED was established using the MCMC simulation, which thereby determined the traveling mode characteristics of heavy-duty trucks. The HVMove model was finally established after determining the model parameters using the Metropolis–Hastings algorithm. In addition, the influence of the volume of sampled trajectory data on the predicted probability of ramp-climbing behavior was analyzed. The HVMove model can be integrated with a commercial in-car sensor system to track and identify the truck’s movements in time and to further predict and adapt the behaviors to minimize security risk while driving.

The model proposed herein was created primarily using the characteristics of elevation and time, whereas the influences of other characteristics such as truck speed, positioning marks, and fuel consumption on truck ramp-climbing behavior were not considered. In addition, only a single sampling method was used. Therefore, it is advisable to model and predict additional mobility features of heavy-duty trucks using different sampling methods and the characteristics of multi-source data in the future.

Footnotes

Handling Editor: Rodolfo Meneguette

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was funded by the Key Science and Technological Innovation Team of Shanxi Province, China (Grant No. 2017KCT-29) and the Key Research and Development Plan Project of the Shaanxi Province, China (Grant Nos 2019ZDLGY17-08 and 2019ZDLGY03-0901).

ORCID iD

Lei Tang

References

1. Gui, Wang, Fang. Study on the mountainous freeway vertical alignment safety based on typical truck climbing characteristics in China. In: Proceedings of the first international conference on transportation information and safety (ICTIS), Wuhan, China, June 30–July 2 2011. Reston, VA: American Society of Civil Engineers.

2. Lei, Jia, et al. Maximal gradient of highway in high-altitude area based on typical truck’s climbing performance. J Tongji Univ (Nat Sci) 2017; 45(6): 854–860.

3. Fang. Study on the operation speed of traffic safety in mountainous area highway. Master’s Thesis, Huazhong University of Science and Technology, Wuhan, China, 2015.

4. Castanedo, Pesquera MÀ, Casares-Hontañón, et al. Efficient route of freight transport by road, evaluated with innotransmer. Procedia Soc Behav Sci 2014; 160: 634–643.

5. Walnum, Simonsen. Does driving behavior matter? An analysis of fuel consumption data from heavy-duty trucks. Transp Res D Transp Environ 2015; 36: 107–120.

6. Perugu, Wei, Yao. Integrated data-driven modeling to estimate PM2.5 pollution from heavy-duty truck transportation activity over metropolitan area. Transp Res D Transp Environ 2016; 46: 114–127.

7. Yao, Wei, Perugu, et al. Sensitivity analysis of project level MOVES running emission rates for light and heavy duty vehicles. J Traffic Transp Eng (Engl Ed) 2014; 1(2): 81–96.

8. Lei, Hajiesmaili, Chen, et al. Energy-efficient timely transportation of long-haul heavy-duty trucks. IEEE Trans Intell Transp Syst 2017; 19: 2099–2113.

9. Yao, Wei, Liu, et al. Statistical vehicle specific power profiling for urban freeways. Procedia Soc Behav Sci 2013; 96: 2927–2938.

10. Henriksson, Flardh, Martensson. Optimal speed trajectory for a heavy duty truck under varying requirements. In: Proceedings of the 2016 IEEE 19th international conference on intelligent transportation systems (ITSC), Rio de Janeiro, Brazil, 1–4 November 2016. New York: IEEE.

11. Zhao, Jing. Optimization of postal express line network under mixed driving pattern of trucks. Transp Res E Logist Transp Rev 2015; 77: 147–169.

12. Lee, Kim, et al. Tracking and behavior reasoning of moving vehicles based on roadway geometry constraints. IEEE Trans Intell Transp Syst 2017; 18: 460–476.

13. Wang, Bebis, Miller. Overtaking vehicle detection using dynamic and quasi-static background modeling. In: Proceedings of the 2005 IEEE computer society conference on computer vision & pattern recognition, San Diego, CA, 21–23 September 2005. New York: IEEE.

14. Yin, Peng. Fast and low-power behavior analysis on vehicles using smartphones. In: Proceedings of the 2017 6th international symposium on next generation electronics, Keelung, 23–25 May 2017. New York: IEEE.

15. Chen, Zhu, et al. Fine-grained abnormal driving behaviors detection and identification with smartphones. IEEE Trans Mob Comput 2017; 16(8): 2198–2212.

16. Zheng, Hansen JHL. Lane-change detection from steering signal using spectral segmentation and learning-based classification. IEEE Trans Intell Veh 2017; 2: 14–24.

17. Wang, Jiang, et al. Capturing car-following behaviors by deep learning. IEEE Trans Intell Transp Syst 2018; 19(3): 910–920.

18. Panwai, Dia. Comparative evaluation of microscopic car-following behavior. IEEE Trans Intell Transp Syst 2005; 6(3): 314–325.

19. Lin, Zhang, Verwer, et al. MOHA: a multi-mode hybrid automaton model for learning car-following behaviors. IEEE Trans Intell Transp Syst 2018; 20(2): 790–796.

20. Van Hinsbergen, Schakel, Knoop, et al. A general framework for calibrating and comparing car-following models. Transportmetrica A 2015; 11(5): 420–440.

21. Higgs, Abbas. Segmentation and clustering of car-following behavior: recognition of driving patterns. IEEE Trans Intell Transp Syst 2015; 16(1): 81–90.

22. Gazis, Herman, Rothery. Nonlinear follow-the-leader models of traffic flow. Oper Res 1961; 9(4): 545–567.

23. Kim, Lim, et al. Curvilinear-coordinate-based object and situation assessment for highly automated vehicles. IEEE Trans Intell Transp Syst 2015; 16(3): 1559–1575.

24. Bakhtyar, Holmgren. A data mining based method for route and freight estimation. Procedia Comput Sci 2015; 52(1): 396–403.

25. J-J, Zheng, Chi, et al. Trajectory big data: data, applications and techniques. J Commun 2015; 36(12): 97–105.

26. Luo, Xin. Personalized travel route recommendation using collaborative filtering based on GPS trajectories. Int J Digit Earth 2018; 11(12): 1–24.

27. Chou, Hsia, Lan. A hybrid approach on multi-objective route planning and assignment optimization for urban lorry transportation. In: Proceedings of the 2017 international conference on applied system innovation, Sapporo, Japan, 13–17 May 2017. New York: IEEE.

28. Abuali. Advanced vehicular sensing of road artifacts and driver behavior. In: Proceedings of the 2015 IEEE symposium on computers and communication, Larnaca, Cyprus, 6–9 July 2016. New York: IEEE.