
Abstract

Autonomous vehicles (AVs) have moved from hype to reality as their penetration and acceptance rates continue to increase. As they are slowly integrated into traffic with human-driven vehicles (HDVs), it is necessary to predict the car-following behaviors of AVs and HDVs for better control of AV–HDV mixed traffic. This study extends a data-driven car-following model to incorporate drivers’ memory and cooperation with the lead vehicle. The model predicts the following vehicle’s speed in AV–HDV mixed traffic. The effect of drivers’ cooperation on car-following behavior was modeled using prospect theory (PT), whereas the driver’s memory was incorporated using the memory cell of a long short-term memory (LSTM) neural network. This extended car-following model is called the “PT-LSTM model.” Real-world vehicle trajectories of HDVs and AVs in the Waymo AV Open Dataset were used to calibrate and validate the PT-LSTM model. The PT-LSTM model demonstrated higher accuracy compared with the LSTM model that did not consider drivers’ cooperation, the multilayer perceptron model, Gipps’ model, and the intelligent driver model that incorporated PT. The importance of variables in different time steps in the PT-LSTM model was also evaluated using SHapley Additive exPlanations (SHAP). The SHAP results showed that AV followers were more likely to cooperate with the lead HDV, whereas HDV followers were more likely to cooperate with the lead AV than the lead HDV. Thus, this study underscores the importance of considering drivers’ memory and cooperation with the lead vehicle for the prediction of car-following behaviors in AV–HDV mixed traffic.

Keywords

data and data science, artificial intelligence, machine learning (artificial intelligence)

Autonomous vehicles (AVs) were developed to solve traffic problems such as congestion, safety, and pollution. Although the penetration rate of AVs in traffic is still very low, it is predicted that AVs will become widespread between 2030 and 2040 ( 1 ). This prediction implies that it will take some time until all vehicles in traffic are AVs.

As more AVs are mixed with human-driven vehicles (HDVs), their interaction will be more frequent, and new car-following behaviors will be observed because of their different vehicle performance characteristics. Drivers’ car-following behavior generally depends on the lead vehicle’s motion; for example, drivers reduce speed if the lead vehicle decelerates ( 2 ). Thus, their car-following behavior is likely to vary with different types of lead vehicles (AV or HDV).

In an AV–HDV mixed-traffic scenario, there are four different groups of lead and following vehicle pairs: an AV following an AV (AV–AV), an AV following an HDV (AV–HDV), an HDV following an AV (HDV–AV), and an HDV following an HDV (HDV–HDV). HDV–HDV car-following behaviors have been widely studied in the literature. However, unlike HDV followers, who use driver perception to detect the lead vehicle’s motion, AV followers can better detect lead vehicle motion through sensors, and maintain shorter headways with the lead vehicle or reduce reaction times ( 3 ).

Interactions between HDVs and AVs have been modeled using separate mathematical car-following models for HDVs and AVs ( 4 ). However, in general, these models did not consider differences in car-following behavior among different types of lead and following vehicle pairs (HDV–AV, AV–HDV, and HDV–HDV). Moreover, models have generally been applied in microscopic traffic simulations, but they have not been calibrated and validated using real-world AV and HDV car-following data.

Thus, it was necessary to develop a model that could predict the car-following behaviors of HDVs and AVs that took into account their differences in vehicle performance characteristics and cooperation with the lead vehicle. In this study, cooperation between the lead and following vehicles was modeled using prospect theory (PT). The objectives of this study were as follows:

To analyze the car-following behaviors of HDVs and AVs for different types of lead and following vehicle pairs.

To model vehicle cooperation using PT and extend a data-driven car-following model to incorporate vehicle cooperation and driver memory (long short-term memory [LSTM] model) for predicting the car-following behaviors of HDVs and AVs.

The remainder of this paper is organized as follows: the next section reviews previous mathematical and data-driven car-following models applied in AV–HDV mixed traffic, AV-only traffic, and HDV-only traffic. The third section describes and analyzes real-world car-following data of HDVs and AVs. The fourth section describes PT, LSTM, and SHapley Additive exPlanations (SHAP), which determine the importance of variables in the model. The fifth section presents the results and discussion, and the final section summarizes the findings and makes recommendations.

Literature Review

Previous studies have developed mathematical models to predict car-following behavior. These classical car-following models are typically categorized on their underlying assumptions, the parameters they predict (such as spacing, acceleration, or speed), and the calibrated parameters. For instance, some models are classified as “collision-free” models, assuming that the following vehicle always maintains a safe distance from the lead vehicle to prevent rear-end collisions. Examples of such models are Gipps’ model ( 5 ), the intelligent driver model (IDM) ( 6 ), and the Krauss model ( 7 ).

On the other hand, there are stimulus-response models, like the General Motors model ( 8 ), which assume that the following vehicle reacts (accelerates or decelerates) based on the motion of the lead vehicle. Some models are also categorized as psychophysical models because they consider the perception and reaction times of the driver, such as the Wiedemann model ( 9 ). These established models have laid the foundation for understanding car-following behavior; however, they do not consider all the factors that can affect it. As a result, researchers have extended and refined these models to improve their performance. For instance, the extended models incorporate other factors, such as honking effects, backward-looking effects, and other contextual factors ( 10 – 14 ). It is worth noting that these conventional models have mainly focused on HDVs following HDVs.

Some researchers have developed car-following models where the lead and following vehicles are AVs. For example, Friji et al. developed a car-following model using a reinforcement learning neural network and high dimensional red green blue depth frames ( 15 ). Xoap et al. proposed a car-following model for AVs that assumes that the behavior of the following vehicle is affected by two or more preceding vehicles ( 16 ). They found the model could achieve optimal microscopic and macroscopic performance compared with other models such as the IDM. Sharma et al. developed an extended IDM for connected vehicles that incorporated driver compliance with information ( 17 ). The driving strategy developed using PT was used for calibrating this. Sharma et al. found that the extended IDM better predicted car-following behavior than the original.

Some researchers have developed data-driven car-following models that can replace mathematical models. For instance, Shi et al. developed a data-driven car-following model using random forest, and showed that the model could better predict car-following behavior than the GM model ( 18 ). Hao et al. developed a data-driven car-following model based on rough set theory ( 19 ), and Zhang et al. developed a model based on genetic algorithms ( 20 ). Other data-driven models, which outperformed mathematical methods, have been put forward ( 21 – 25 ). Data-driven car-following models that consider the effect of driver memory have also been proposed ( 26 – 31 ). These models utilize recurrent neural networks (RNNs), gated recurrent units, or LSTM models to replicate human driving styles, and have demonstrated promising results in capturing complex car-following behaviors.

The aforementioned studies primarily focused on AV–AV or HDV–HDV behavior in car-following models, neglecting AV–HDV and HDV–AV interactions. To address this gap, some researchers have developed more comprehensive car-following models for mixed-traffic environments involving both AVs and HDVs. For example, Cao et al. proposed a generic car-following model for both AVs and HDVs in mixed traffic flow while considering different market penetration rates of AV ( 32 ). The model utilized an improved IDM encompassing driver memory for HDVs, and an extended cooperative adaptive cruise control (CACC) car-following model based on a nonlinear dynamic headway for AVs. The proposed model was shown to effectively model car-following behavior under different market penetration rates of AVs. Liu et al. proposed an extended IDM that can capture the car-following behavior of AVs under a heterogeneous platoon by assuming a 35% market penetration rate for AVs ( 33 ). The proposed model considered the effect of the preceding connected vehicles within the communication range of the following AV. The proposed model was found to be more stable than traditional IDMs. Rahmati et al. carried out an experimental study to investigate the difference between HDV–HDV and HDV–AV interactions in mixed traffic ( 34 ). They found that human drivers felt more comfortable and showed risk-taking behavior when they followed AVs. Stabler et al. ( 35 ) and Ahmed et al. ( 36 ) modeled HDV–AV interactions using dynamic traffic assignment models and car-following models in microscopic traffic simulation, respectively. Ding et al. also developed an extended IDM for AVs and HDVs that incorporated cooperation between drivers, modeled using PT ( 37 ) . Unlike other studies that made use of simulated data, they used the field data observed from three Tesla vehicles in mixed traffic flow. The results indicated that the model captured the heterogeneity of car-following behaviors in AV–HDV mixed traffic.

Zhu et al. simulated car-following behavior in mixed traffic flow using a multiagent system ( 38 ). The reliability of the simulation was tested using the data collected from actual roads. The multiagent simulation showed that mixed HDV and AV traffic will move at a faster speed and with smaller between-car spacing. Zhu and Zhang proposed a car-following model for AVs in AV–HDV mixed traffic with adjustable sensitivity and smoothing factors. They verified the correctness of the proposed car-following model using numerical simulation ( 39 ). Ozkan and Ma studied car-following behavior and energy efficiency in AV–HDV mixed traffic using inverse reinforcement learning ( 40 ). The model had the capacity to learn and replicate the observed car-following behavior. The study also showed that HDVs consume less fuel when following AVs. Using a driving simulator, Schoenmaker et al. investigated the car-following behavior of HDVs when following AVs ( 41 ). They found that HDV drivers maintained a significantly shorter time headway when driving in the proximity of AV platoons. Lin et al. developed an LSTM model that considered Connected and Autonomous Vehicles (CAVs) following AVs using the NGSIM (i.e., Next Generation Simulation) dataset ( 42 ). They tested the attention-based LSTM model at different market penetration rates and found that the model provided more accurate longitudinal trajectory predictions for different time steps. Wen et al. investigated the interactions between HDVs and AVs using time-to-collision, driving volatility measures, and principal component analysis using the Waymo Open Dataset ( 43 ). They identified that HDV–AV traffic exhibited lower driving volatility compared with HDV–HDV.

In summary, the aforementioned studies have certain limitations, such as the use of simulated datasets that do not accurately represent real-world vehicle trajectories, and the utilization of mathematical models that may not fully capture complex driver behaviors. Thus, this study aimed to overcome these limitations by extending a data-driven car-following model to incorporate driver cooperation with the lead vehicle using PT, and driver memory using LSTM. The model was calibrated and validated using real-world vehicle trajectory data from AVs and HDVs. By combining PT with LSTM, we were able to account for the effects of driver (or AV) memory and cooperation with the lead vehicle on car-following behavior. To the best of our knowledge, this is the first study to incorporate PT into LSTM for predicting car-following behavior in AV–HDV mixed traffic.

Methods and Data

This section is divided into three subsections: first, a description of the data and analysis of car-following behavior in AV–HDV mixed traffic using real-world vehicle trajectory data; following this, PT for modeling cooperation; and finally, LSTM for the prediction of car-following behavior.

Data and Analysis

The real-world vehicle trajectory data used in this study were obtained from Waymo LLC, a leading AV technology company. The data were collected from multiple cities across the United States including San Francisco, CA, Phoenix, AZ, and Mountain View, CA ( 44 ). The data were collected using Lidar and camera technologies, capturing vehicle trajectories from diverse road segments such as freeways and urban streets. This was carried out at different times of the day, encompassing various weather conditions to ensure a comprehensive representation of real-world driving scenarios.

For each vehicle, the data were collected at a time interval of 0.1 s. The data included the type of vehicle (HDV or AV); speed (in m/s); acceleration; position (x and y coordinates); road segment; weather condition; time of day; the length, width, and height of the vehicle; spacing between the following and lead vehicle; and the ID of the lead vehicle. The data provided trajectory data spanning a total of 300,000 s, which allowed a comprehensive analysis of car-following behavior.

Specifically, the data included 1,032 vehicle pairs for HDV–HDV, 274 vehicle pairs for HDV–AV, and 196 vehicle pairs for AV–HDV. The data did not include trajectories for AV–AV vehicle pairs. As a result, the analysis in this study was limited to three groups of vehicle pairs: HDV–HDV, HDV–AV, and AV–HDV. Despite this limitation, the data still provided valuable insights into car-following behavior in AV–HDV mixed-traffic scenarios.

To ensure the highest level of data accuracy and reliability, a curated, preprocessed version of the Waymo dataset was provided by Hu et al. for this study ( 44 ). Given its size, this dataset served as a robust foundation for investigating the interactions and behaviors of diverse vehicle types in mixed-traffic scenarios.

The dataset was initially checked for noise and irregularities to ensure data quality. To achieve this, a Savitzky-Golay (SVG) filtering algorithm was applied to the global positioning data (x and y coordinates) of all vehicles in the original dataset. The SVG filtering algorithm is commonly used for smoothing time series data owing to its ability to filter out noise in a series of equally spaced data values by applying a polynomial fit. For instance, Figure 1 provides a comparison between the smoothed position and the original position of two randomly selected vehicle trajectories. The similarity between these positions indicated that the original dataset had minimal noise. Next, the output of the SVG filtering algorithm was used to calculate the speed of each vehicle in the dataset. The speed was calculated using Equation 1 as follows:

v (t) = \frac{x (t + Δ t) - x (t)}{Δ t} = \frac{dx (t)}{dt}

(1)

where

$v (t) =$ speed at time $t$ ,

$x (t) =$ position at time $t$ , and

$Δ t =$ time interval.
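The smoothing and differencing steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure used in the study: the window length and polynomial order of the filter are not reported, so a standard 5-point quadratic Savitzky-Golay kernel is assumed, positions are treated as one-dimensional distances along the trajectory, and the function names and the sample trajectory are hypothetical.

```python
# 5-point quadratic/cubic Savitzky-Golay smoothing weights (a standard published kernel).
# Assumption: the study does not report its window length or polynomial order.
SG5 = (-3.0, 12.0, 17.0, 12.0, -3.0)
SG5_NORM = 35.0


def smooth_positions(x):
    """Smooth equally spaced positions; the two samples at each end are left unchanged."""
    out = list(x)
    for i in range(2, len(x) - 2):
        out[i] = sum(w * x[i + k - 2] for k, w in enumerate(SG5)) / SG5_NORM
    return out


def forward_speed(x, dt=0.1):
    """Equation 1: v(t) = (x(t + dt) - x(t)) / dt; returns one value fewer than the input."""
    return [(x[i + 1] - x[i]) / dt for i in range(len(x) - 1)]


# Hypothetical trajectory: a vehicle moving at a constant 3 m/s, sampled every 0.1 s.
positions = [2.0 + 3.0 * 0.1 * i for i in range(20)]
speeds = forward_speed(smooth_positions(positions), dt=0.1)
```

Because the kernel reproduces straight lines exactly, the constant-speed trajectory passes through the filter unchanged and every computed speed is 3 m/s up to rounding error.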

The car-following behavior for all three types of vehicle pair was analyzed using eight of the core variables in the original dataset, as shown in Table 1.

Figure 1.

Comparison of original and smoothed vehicle positions: (a) Vehicle 1 and (b) Vehicle 2.

Table 1.

List of Variables

Variable	Definition
$a_{n}, a_{n + 1}$	Acceleration of lead vehicle, n, and following vehicle, n + 1
$j_{n}, j_{n + 1}$	Jerk of lead and following vehicles
$v_{n}, v_{n + 1}$	Speed of lead and following vehicles
$x_{n}$	Spacing between following and lead vehicles
$Δ v_{n + 1}$	Speed difference between following and lead vehicles (= $v_{n + 1} - v_{n}$ )

Initially, a safety critical analysis was conducted for each type of vehicle pair, taking into consideration different ranges of spacing. The distribution of acceleration for different ranges of spacing is depicted in Figure 2. The analysis revealed that when the spacing between vehicles was greater than 15 m, the behavior was similar for all three vehicle pair groups. However, when the spacing was less than 15 m, the acceleration of an AV following an HDV tended to concentrate more around zero compared with the acceleration of an HDV following an AV or HDV following another HDV. This observation suggests that AVs tended to maintain a more consistent speed to ensure a safe distance from the lead vehicle compared with HDVs.

Figure 2.

Distribution of a following vehicle’s acceleration for different ranges of spacing: (a) spacing ≤ 10 m, (b) 10 m < spacing ≤ 15 m, and (c) 15 m < spacing ≤ 30 m.

A comparison of spacing distributions was also conducted among the three vehicle pair groups, as illustrated in Figure 3. The analysis revealed that the median and variance of spacing were similar between HDV–AV and AV–HDV vehicle pairs. However, the median spacing was smaller for HDV–HDV pairs. Furthermore, the figure also indicated that the median spacing for HDV–HDV pairs was slightly longer than the median spacing for HDV–AV pairs. This observation implies that the car-following behavior of HDV drivers may vary depending on the type of lead vehicle (AV or HDV), indicating that human drivers may adjust their following distance differently based on the type of vehicle they are following.

Figure 3.

Distribution of spacing for different vehicle pair groups.

Figures 4 and 5 display the distribution of acceleration and jerk of the following vehicle, respectively, in relation to the spacing for the three vehicle pair groups. Jerk represents the rate at which the acceleration of either the following vehicle or the lead vehicle changes. Figure 4 reveals that when an HDV followed an AV, the deceleration tended to be more concentrated in the 10- to 30-m range of spacing. However, when an AV followed an HDV, deceleration and acceleration were concentrated on small values close to zero within a low ≤10-m range of spacing. A similar trend was observed for spacing ranges ≥35 m. Furthermore, Figure 5 shows that the distribution of jerk followed a similar pattern to that of speed. These results suggest that AVs were capable of better speed control without the need for hard deceleration compared with HDVs when following another vehicle.

Figure 4.

Distribution of acceleration for different vehicle pair groups: (a) AV following HDV, (b) HDV following AV, and (c) HDV following HDV.

Figure 5.

Distribution of jerk for different vehicle pair groups: (a) AV following HDV, (b) HDV following AV, and (c) HDV following HDV.

Figure 6 shows the distributions of spacing and velocity difference between the following and lead vehicles. The figure reveals that the distributions were similar between HDV–AV and HDV–HDV vehicle pair groups. However, for the AV–HDV pair group, the velocity difference was relatively smaller at larger spacing ranges of 10 to 48 m. This suggests that AVs tend to maintain smaller velocity differences with HDVs at larger spacing ranges compared with HDV–AV and HDV–HDV pairs. A smaller velocity difference may indicate that AVs are better able to maintain safe and consistent spacing.

Figure 6.

Distribution of velocity difference for different vehicle pair groups: (a) AV following HDV, (b) HDV following AV, and (c) HDV following HDV.

Lastly, the mutual information (MI) algorithm was adopted to investigate the relationships between selected variables and the velocity of the following vehicle at the next time step, $v_{n + 1} (t + 1)$ . MI is a concept from information theory, which measures the association between two variables, where one acts as the independent variable (X) and the other as the dependent variable (Y). This algorithm quantifies the amount of information shared between X and Y, and measures how much knowledge of one variable decreases the uncertainty in the other variable ( 45 ).
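As an illustration of the quantity described above, the following sketch estimates MI with a simple histogram (plug-in) estimator. The estimator, the number of bins, and the function names are assumptions made for this example; the study does not state which MI estimator was used.

```python
import math
from collections import Counter


def discretize(values, n_bins):
    """Assign each value to one of n_bins equal-width bins."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def mutual_information(xs, ys, n_bins=10):
    """Plug-in estimate of I(X; Y) in nats from paired samples."""
    bx = discretize(xs, n_bins)
    by = discretize(ys, n_bins)
    n = len(bx)
    joint = Counter(zip(bx, by))
    px = Counter(bx)
    py = Counter(by)
    mi = 0.0
    for (i, j), count in joint.items():
        # p(x, y) * log(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (count / n) * math.log(count * n / (px[i] * py[j]))
    return mi
```

With this estimator, a variable shares all of its entropy with itself, and two independent variables share none. In the setting of this study, `xs` would hold a candidate variable such as spacing and `ys` the follower speed at the next time step.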

Figure 7 displays the ranking of MI shared between the selected variables and the velocity at the next time step. The results revealed that the ranking of the variables for HDV–AV and HDV–HDV pairs was similar. However, when an AV was following an HDV, the spacing with the lead vehicle and the lead vehicle speed exhibited almost equal potential in reducing uncertainty with the velocity of the following vehicle at the next time step. This finding aligned with expectations, as AVs utilize sensor data to simultaneously control maneuvers based on the observed lead vehicle’s speed and spacing, making them more adaptive in mixed-traffic scenarios.

Figure 7.

Mutual information ranking for different vehicle pair groups: (a) AV following HDV, (b) HDV following AV, and (c) HDV following HDV.

Prospect Theory for Modeling Cooperation

PT has found application in various domains, such as mental accounting in behavioral economics and modeling driver compliance or route choice in transportation ( 46 ). PT was introduced by Kahneman and Tversky, who observed that risk decisions are often influenced by subjective considerations that, in turn, may have been shaped by language expressions, leading to different preferences in different scenarios ( 47 ). In the context of a car-following model, PT can be used to show that drivers may not always make decisions based solely on rational calculations of expected outcomes. Instead, they may evaluate potential gains and losses in a subjective and biased manner, taking into account factors such as the perceived risk of collision, comfort level, and potential benefits of following closely. PT also proposes that individuals are more sensitive to losses than gains, resulting in risk-seeking behavior in certain situations. Incorporating these behavioral aspects in car-following models can enhance their ability to capture the subjective decision-making processes of drivers in real-world traffic scenarios.

In PT, the decision makers’ choices are first formulated through prospects and then the utility value of each prospect is calculated ( 37 ). The prospect value, $U$ , is calculated using the utility function described below and the prospect generated using the maximum utility is identified as the selected choice of the decision maker (the driver).

U (x) = \sum_{i} ω (p_{i}) V (x_{i})

(2)

where $ω (p)$ is the weighting function related to probability $p$ of outcome $x$ (e.g., low or high cooperation with the lead vehicle), and $V (x)$ is the value function related to outcome $x$ . They are calculated as follows:

ω^{+} (p) = \frac{p^{γ}}{{(p^{γ} + {(1 - p)}^{γ})}^{1 / γ}}

(3)

ω^{-} (p) = \frac{p^{δ}}{{(p^{δ} + {(1 - p)}^{δ})}^{1 / δ}}

(4)

where

$p$ = probability of outcome $x$ ;

$γ$ and $δ$ = degrees of curvature and elevation, respectively;

$ω^{+} (p)$ = probability weight for gains; and

$ω^{-} (p)$ = probability weight for losses.

V (x) = \begin{cases} x^{α}, & x > 0 \\ - λ {(- x)}^{β}, & x \leq 0 \end{cases}

(5)

where $α$ and $β$ are the degrees of sensitivity for gains and losses, respectively, and $λ$ is the degree of loss aversion.
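Equations 2 to 5 can be evaluated numerically as in the sketch below. The default parameter values are the estimates reported by Tversky and Kahneman in their 1992 cumulative prospect theory study and are used here only for illustration; they are not the values calibrated in this study, and the function names are hypothetical.

```python
def weight(p, curvature):
    """Equations 3 and 4: probability weighting with curvature gamma (gains) or delta (losses)."""
    num = p ** curvature
    den = (p ** curvature + (1.0 - p) ** curvature) ** (1.0 / curvature)
    return num / den


def value(x, alpha, beta, lam):
    """Equation 5: concave for gains, convex and scaled by lam for losses."""
    if x > 0:
        return x ** alpha
    return -lam * (-x) ** beta


def prospect_utility(outcomes, alpha=0.88, beta=0.88, lam=2.25, gamma=0.61, delta=0.69):
    """Equation 2: outcomes is a list of (x_i, p_i) pairs."""
    total = 0.0
    for x, p in outcomes:
        w = weight(p, gamma) if x > 0 else weight(p, delta)
        total += w * value(x, alpha, beta, lam)
    return total


# A symmetric gamble: gain 1 or lose 1 with equal probability.
u_gamble = prospect_utility([(1.0, 0.5), (-1.0, 0.5)])
```

Under these illustrative parameters the symmetric gamble has negative utility, which is the loss aversion described above. Setting the curvature to 1 reduces the weighting function to the identity, $ω (p) = p$.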

The PT curve generated through the utility function, $U (x)$ , is controlled by the three parameters $λ$ , $γ$ , and $α$ . With respect to car-following behavior, if $α$ is less than or equal to 1, the gain part of the curve in $V (x)$ will exhibit a concave shape, indicating that drivers are risk-averse and may exhibit diminishing sensitivity to gains. Conversely, the loss part of the curve will be convex, suggesting that drivers are risk-seeking and may display increased sensitivity to losses, as shown in Figure 8 ( 37 ).

Figure 8.

Prospect theory curves: (a) α≤ 1 and (b) λ > 1 ( 37 ).

However, if $λ$ exceeds 1, the loss curve will be steeper than the gain curve, indicating that drivers are even more sensitive to small losses compared with gains. This may result in more cautious and defensive car-following behavior, as drivers may be more inclined to maintain larger time gaps to avoid potential losses, such as collisions or accidents. Additionally, the parameter $γ$ plays a crucial role in how decision makers evaluate and respond to situational cues. When $γ$ is equal to 1, the weight function $ω (p)$ becomes linear.

In an AV–HDV mixed-traffic environment, AVs and HDVs have different car-following behaviors and vehicles may have different preferences that lead to different response regimes to the same situation in surrounding traffic conditions. In this study, the cooperation between vehicles in car-following conditions was modeled using PT based on the spacing with the lead vehicle, as proposed by Ding et al. ( 37 ). The cooperation value was measured with a range of 0 to 1. A cooperation value close to zero means a lower likelihood of change in the car-following behavior in response to the lead vehicle’s speed and represents low cooperation. On the other hand, a cooperation value close to 1 means a significant change in the car-following behavior and represents high cooperation. It was expected that as the spacing increased, the cooperation value would decrease and vice versa. The cooperation function is represented as follows ( 37 ):

Cooperation Utility = V (s) \times w (s)

(6)

where $V (s)$ is the urgency value that describes how urgent the situation is for the following vehicle to react based on the observed spacing, $s$ , and is similar to the value function, $V (s)$ , of PT. $w (s)$ , which is similar to the weight function, $ω (p)$ , of PT, denotes the weight and describes how much the following vehicle “weighs” the perceived spacing based on different cooperation levels.

Because the urgency value, $V (s)$ , decreases as the observed spacing increases, it is calculated as follows ( 37 ):

V (s_{obs}) = \frac{1}{1 + e^{λ (α \times s_{obs} - 1)}}

(7)

where $s_{obs}$ is the observed spacing, and $α$ and $λ$ are parameters that decide the shape of the value function curve. The values of $α$ and $λ$ were calibrated using the observed data. From Equation 7, it can be seen that the sensitivity of $V (s_{obs})$ decreased for both very small and large spacing, which was similar to the sensitivity of the PT curve. The equation can also be used to estimate the value of maximum spacing ( $s_{\max}$ ) and minimum spacing ( $s_{\min}$ ), which are the spacings when $V (s) =$ 0 and $V (s) =$ 1, respectively.

Similar to PT, the weighting functions for gains and losses are represented as the weighting functions for low cooperation, $W_{LC}$ , and high cooperation, $W_{HC}$ , as follows:

W_{LC} (P_{LC}) = \frac{P_{LC}^{γ}}{{(P_{LC}^{γ} + {(1 - P_{LC})}^{γ})}^{1 / γ}}

(8)

where $γ$ is the degree of curvature and $P_{LC}$ is the probability of low cooperation, which is calculated as follows:

P_{LC} = \min (\frac{s_{obs}}{s_{\max}}, 1)

(9)

As the observed spacing approaches the spacing at $V (s) =$ 0 (i.e., $s_{\max}$ , low cooperation), the low cooperation weight approaches 1. The high cooperation weighting function is calculated as follows:

W_{HC} (P_{HC}) = \frac{P_{HC}^{δ}}{{(P_{HC}^{δ} + {(1 - P_{HC})}^{δ})}^{1 / δ}}

(10)

where $δ$ is the degree of curvature, and $δ = γ$ because both cooperation levels represent gains. $P_{HC}$ is the probability of high cooperation and is calculated as follows:

P_{HC} = \min (\frac{s_{\min}}{s_{obs}}, 1)

(11)

As the observed spacing approaches the spacing at $V (s) =$ 1 (i.e., $s_{\min}$ , high cooperation), the high cooperation weight approaches 1. Finally, the cooperation utility is the maximum of the low and high cooperation utility values as follows:

U = \max (U^{LC}, U^{HC})

(12)

where $U^{LC}$ and $U^{HC}$ are the cooperation utility at low and high levels of cooperation, respectively.

U^{LC} = V (s_{obs}) \times W_{LC} (P_{LC})

(13)

U^{HC} = V (s_{obs}) \times W_{HC} (P_{HC})

(14)

In this study, PT was used to model the perceived gains and losses when the following vehicle was uncertain of the lead vehicle’s future behavior. The output from PT, the cooperation utility, $U (s)$ , for a given spacing, $s$ , was used as an input variable of the car-following model.
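The computation of the cooperation utility from an observed spacing (Equations 7 to 14) can be sketched as follows. Several choices here are assumptions made for illustration only: the parameter values for $α$, $λ$, and $γ$ are hypothetical rather than the calibrated ones; because the logistic curve in Equation 7 only approaches 0 and 1, $s_{\max}$ and $s_{\min}$ are taken where the urgency value crosses a small threshold `eps` and `1 - eps`; and the weighting function uses the standard form of Equation 3.

```python
import math


def tk_weight(p, curvature):
    """Weighting form of Equation 3, applied to a cooperation probability."""
    return p ** curvature / (p ** curvature + (1.0 - p) ** curvature) ** (1.0 / curvature)


def urgency(s_obs, alpha, lam):
    """Equation 7: logistic urgency value that falls as spacing grows."""
    return 1.0 / (1.0 + math.exp(lam * (alpha * s_obs - 1.0)))


def spacing_at(v, alpha, lam):
    """Invert Equation 7: spacing at which the urgency value equals v (0 < v < 1)."""
    return (1.0 + math.log(1.0 / v - 1.0) / lam) / alpha


def cooperation_utility(s_obs, alpha=0.05, lam=10.0, gamma=0.61, eps=0.01):
    """Cooperation utility for a positive observed spacing s_obs in meters."""
    # Assumption: s_max and s_min are the spacings where V crosses eps and 1 - eps.
    s_max = spacing_at(eps, alpha, lam)
    s_min = spacing_at(1.0 - eps, alpha, lam)
    p_lc = min(s_obs / s_max, 1.0)       # Equation 9
    p_hc = min(s_min / s_obs, 1.0)       # Equation 11
    v = urgency(s_obs, alpha, lam)
    u_lc = v * tk_weight(p_lc, gamma)    # Equation 13
    u_hc = v * tk_weight(p_hc, gamma)    # Equation 14, with delta = gamma
    return max(u_lc, u_hc)               # Equation 12


u_close = cooperation_utility(5.0)   # short spacing
u_far = cooperation_utility(40.0)    # long spacing
```

With these hypothetical parameters the utility is close to 1 at a 5-m spacing and close to 0 at a 40-m spacing, matching the expectation that cooperation decreases as spacing increases.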

Long Short-Term Memory Neural Network

An LSTM neural network (NN) is a specific type of RNN with a longer memory and better transition ability. The network is best known for its ability to effectively solve sequential problems such as speech recognition, image recognition, and time series problems owing to its ability to store previously encountered patterns in its memory for the prediction of future patterns.

Similar to typical NNs, the LSTM-NN has three layers: the input layer, the hidden layer and the output layer. The input layer is the first layer in the network and it initializes the input variables for the subsequent layers. The output layer is responsible for giving the output. The hidden layer is the most important feature of LSTM-NN because it has memory cells that consist of gates to solve vanishing gradient problems as it passes information from one sequence to another ( 29 ). This feature allows LSTM-NN to learn long-term relationships between complex features in a dataset. Through these memory gates, which are capable of updating, discarding, and retaining information, LSTM is able to mimic the human decision-making process. The LSTM gates are divided into three types:

Input gate: Takes the independent variables as input into the network and decides which new information should be stored.

i^{(t)} = σ (W^{ix} \cdot x^{t} + W^{ih} \cdot h^{(t - 1)} + W^{ic} \cdot C^{(t - 1)} + b^{i})

(15)

C^{t} = \tanh (W^{cx} \cdot x^{t} + W^{ch} \cdot h^{(t - 1)} + b^{c})

(16)

Forget gate: Determines which part of the processed information of the previous output state should be retained in the knowledge base.

f^{(t)} = σ (W^{fx} \cdot x^{t} + W^{fh} \cdot h^{(t - 1)} + W^{fc} \cdot C^{(t - 1)} + b^{f})

(17)

Output gate: Transfers the processed data as an input to the input gate of the hidden layer at the next time step.

o^{(t)} = σ (W^{ox} \cdot x^{t} + W^{oh} \cdot h^{(t - 1)} + W^{oc} \cdot C^{(t - 1)} + b^{o})

(18)

where

$W^{ix}$, $W^{ih}$, $W^{ic}$, $W^{fx}$, $W^{fh}$, $W^{fc}$, $W^{ox}$, $W^{oh}$, $W^{oc}$, $W^{cx}$, and $W^{ch}$ are weight matrices;

$b^{i}$, $b^{f}$, $b^{o}$, and $b^{c}$ are the biases of the gates and the cell update;

$x^{t}$ are input variables that denote the historical driving information of the car-following model, and

$h^{(t - 1)}$ is the output of the previous LSTM block at time step $t - 1$ .

$σ$ and $\tanh$ are activation functions that model the nonlinear relationship between the input and output variables. $σ$ is the sigmoid function, which maps its input to the range [0, 1]:

σ (x) = \frac{1}{1 + e^{- x}}

(19)

$\tanh$ transforms the input and output value into a range of [−1, 1] and is calculated as follows:

\tanh (x) = \frac{\exp (x) - \exp (- x)}{\exp (x) + \exp (- x)}

(20)
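To make the gate equations concrete, the sketch below runs one LSTM step with scalar inputs and weights. It follows Equations 15 to 18 for the gates and the candidate value, and it additionally assumes the standard LSTM cell-state and output updates, which are not written out in the text. The dictionary keys mirror the superscripts of the weights and biases, and all numerical values are hypothetical.

```python
import math


def sigmoid(x):
    """Equation 19."""
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, W, b):
    """One scalar LSTM step; W and b are dicts keyed like the superscripts in Equations 15-18."""
    i = sigmoid(W["ix"] * x + W["ih"] * h_prev + W["ic"] * c_prev + b["i"])  # Equation 15
    f = sigmoid(W["fx"] * x + W["fh"] * h_prev + W["fc"] * c_prev + b["f"])  # Equation 17
    o = sigmoid(W["ox"] * x + W["oh"] * h_prev + W["oc"] * c_prev + b["o"])  # Equation 18
    c_tilde = math.tanh(W["cx"] * x + W["ch"] * h_prev + b["c"])             # Equation 16
    # Assumption (standard LSTM, not stated in the text): the new cell state blends
    # the old state and the candidate, and the output gate scales tanh of that state.
    c = f * c_prev + i * c_tilde
    h = o * math.tanh(c)
    return h, c


# With all weights and biases at zero, every gate equals 0.5 and the candidate is 0.
zero_W = {k: 0.0 for k in ("ix", "ih", "ic", "fx", "fh", "fc", "ox", "oh", "oc", "cx", "ch")}
zero_b = {k: 0.0 for k in ("i", "f", "o", "c")}
h_new, c_new = lstm_step(x=1.0, h_prev=0.3, c_prev=2.0, W=zero_W, b=zero_b)
```

In this degenerate case the forget gate halves the previous cell state, which shows how the gates control how much past information is retained from one time step to the next.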

The LSTM-NN is then trained to minimize the loss function, which evaluates how well the network fits the data. A widely used loss function is the mean squared error (MSE), defined as follows:

MSE = \frac{1}{n} \sum_{t = 1}^{n} {(Y_{t} - \bar{Y_{t}})}^{2}

(21)

where

$Y_{t} =$ observed value,

$\bar{Y_{t}} =$ predicted value, and

$n =$ number of observed data.
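
As a small worked illustration of Equation 21, the sketch below computes the MSE, together with its square root (the RMSE used later to compare models), for a handful of speeds. The observed and predicted values are made up for the example and do not come from the dataset.

```python
import math

def mse(observed, predicted):
    """Equation 21: mean of squared differences between observed and predicted values."""
    if not observed or len(observed) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((y - y_hat) ** 2 for y, y_hat in zip(observed, predicted)) / len(observed)

def rmse(observed, predicted):
    """Root mean square error, in the same unit as the observations."""
    return math.sqrt(mse(observed, predicted))

# Hypothetical following-vehicle speeds (m/s), for illustration only.
observed = [10.0, 10.5, 11.0, 11.2]
predicted = [10.1, 10.4, 11.3, 11.0]
example_mse = mse(observed, predicted)
example_rmse = rmse(observed, predicted)
```

Here the squared errors are 0.01, 0.01, 0.09, and 0.04, so the MSE is 0.15 / 4 = 0.0375 and the RMSE is about 0.194 m/s.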

Based on the results of the MI (Figure 7) and the cooperation utility of PT, the LSTM model predicts the following vehicle’s speed in the current time step, $t$, from the spacing ($S_{n + 1}$), the lead vehicle’s velocity ($v_{n}$), the following vehicle’s velocity ($v_{n + 1}$), and the cooperation utility ($U_{n + 1}$) over the previous $T$ seconds as follows:

v_{n + 1} (t) = f (S_{n + 1} (t - T : t - Δ t), v_{n + 1} (t - T : t - Δ t), v_{n} (t - T : t - Δ t), U_{n + 1} (t - T : t - Δ t))

(22)

where

$f (x)$ is the function that processes the input variables through the LSTM-NN;

$Δ t$ is the minimum time step = 0.1 s in the dataset;

$T$ denotes the duration of the input sequence, which is set as 1.0 s (i.e., 10 time steps of $Δ t$) owing to the size of the dataset; and

$v_{n + 1} (t)$ is the value to be predicted, which is the velocity of the following vehicle at the current time step.
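
The input and output structure of Equation 22 can be sketched as a sliding window over a trajectory. The function name `build_windows` and the synthetic series below are assumptions for illustration. Each sample holds the four variables over the previous $T$ = 1.0 s (10 steps of 0.1 s), and the target is the following vehicle's speed at the current step.

```python
def build_windows(spacing, v_follow, v_lead, utility, T=1.0, dt=0.1):
    """Return (X, y): X[k] is a window of shape steps x 4, y[k] the speed to predict."""
    steps = round(T / dt)                      # 10 steps for T = 1.0 s and dt = 0.1 s
    n = len(v_follow)
    if not (len(spacing) == len(v_lead) == len(utility) == n):
        raise ValueError("all series must have the same length")
    X, y = [], []
    for t in range(steps, n):
        # Rows cover t - T up to t - dt, i.e. indices t - steps .. t - 1.
        window = [[spacing[k], v_follow[k], v_lead[k], utility[k]]
                  for k in range(t - steps, t)]
        X.append(window)
        y.append(v_follow[t])                  # value at the current time step
    return X, y

# Synthetic 1.5 s trajectory (15 samples), for illustration only.
spacing = [20.0 + 0.5 * k for k in range(15)]
v_follow = [10.0 + 0.1 * k for k in range(15)]
v_lead = [10.5 + 0.1 * k for k in range(15)]
utility = [0.5 for _ in range(15)]
X, y = build_windows(spacing, v_follow, v_lead, utility)
```

Dropping the `utility` column from each row gives the three-variable input of the model without cooperation utility.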

In this study, the memory attribute of LSTM was used to model the driver’s memory in car-following behavior. This study considered two different models: the LSTM model that incorporates PT to consider the cooperation utility—the “PT-LSTM model” (Equation 22); and the LSTM model that does not consider the cooperation utility—the “baseline LSTM model” (Equation 23) as follows:

v_{n + 1} (t) = f (S_{n + 1} (t - T : t - Δ t), v_{n + 1} (t - T : t - Δ t), v_{n} (t - T : t - Δ t))

(23)

SHAP for Determining Importance of Variables of Models

Understanding why a complex machine learning model makes a certain prediction has become an important part of many scientific applications. In this regard, SHAP has been applied as an interpretable machine learning framework that, like permutation importance (PIMP), helps interpret model predictions and the underlying properties of data. Unlike PIMP, which measures variable importance based on how a feature affects model performance, SHAP measures the importance of a feature based on the magnitude of its attributions. The SHAP function is derived from game theory and builds on the additive feature attribution method, which unifies six feature importance methods ( 48 ). This approach makes SHAP less affected by highly correlated values than PIMP. The SHAP function is described as follows:

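S_{i} (f, x) = \sum_{S \subseteq S_{all} \setminus \{i\}} \frac{|S|! (M - |S| - 1)!}{M!} [f_{x} (S \cup \{i\}) - f_{x} (S)]

(24)

where

$S_{all}$ = set of all input features,

$M$ = number of input features in $S_{all}$,

$i$ = $i^{th}$ feature,

$S$ = subset of the input features that excludes feature $i$, referred to as players,

$|S|$ = number of features in $S$, and

$f_{x} (S)$ = LSTM prediction function evaluated with the features in $S$.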

Although SHAP produces more accurate results than PIMP and other variable importance methods, it is computationally intensive. Because the exact calculation evaluates the model on every subset of the input features, the computational complexity for N input features is on the order of 2^N, which becomes very expensive when N is large ( 49 ).
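
To make the Shapley attribution and its exponential cost concrete, the following sketch enumerates every feature subset for a three-feature toy value function. It is an illustration under stated assumptions, not the SHAP library and not the procedure applied to the LSTM: `toy_value`, its weights, and the interaction term are invented, and practical tools approximate this sum rather than enumerating it.

```python
from itertools import combinations
from math import factorial

def shapley_values(value_fn, n_features):
    """Exact Shapley values by enumerating all subsets that exclude each feature."""
    players = range(n_features)
    phi = []
    for i in players:
        others = [j for j in players if j != i]
        total = 0.0
        for size in range(len(others) + 1):
            # Weight |S|! (M - |S| - 1)! / M! for subsets of this size.
            weight = factorial(size) * factorial(n_features - size - 1) / factorial(n_features)
            for subset in combinations(others, size):
                s = frozenset(subset)
                # Marginal contribution of feature i when added to subset S.
                total += weight * (value_fn(s | {i}) - value_fn(s))
        phi.append(total)
    return phi

# Hypothetical value function: additive weights plus an interaction of features 0 and 1.
WEIGHTS = [1.0, 2.0, 3.0]

def toy_value(subset):
    value = sum(WEIGHTS[j] for j in subset)
    if 0 in subset and 1 in subset:
        value += 2.0
    return value

phi = shapley_values(toy_value, 3)
```

The interaction of 2.0 is split equally between features 0 and 1, so the attributions are approximately 2.0, 3.0, and 3.0, and they sum to the value of the full feature set (8.0). Each feature requires 2^(M−1) subsets, which is why exact evaluation quickly becomes impractical as the number of features grows.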

Results and Discussion

The PT-LSTM model and the baseline LSTM model were estimated in Python on Google Colab, which offered more processing power. To guard against overfitting, cross-validation using TimeSeriesSplit was implemented with the number of folds set to 3, and 20% of the data were used for testing. For each fold, 20% of the data were used for validation and 60% for training. First, the baseline LSTM model was estimated. Table 2 shows the LSTM configuration parameters that gave the lowest root mean square error (RMSE) in an informal search. These values might therefore not be optimal.

Table 2.

LSTM Configuration Parameters

Details	Structure	Details	Structure
Hidden layers	2	Activation function	ReLU
Hidden layer nodes	100 and 60	Learning rate	0.001
Epoch/batch size	10/100	Training algorithm/time step	Adam/1 s

Note: LSTM = long short-term memory.
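
The time-ordered cross-validation described before Table 2 can be sketched as follows. This is one possible reading of that description rather than the authors' code: the final 20% of a series is held out for testing, and three expanding-window folds are formed on the remainder so that validation data always come after the training data. The function name and the sample count of 100 are assumptions, and the 60%/20% proportions are reproduced only by the last fold in this reading.

```python
def expanding_window_folds(n_samples, n_splits=3):
    """Yield (train_indices, validation_indices) pairs that respect time order."""
    fold_size = n_samples // (n_splits + 1)
    for k in range(1, n_splits + 1):
        # The training block grows by one fold each time; validation follows it.
        train_end = n_samples - (n_splits - k + 1) * fold_size
        train_idx = list(range(train_end))
        val_idx = list(range(train_end, train_end + fold_size))
        yield train_idx, val_idx

n_total = 100                       # hypothetical number of samples
n_test = n_total * 20 // 100        # final 20% held out for testing
test_idx = list(range(n_total - n_test, n_total))
folds = list(expanding_window_folds(n_total - n_test, n_splits=3))
fold_sizes = [(len(train), len(val)) for train, val in folds]
```

For 100 samples this gives training and validation sizes of (20, 20), (40, 20), and (60, 20), followed by the 20 held-out test samples. Unlike a shuffled k-fold split, no future observation leaks into the training block.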

After estimating the baseline LSTM model, the PT-LSTM model was estimated using the LSTM configuration parameters in Table 2. The PT parameters for each vehicle pair group (HDV–AV, HDV–HDV, and AV–HDV) were calibrated separately using differential evolution (DE), and the parameter set that gave the lowest RMSE was retained. The DE population size, maximum number of generations, and function tolerance were set to 100, 300, and 1e⁻⁶, respectively. Table 3 shows the range of each parameter value used while calibrating PT parameters for each vehicle pair group using DE.

Table 3.

Range of Parameter Values in Prospect Theory

Parameters	Description	Range
α	Urgency function parameter	[0.01, 1.0]
λ	Urgency function parameter	[1, 6]
γ	Weighting function parameter	[0.1, 1.0]
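
The DE calibration over the ranges in Table 3 can be illustrated with a minimal pure-Python sketch of the classic DE/rand/1/bin scheme. It is not the authors' code. The mutation factor `F`, the crossover rate `CR`, the stopping rule, the smaller population size, and the objective `toy_cost` with its hidden `TARGET` are all assumptions for illustration. In the actual calibration the objective would be the RMSE of the PT-LSTM model for a candidate parameter set.

```python
import random

def differential_evolution(cost, bounds, pop_size=20, max_gen=300, tol=1e-6,
                           F=0.5, CR=0.9, seed=0):
    """Minimize `cost` within box `bounds` using DE/rand/1/bin."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    costs = [cost(ind) for ind in pop]
    for _ in range(max_gen):
        for i in range(pop_size):
            # Pick three distinct members other than i for the mutation step.
            a, b, c = rng.sample([k for k in range(pop_size) if k != i], 3)
            j_rand = rng.randrange(dim)     # guarantees at least one mutated gene
            trial = []
            for j, (lo, hi) in enumerate(bounds):
                if rng.random() < CR or j == j_rand:
                    value = pop[a][j] + F * (pop[b][j] - pop[c][j])
                    value = min(max(value, lo), hi)   # keep inside the range
                else:
                    value = pop[i][j]
                trial.append(value)
            trial_cost = cost(trial)
            if trial_cost <= costs[i]:      # greedy selection
                pop[i], costs[i] = trial, trial_cost
        # ASSUMED stopping rule: population costs agree within the tolerance.
        if max(costs) - min(costs) < tol:
            break
    best = min(range(pop_size), key=costs.__getitem__)
    return pop[best], costs[best]

# Ranges for alpha, lambda, and gamma taken from Table 3.
bounds = [(0.01, 1.0), (1.0, 6.0), (0.1, 1.0)]
TARGET = (0.30, 2.50, 0.60)   # hypothetical optimum, for the toy objective only

def toy_cost(params):
    """Stand-in for the model RMSE: squared distance to a hidden target."""
    return sum((p - t) ** 2 for p, t in zip(params, TARGET))

best, best_cost = differential_evolution(toy_cost, bounds)
```

Because each candidate evaluation would require training and scoring an LSTM, the population size and number of generations largely determine the calibration cost.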

Table 4 presents the RMSE for different sets of the PT parameters. The first row is the baseline LSTM model, for which the PT parameters are not applicable; the remaining rows are 10 randomly selected DE-calibrated outputs of the PT-LSTM model. The PT-LSTM parameter set with the lowest RMSE (0.286) had a lower RMSE than the baseline LSTM model (0.459). This result indicates that, with calibrated PT parameters, the LSTM model with the cooperation utility (PT-LSTM model) outperformed the baseline LSTM model.

Table 4.

Calibration of Cooperation Utility Parameter Values with LSTM

$λ_{hdv - hdv}$	$α_{hdv - hdv}$	$γ_{hdv - hdv}$	$λ_{hdv - av}$	$α_{hdv - av}$	$γ_{hdv - av}$	$λ_{av - hdv}$	$α_{av - hdv}$	$γ_{av - hdv}$	RMSE
na	na	na	na	na	na	na	na	na	0.459
5.93	0.05	0.29	4.66	0.06	0.64	3.37	0.05	0.69	0.551
1.19	0.03	0.55	3.25	0.10	0.68	2.38	0.10	0.19	0.536
1.28	0.29	0.46	1.99	0.74	0.38	1.48	0.36	0.27	0.435
1.17	0.09	0.38	4.49	0.04	0.74	5.90	0.09	0.73	0.480
3.48	0.07	0.16	1.63	0.09	0.61	5.88	0.04	0.93	0.286
3.02	0.89	0.47	4.82	0.32	0.70	4.39	0.23	0.81	0.323
1.10	0.57	0.74	1.17	0.49	0.68	5.78	0.35	0.33	0.639
5.25	0.05	0.49	5.63	0.08	0.70	2.92	0.09	0.69	0.628
5.14	0.92	0.38	5.31	0.35	0.54	1.44	0.55	0.91	0.364
4.86	0.95	0.13	2.52	0.95	0.26	5.60	0.33	0.34	0.345

Note: LSTM = long short-term-memory; RMSE = root mean square error; na = not applicable.

The row with RMSE = 0.286 (shown in bold in the original table) is the PT-LSTM model with the calibrated PT parameters that produced the lowest RMSE.
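
As a worked check of Table 4, the short sketch below selects the parameter set with the lowest RMSE and expresses its improvement relative to the baseline. The values are copied from the table. The variable names and the relative-reduction measure are additions for illustration and are not reported in the study.

```python
BASELINE_RMSE = 0.459

# Columns follow Table 4: lambda, alpha, gamma for HDV-HDV, HDV-AV, AV-HDV, then RMSE.
table4 = [
    (5.93, 0.05, 0.29, 4.66, 0.06, 0.64, 3.37, 0.05, 0.69, 0.551),
    (1.19, 0.03, 0.55, 3.25, 0.10, 0.68, 2.38, 0.10, 0.19, 0.536),
    (1.28, 0.29, 0.46, 1.99, 0.74, 0.38, 1.48, 0.36, 0.27, 0.435),
    (1.17, 0.09, 0.38, 4.49, 0.04, 0.74, 5.90, 0.09, 0.73, 0.480),
    (3.48, 0.07, 0.16, 1.63, 0.09, 0.61, 5.88, 0.04, 0.93, 0.286),
    (3.02, 0.89, 0.47, 4.82, 0.32, 0.70, 4.39, 0.23, 0.81, 0.323),
    (1.10, 0.57, 0.74, 1.17, 0.49, 0.68, 5.78, 0.35, 0.33, 0.639),
    (5.25, 0.05, 0.49, 5.63, 0.08, 0.70, 2.92, 0.09, 0.69, 0.628),
    (5.14, 0.92, 0.38, 5.31, 0.35, 0.54, 1.44, 0.55, 0.91, 0.364),
    (4.86, 0.95, 0.13, 2.52, 0.95, 0.26, 5.60, 0.33, 0.34, 0.345),
]

best_row = min(table4, key=lambda row: row[-1])
best_rmse = best_row[-1]
gamma_hdv_hdv, gamma_hdv_av, gamma_av_hdv = best_row[2], best_row[5], best_row[8]

# Relative RMSE reduction of the best PT-LSTM set with respect to the baseline.
reduction_pct = round((BASELINE_RMSE - best_rmse) / BASELINE_RMSE * 100, 1)
n_better = sum(1 for row in table4 if row[-1] < BASELINE_RMSE)
```

The best set lowers the RMSE from 0.459 to 0.286, a reduction of about 37.7%, whereas only 5 of the 10 listed sets improve on the baseline. The benefit of the cooperation utility therefore depends on the calibrated PT parameters.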

From Table 4, it can be seen that, for the PT-LSTM model with the lowest RMSE, the $γ$ value was largest for the AV–HDV vehicle pair (0.93). A large $γ$ value indicates that the follower’s willingness to cooperate is stronger when the driving state of the leading vehicle changes ( 37 ). This suggests that AVs were more willing to change driving behavior as the driving behavior of the lead vehicle changed. On the other hand, the $γ$ value for the HDV–HDV pair was the lowest (0.16). This suggests that HDVs were less willing to cooperate with the lead vehicle.

The distributions of the urgency value, $V (s_{obs})$, and cooperation utility, $U$, of the PT-LSTM model with the lowest RMSE value are shown in Figures 9 and 10, respectively. Figure 9 shows that the urgency values varied with spacing and differed among the three vehicle pair groups. The urgency value for AV followers was higher than for the other groups, which indicates that AVs were more likely than HDV followers to react quickly to changes in the lead vehicle speed. The figure also shows that HDV followers were more likely to react to the lead vehicle faster when the spacing was short (≤20 m). At short spacing, however, HDV followers were more likely to react faster to a lead HDV than to a lead AV. This suggests that HDVs were more cautious when following HDVs than when following AVs.

Figure 9.

Urgency value of the LSTM with calibrated parameters: (a) urgency value and (b) urgency value (spacing of 0 to 50 m).

Figure 10.

Cooperation utility value of the LSTM with calibrated parameters: (a) cooperation plot based on U value and (b) cooperation plot based on U value (spacing of 0 to 50 m).

Figure 10 shows that the cooperation utility values were lower for HDV followers than AV followers for a given spacing. This result indicates that HDVs were more aggressive in their car-following behavior (i.e., they were less likely to cooperate with the lead vehicle unless the spacing was very short) regardless of the lead vehicle type. However, AVs are able to track and cooperate with the lead vehicle through sensors in a car-following environment. It is also important to note from the figure that the cooperation utility increased more slowly as the spacing decreased for HDV–AV than HDV–HDV pairs. This suggests that HDVs had a more prolonged level of cooperation with the lead AV than when following the lead HDV. This HDV–AV behavior can be attributed to the longitudinal velocity control mechanism of AVs ( 43 ).

Figure 11 presents a comparison of prediction accuracy for the following vehicle speed among the PT-LSTM model, baseline LSTM model, and a time-series-based multiple layer perceptron (MLP) model for each vehicle pair group. The results demonstrate that the PT-LSTM model achieved lower RMSE than both the baseline LSTM model and MLP model for all vehicle pair groups. Specifically, the difference in RMSE between the PT-LSTM model and the baseline LSTM model was more pronounced for HDV–AV than for AV–HDV and HDV–HDV. Therefore, the PT-LSTM model outperformed both the baseline LSTM and MLP models, demonstrating greater accuracy in predicting the interaction between an HDV and an AV in mixed traffic.

Figure 11.

Comparison of prediction accuracy between models for each group of vehicle pair.

We further adopted SHAP to compare the importance of variables in the PT-LSTM model. Figure 12, a to c, shows the SHAP output for AV–HDV, HDV–AV, and HDV–HDV, respectively. A higher SHAP value indicates that a variable is more important to the prediction of car-following behavior. The figure shows that the following vehicle speed and the lead vehicle speed at previous time steps were most important for the prediction of the following vehicle speed in the current time step.

Figure 12.

Importance of variables of PT-LSTM model determined using SHAP: (a) AV following HDV, (b) HDV following AV, and (c) HDV following HDV.

Moreover, the SHAP value of the cooperation utility (ut value) was relatively higher for AV–HDV than HDV–AV and HDV–HDV. This result suggests that AVs following HDVs are more likely to control their behaviors based on the cooperation utility, unlike HDV drivers following AVs or HDVs. This is because the following AV is more likely to cooperate with the lead HDV. However, for HDV–HDV, the SHAP value was consistently higher for spacing than the cooperation utility in all time steps, and its cooperation utility was lower than AV–HDV and HDV–AV. This means that HDV drivers following HDVs were more likely to control their behaviors based on the spacing and less likely to cooperate with the lead HDV. Also, the SHAP value of the cooperation utility was slightly higher for HDV–AV than HDV–HDV. This indicates that HDV drivers were more likely to cooperate with the lead AV than the lead HDV. This is intuitive because HDV drivers generally have a greater level of comfort and trust when they follow an AV than an HDV ( 50 ).

The PT-LSTM model was compared against two other car-following models, namely the PT-IDM (IDM with the incorporation of PT) proposed by Ding et al. ( 37 ), and Gipps’ model, which is widely recognized as a collision avoidance model. The PT-IDM is described as follows:

a_{n} (t) = δ \times a_{n - 1} (t) + η \times [S_{n} (t) - S_{n}^{*} (t)] + θ \times [V_{n - 1} (t) - V_{n} (t)]

(25)

S_{n} (t) = Y_{n - 1} (t) - Y_{n} (t)

(26)

S_{n}^{*} (t) = S_{0} + [1 + U (d h_{obs})] \times T \times V_{n} (t) + \frac{V_{n} (t) \times Δ V_{n}}{2 \times \sqrt{ab}}

(27)

where

$a_{n - 1} (t)$ and $a_{n} (t)$ are the accelerations of the lead and following vehicles at time $t$, respectively;

$S_{n} (t)$ and $S_{n}^{*} (t)$ are the spacing and the desired spacing at time $t$, respectively;

$Y_{n} (t)$ and $Y_{n - 1} (t)$ are the positions of the following and lead vehicles at time $t$, respectively;

$V_{n} (t)$, $V_{n - 1} (t)$, and $U (d h_{obs})$ are the following vehicle’s velocity, the lead vehicle’s velocity, and the cooperation utility value, respectively;

$S_{0}$, $a$, $b$, and $T$ are the safe spacing, maximum acceleration, desired deceleration, and desired time headway, respectively; and

δ, η, and θ are parameters to be calibrated.
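
The PT-IDM in Equations 25 to 27 can be evaluated step by step as in the sketch below. It is an illustration rather than the implementation of Ding et al. The text does not define the speed difference in Equation 27, so it is assumed here to be the follower's speed minus the leader's speed, as in the standard IDM. The cooperation utility is passed in as a plain number, and the vehicle state in the example is hypothetical. The parameter values are those of Pair 1 in Table 5.

```python
import math

def desired_spacing(v_follow, v_lead, utility, s0, T, a, b):
    """Equation 27: desired spacing, with the headway term scaled by (1 + U)."""
    delta_v = v_follow - v_lead    # ASSUMPTION: approach rate as in the standard IDM
    return (s0 + (1.0 + utility) * T * v_follow
            + v_follow * delta_v / (2.0 * math.sqrt(a * b)))

def pt_idm_acceleration(a_lead, spacing, v_follow, v_lead, utility,
                        s0, T, a, b, delta, eta, theta):
    """Equation 25: follower acceleration from leader acceleration, gap error, speed error."""
    s_star = desired_spacing(v_follow, v_lead, utility, s0, T, a, b)
    return delta * a_lead + eta * (spacing - s_star) + theta * (v_lead - v_follow)

# Pair 1 (AV-HDV) parameters from Table 5.
pair1 = dict(s0=8.15, a=4.21, b=4.12, T=4.09, delta=2.78, eta=0.22, theta=0.38)

# Hypothetical state: positions give the spacing via Equation 26.
y_lead, y_follow = 130.0, 100.0
spacing = y_lead - y_follow                    # Equation 26
accel = pt_idm_acceleration(a_lead=0.0, spacing=spacing, v_follow=5.0,
                            v_lead=5.0, utility=0.2, **pair1)
```

A larger cooperation utility lengthens the desired spacing, so for the same actual spacing the follower decelerates more strongly. This is how the cooperation term enters the otherwise conventional car-following response.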

To compare the PT-IDM, Gipps’, and PT-LSTM models, six vehicle pairs were selected, two for each vehicle pair group. The parameters of the PT-IDM and Gipps’ model were calibrated using DE with a population size of 100, a maximum of 300 generations, and a function tolerance of 1e⁻⁶. For the PT-IDM, the parameter ranges for S₀, a, b, T, δ, η, and θ were set to [2, 10], [0.1, 5], [0.1, 5], [0.1, 5], [0.1, 5], [0.1, 5], and [0.1, 5], respectively, whereas the ranges for the PT parameters α, λ, and γ were set to [0.01, 0.5], [1, 6], and [0.1, 1], respectively. For Gipps’ model, the parameter ranges for $v_{n}^{d}$, $a_{\max}$, $d_{n}$, and $d_{n - 1}$ were set to [0, 20], [0.1, 5], [−3, 10], and [−3, 10], respectively. The parameter values obtained through DE were then used in both mathematical models for the comparative analysis.

Tables 5 and 6 display the optimal calibration parameter set used for each vehicle pair. Figure 13 shows the comparison of vehicle speeds, including observed speed, and predicted speed by PT-LSTM, PT-IDM, and Gipps’ model. The findings revealed that the PT-LSTM model outperformed the PT-IDM and Gipps’ model in relation to accuracy in predicting the observed following vehicle speed for all three vehicle pair groups.

Table 5.

Calibration of Parameter Values of PT-IDM with Cooperation Utility

Pair	Pair group	S₀ (m)	a (m/s²)	b (m/s²)	T	δ	η	θ	λ	α	γ
1	AV–HDV	8.15	4.21	4.12	4.09	2.78	0.22	0.38	1.48	0.48	0.46
2	AV–HDV	5.26	4.66	2.78	2.15	0.34	0.23	0.16	2.77	0.34	0.11
3	HDV–AV	7.46	2.09	4.30	2.45	0.69	0.33	0.29	4.61	0.37	0.41
4	HDV–AV	6.35	3.76	2.65	1.17	0.19	0.45	0.29	3.42	0.31	0.88
5	HDV–HDV	9.50	3.15	3.33	1.49	0.89	0.36	0.28	1.86	0.18	0.49
6	HDV–HDV	4.00	3.72	2.13	0.63	1.09	0.27	0.25	2.36	0.42	0.99

Note: PT-IDM = prospect theory–intelligent driver model; HDV = human-driven vehicle; AV = autonomous vehicle.

Table 6.

Calibration of Parameter Values of Gipps’ Model

Pair	Pair group	$v_{n}^{d}$	$a_{\max}$	$d_{n}$	$d_{n - 1}$
1	AV–HDV	16.58	0.83	−1.72	−1.65
2	AV–HDV	15.54	2.10	8.75	4.09
3	HDV–AV	10.01	3.78	5.13	3.49
4	HDV–AV	16.78	3.52	6.50	9.22
5	HDV–HDV	19.58	0.33	4.15	3.50
6	HDV–HDV	17.74	1.56	7.35	8.37

Note: HDV = human-driven vehicle; AV = autonomous vehicle.

Figure 13.

Comparison of observed and predicted following vehicle speed between PT-LSTM model and PT-IDM: (a) AV–HDV (Vehicle Pair 1), (b) AV–HDV (Vehicle Pair 2), (c) HDV–AV (Vehicle Pair 3), (d) HDV–AV (Vehicle Pair 4), (e) HDV–HDV (Vehicle Pair 5), and (f) HDV–HDV (Vehicle Pair 6).

Conclusions and Recommendations

This study extended a data-driven car-following model for HDVs and AVs in AV–HDV mixed-traffic environments. This extended car-following model predicted the following vehicle’s velocity considering how drivers’ (or AVs’) cooperation with the lead vehicle and memory affected their car-following behavior for three groups of lead and following vehicle pairs: an AV following an HDV (AV–HDV), an HDV following an AV (HDV–AV), and an HDV following an HDV (HDV–HDV). The cooperation between the lead and following vehicles was modeled using PT and incorporated into the LSTM model as an input. This data-driven car-following model is called the “PT-LSTM model.”

The PT-LSTM model was calibrated using real-world HDV and AV trajectory data extracted from the Waymo Open Dataset. The model parameters were separately calibrated for the three types of vehicle pairs. The calibrated cooperation utility in the PT-LSTM model showed that AV followers were more likely to cooperate with a lead HDV than HDV followers. Moreover, HDV followers were more likely to cooperate with a lead AV than a lead HDV.

The comparison of the PT-LSTM model with the MLP and LSTM models, which do not incorporate PT, revealed that the former achieved a higher prediction accuracy of the following vehicle’s velocity. The PT-LSTM model was additionally evaluated against two mathematical car-following models: the PT-IDM and Gipps’ model. The PT-LSTM model was found to outperform both models in accurately predicting the speed profiles of the following vehicle in AV–HDV, HDV–AV, and HDV–HDV scenarios. This finding underscored that a data-driven car-following model that incorporates cooperation with the lead vehicle and memory, as captured by the PT-LSTM model, can more effectively replicate the car-following behaviors of drivers or AVs in mixed-traffic scenarios.

Furthermore, the SHAP method was utilized to determine the variable importance in the PT-LSTM model at different time steps. The findings revealed that, for all vehicle pair groups, the previous time step’s speed of both the following vehicle and the lead vehicle was important in predicting the following vehicle’s speed in the current time step. However, in the case of the AV–HDV scenario, cooperation utility was found to be more significant than spacing—in contrast to HDV–AV and HDV–HDV scenarios. This observation suggests that AVs were more inclined to cooperate with the lead vehicle, potentially owing to their superior ability to detect the motion of the lead vehicle through sensors compared with HDVs.

However, it should be noted that there are some limitations to the current study. Factors such as road geometries (e.g., slope and curvature) and weather conditions, which can potentially affect car-following behaviors, were not considered. Furthermore, the limited availability of real-world data on AV–AV car-following behaviors may have hindered a comprehensive analysis of AV–AV interactions. Moreover, the transferability of the PT-LSTM model may be limited as it was trained and validated on a specific dataset from Waymo Open Dataset, which may not fully represent all driving conditions or geographic locations. The association of car-following with lane-changing could not be considered because lane-by-lane vehicle trajectory data were not available. Lastly, traffic performance could not be evaluated by applying the PT-LSTM model to a microscopic traffic simulation (e.g., SUMO), similar to the studies by Goncu et al. ( 51 ) and Silgu et al. ( 52 ), as the observed aggregated traffic data were not available. These limitations should be taken into consideration when interpreting the results of this study and in the design of future research.

In future studies, it is recommended that the PT-LSTM model be calibrated for different road geometry, traffic, and weather conditions in AV–HDV mixed traffic, and the behavioral differences between AVs and HDVs and their interactions during car-following be analyzed more extensively. The robustness and transferability of the model could also be evaluated using a wider set of data. It is also recommended that more advanced control strategies of AVs (e.g., CACC) be developed based on the prediction of human driver behaviors in AV–HDV mixed traffic using the PT-LSTM model. In this regard, the PT-LSTM model could be applied to realistic simulation platforms like CARLA to analyze the visual results of AV maneuvers for different control strategies and traffic scenarios.

Footnotes

Author Contributions

The authors confirm contribution to the paper as follows: study conception and design: A. Adewale, C. Lee; data collection: A. Adewale; analysis and interpretation of results: A. Adewale, C. Lee; draft manuscript preparation: A. Adewale, C. Lee. All authors reviewed the results and approved the final version of the manuscript.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by Natural Sciences and Engineering Research Council of Canada (Grant no. RGPIN-2019-04430).

ORCID iDs

Ayobami Adewale

Chris Lee

Data Accessibility Statement

The data used to support the findings of this study are available in the study by Hu et al. ( 44 ), whereas the code and model are available on request.

References

1. Stoma, Dudziak, Caban, Droździel. The Future of Autonomous Vehicles in the Opinion of Automotive Market Users. Energies, Vol. 14, No. 16, 2021, p. 4777.

2. Wang, Liu, Chigan, Liu, Zhao. Car-Following Behavior of Coach Bus Based on Naturalistic Driving Experiments in Urban Roads. 2019 IEEE International Symposium on Circuits and Systems (ISCAS), Sapporo, Hokkaido, Japan, 2019, pp. 1–4.

3. Xie D.-F., Zhao X.-M. Heterogeneous Traffic Mixing Regular and Connected Vehicles: Modeling and Stabilization. IEEE Transactions on Intelligent Transportation Systems, Vol. 20, No. 6, 2019, pp. 2060–2071.

4. Wang, Liu. A Car-Following Model for Mixed Traffic Flow Consisting of Human-Driven Vehicles and Connected Vehicles. 2020 Chinese Automation Congress (CAC), Shanghai, China, 2020, pp. 2851–2856.

5. Gipps. A Behavioural Car-Following Model for Computer Simulation. Transportation Research Part B: Methodological, Vol. 15, No. 2, 1981, pp. 105–111.

6. Treiber, Hennecke, Helbing. Microscopic Simulation of Congested Traffic. In Traffic and Granular Flow ’99 (Helbing, Herrmann H. J., Schreckenberg, Wolf D. E., eds.), Springer Berlin Heidelberg, Berlin, Heidelberg, 2000, pp. 365–376.

7. Krauss. Microscopic Modeling of Traffic Flow: Investigation of Collision Free Vehicle Dynamics, 1998. Technical Report, Report No. DLR-FB-98-08, DLR Deutsches Zentrum fuer Luft- und Raumfahrt e.V., Koeln (Germany).

8. Chakroborty, Kikuchi. Evaluation of the General Motors Based Car-Following Models and a Proposed Fuzzy Inference Model. Transportation Research Part C: Emerging Technologies, Vol. 7, No. 4, 1999, pp. 209–235.

9. Leutzbach, Wiedemann. Development and Applications of Traffic Simulation Models. Traffic Engineering and Control, Vol. 27, No. 5, 1986, pp. 270–278.

10. Hossain M. A., Kabir K. A., Tanimoto. Improved Car-Following Model Considering Modified Backward Optimal Velocity and Velocity Difference With Backward-Looking Effect. Journal of Applied Mathematics and Physics, Vol. 9, 2021, pp. 242–259.

11. Liang, Wang, Guo. Nonlinear Analysis of the Car-Following Model Considering Headway Changes With Memory and Backward Looking Effect. Physica A: Statistical Mechanics and its Applications, Vol. 562, 2021, p. 125303.

12. Wang, Cheng. Nonlinear Analysis for a Modified Continuum Model Considering Driver’s Memory and Backward Looking Effect. Physica A: Statistical Mechanics and its Applications, Vol. 508, 2018, pp. 18–27.

13. Han, Zhang, Wang, Liu, Wang, Zhong. An Extended Car-Following Model Considering Generalized Preceding Vehicles in V2X Environment. Future Internet, Vol. 12, 2020, p. 216.

14. Jiang. A Modified Full Velocity Difference Model With Acceleration and Deceleration Confinement: Calibrations, Validations, and Scenario Analyses. IEEE Intelligent Transportation Systems Magazine, Vol. 13, No. 2, pp. 222–235.

15. Friji, Ghazzai, Besbes, Massoud. A DQN-Based Autonomous Car-Following Framework Using RGB-D Frames. 2020 IEEE Global Conference on Artificial Intelligence and Internet of Things (GCAIoT), Dubai, UAE, 2020, pp. 1–6.

16. Xiao, Liu, Song. Linked Vehicle Model: A Simple Car-Following Model for Automated Vehicles. Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, Vol. 235, No. 2–3, 2021, pp. 854–870.

17. Sharma, Zheng, Bhaskar, Haque M. M. Modelling Car-Following Behaviour of Connected Vehicles With a Focus on Driver Compliance. Transportation Research Part B: Methodological, Vol. 126, 2019, pp. 256–279.

18. Shi, Wang, Zhong, Wang, Han, Wang. A Data-Driven Car-Following Model Based on the Random Forest. World Journal of Engineering and Technology, Vol. 9, 2021, pp. 503–515.

19. Hao, Yang, Shi. Data-Driven Car-Following Model Based on Rough Set Theory. IET Intelligent Transport Systems, Vol. 12, No. 1, 2018, pp. 49–57.

20. Zhang, Lin, Wang, Verwer, Dolan. A Data-Driven Behavior Generation Algorithm in Car-Following Scenarios. Proc., 25th International Symposium on Dynamics of Vehicles on Roads and Tracks (IAVSD ’17), Rockhampton, Queensland, Australia, 2017, pp. 227–232.

21. Zheng, Yan, Jia, Jiang. Feedback Forecasting Based Deep Deterministic Policy Gradient Algorithm for Car-Following of Autonomous Vehicle. 2021 IEEE International Conference on Unmanned Systems (ICUS), Beijing, China, 2021, pp. 396–401.

22. Wei, Paschalidis, Merat, Solernou, Hajiseyedjavadi, Romano. Human-like Decision Making and Motion Control for Smooth and Natural Car Following. IEEE Transactions on Intelligent Vehicles, Vol. 8, No. 1, 2021, pp. 263–274.

23. Przybyla, Taylor, Jupe, Zhou. Simplified, Data-Driven, Errorable Car-Following Model to Predict the Safety Effects of Distracted Driving. 2012 15th International IEEE Conference on Intelligent Transportation Systems, 2012, Anchorage, AK, pp. 1149–1154.

24. Harth, Ali M. S., Kates, Bogenberger. Data-Driven Modelling of Following Behavior in the Approach of Signalized Urban Intersections. 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, United States, 2021, pp. 1721–1728.

25. Buyer, Waldenmayer, Zöllner J. M. Data-Driven Merging of Car-Following Models for Interaction-Aware Vehicle Speed Prediction. 2021 IEEE 24th International Conference on Information Fusion (FUSION), Sun City, South Africa, 2021, pp. 1–8.

26. Wang, Jiang, Lin, Zheng, Wang F.-Y. Capturing Car-Following Behaviors by Deep Learning. IEEE Transactions on Intelligent Transportation Systems, Vol. 19, No. 3, 2018, pp. 910–920.

27. Fei, Hei, Liu. The Driver Time Memory Car-Following Model Simulating in Apollo Platform With GRU and Real Road Traffic Data. Mathematical Problems in Engineering, Vol. 2020, 2020, p. 18.

28. Jones, Walter, Bhadani, Sprinkle. Modeling Human Car-Following Behavior From Demonstration with Recurrent Neural Networks. 2020. https://csl.arizona.edu/sites/default/files/CopyCat%2520Final%2520Paper.pdf.

29. Fan, Guo, Zhao, Wijnands J. S., Wang. Car-Following Modeling Incorporating Driving Memory Based on Autoencoder and Long Short-Term Memory Neural Networks. Sustainability, Vol. 11, No. 23, 2019, p. 6755.

30. Huang, Sun. A Car-Following Model Considering Asymmetric Driving Behavior Based on Long Short-Term Memory Neural Networks. Transportation Research Part C: Emerging Technologies, Vol. 95, 2018, pp. 346–362.

31. Lin, Wang, Zhou, Ding, Wang, Tan. Platoon Trajectories Generation: A Unidirectional Interconnected LSTM-Based Car-following Model. IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 3, 2022, pp. 2071–2081.

32. Cao, Chen. Modeling and Simulating Urban Traffic Flow Mixed With Regular and Connected Vehicles. IEEE Access, Vol. 9, 2021, pp. 10392–10399.

33. Liu, Peeta, Lin. Car-Following Behavior of Connected Vehicles in a Mixed Traffic Flow: Modeling and Stability Analysis. 2018 IEEE 8th Annual International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER), Tianjin, China, 2018, pp. 1085–1088.

34. Rahmati, Hosseini M. K., Talebpour, Swain, Nelson. Influence of Autonomous Vehicles on Car-Following Behavior of Human Drivers. Transportation Research Record: Journal of the Transportation Research Board, 2019. 2673: 367–379.

35. Stabler, Bradley, Morgan, Slavin, Haque. Volume 2: Model Impacts of Connected and Autonomous/Automated Vehicles (CAVs) and Ride-Hailing With an Activity-Based Model (ABM) and Dynamic Traffic Assignment (DTA) – An Experiment. Report No. FHWA-HEP-18-081. U.S. Department of Transportation, Federal Highway Administration, Washington, D.C., 2018.

36. Ahmed H. U., Huang. A Review of Car-Following Models and Modeling Tools for Human and Autonomous-Ready Driving Behaviors in Micro-Simulation. Smart Cities, Vol. 4, 2021, pp. 314–335.

37. Ding, Chen, Peng. An Extended Car-Following Model in Connected and Autonomous Vehicle Environment: Perspective From the Cooperation between Drivers. Journal of Advanced Transportation, Vol. 2021, 2021, p. 2739129.

38. Zhu. Simulation Modeling of Car-Following in Mixed Traffic Flow Based on Multi-Agent System. 20th COTA International Conference of Transportation Professionals, Xi’an, China, 2020, pp. 279–290.

39. Zhu W.-X., Zhang. Analysis of Mixed Traffic Flow With Human-Driving and Autonomous Cars based on Car-Following Model. Physica A: Statistical Mechanics and its Applications, Vol. 496, 2018, pp. 274–285.

40. Ozkan M. F. Modeling Driver Behavior in Car-Following Interactions With Automated and Human-Driven Vehicles and Energy Efficiency Evaluation. IEEE Access, Vol. 9, 2021, pp. 64696–64707.

41. Schoenmakers, Yang, Farah. Car-Following Behavioural Adaptation When Driving Next to Automated Vehicles on a Dedicated Lane on Motorways: A Driving Simulator Study in the Netherlands. Transportation Research Part F: Traffic Psychology and Behaviour, Vol. 78, 2021, pp. 119–129.

42. Lin, Gong, Peeta. Long Short-Term Memory-Based Human-Driven Vehicle Longitudinal Trajectory Prediction in a Connected and Autonomous Vehicle Environment. Transportation Research Record: Journal of the Transportation Research Board, 2021. 2675: 380–390.

43. Wen, Cui, Jian. Characterizing Car-Following Behaviors of Human Drivers When Following Automated Vehicles Using the Real-World Dataset. Accident Analysis & Prevention, Vol. 172, 2022, p. 106689.

44. Hu, Zheng, Chen, Zhang, Sun. Processing, Assessing, and Enhancing the Waymo Autonomous Vehicle Open Dataset for Driving Behavior Research. Transportation Research Part C: Emerging Technologies, Vol. 134, 2022, p. 103490.

45. Liu, Wei, Zhang, Guo. Mutual Information Based Feature Selection for Multivariate Time Series Forecasting. 2016 35th Chinese Control Conference (CCC), Chengdu, China, 2016, pp. 7110–7114.

46. Chiu. Prospect Theory, John Wiley & Sons, Ltd, Hoboken, NJ, 2011.

47. Kahneman, Tversky. Prospect Theory: An Analysis of Decision under Risk. Econometrica, Vol. 47, No. 2, 1979, pp. 263–291.

48. Lundberg S. M., Lee S.-I. A Unified Approach to Interpreting Model Predictions. In Advances in Neural Information Processing Systems (Guyon, Luxburg U. V., Bengio, Wallach, Fergus, Vishwanathan, Garnett, eds.), Curran Associates, Inc., Long Beach, CA, 2017, Vol. 30. pp. 4768–4777.

49. Zhou, Dvornek N. C., Ventola, Duncan J. S. Efficient Shapley Explanation for Features Importance Estimation Under Uncertainty. Springer-Verlag, Berlin, Heidelberg, 2020, pp. 792–801.

50. Wang, Hurwitz, Chand, Jashami, Koll. Integrating Driving Simulator Experiment Data with a Multi-Agent Connected Automated Vehicles Simulation (MA-CAVs) Platform to Quantify Improved Capacity. Pacific Northwest Transportation Consortium (PacTrans) USDOT University Transportation Center for Federal Region 10, University of Washington, 2020. https://rosap.ntl.bts.gov/view/dot/58625.

51. Goncu, Erdagi I. G., Silgu M. A., Celikoglu H. B. Analysis on Effects of Driving Behavior on Freeway Traffic Flow: A Comparative Evaluation of Two Driver Profiles Using Two Car-Following Models. 2022 IEEE Intelligent Vehicles Symposium (IV), Aachen, Germany, 2022, pp. 688–693.

52. Silgu M. A., Erdagi I. G., Goksu, Celikoglu H. B. Combined Control of Freeway Traffic Involving Cooperative Adaptive Cruise Controlled and Human Driven Vehicles using Feedback Control through SUMO. IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 8, 2022, pp. 11011–11025.

Prediction of Car-Following Behavior of Autonomous Vehicle and Human-Driven Vehicle Based on Drivers’ Memory and Cooperation With Lead Vehicle
