Sage Journals: Discover world-class research

Abstract

Modeling the interactive behavior of human drivers is essential for achieving safe and fully autonomous vehicles. Unfortunately, most decision-making systems employed in current autonomous vehicles rely on complex deep neural network models that function as black boxes with opaque reasoning that hampers human interpretation. Drawing upon the needs theories endorsed by psychologists and driving-related psychological research, we summarize five fundamental driving needs underlying the driver’s behavior: safety, dominance, achievement, order, and relatedness. Leveraging the behavior selection module from general cognitive architectures, we propose a decision-making model explicitly tailored for autonomous vehicles, comprising three distinct modules: needs assessment, motivation generation, and behavior selection. We conducted experiments to evaluate the proposed model using a self-developed 2D simulator based on Unity. The results intuitively visualized the motivation and behavior of self-driving vehicles. This model demonstrates remarkable proficiency in handling routine tasks, such as independent and complete driving tasks, intersection navigation, and maneuvering among multiple vehicles.

Keywords

Decision-making autonomous vehicles interpretability needs theory cognitive architecture

Introduction

Accurate modeling of the interactive behavior of human drivers is crucial for achieving safe and full vehicle autonomy. It enables better prediction of human driver intentions and movements and is valuable for generating more human-like decisions and trajectories in autonomous vehicles. However, the current autonomous vehicle decision-making systems rely predominantly on complex deep neural network models that operate as black boxes with opaque reasoning that hinders human interpretation. The pursuit of a transparent and interpretable model in safety-critical domains has recently attracted attention, surpassing performance considerations alone.^1–3

This paper presents a decision-making model that considers the psychology of human drivers. Drawing upon Maslow and Reiss’ theories,^4,5 we consider every action driven by an underlying motive. Therefore, based on various psychological theories of needs and driving-related psychological research, this study summarizes and analyzes five fundamental driving needs that underlie driver behavior: safety, dominance, achievement, order, and relatedness. Referring to the behavior selection module of general cognitive architectures, we designed a decision-making model for autonomous vehicles based on these driving needs. The model comprises three modules: needs assessment, motivation generation, and behavior selection. The needs assessment module continuously monitors the real-time level of each need; the motivation generation module compares the priorities of urges conveyed by the needs assessment module to determine the urge with the highest priority. Finally, the behavior-selection module selects an action that aligns with the determined motivation.

Related work

Autonomous-driving decision-making has two primary approaches: behavior planning and end-to-end learning.⁶ Behavior planning models, particularly those based on conditional representations, such as finite-state machines, are simple and effective decision-making methods. They discretize the action space and are well-suited for handling the interdependent behaviors of traffic participants in natural traffic environments, where the number of interactions exponentially increases with the number of individuals. For example, MIT’s Talos utilizes a navigation module for high-level behavioral planning in scenarios such as intersections, crossing, merging, and passing.⁷ Mellon’s Boss takes a less granular approach by categorizing scenarios into three environments: roads, intersections, and zones. Correspondingly, the behavior layer of Boss distinguishes among three behaviors: lane driving, intersection handling, and achieving a zone pose.⁸ However, finite-state machines, which are widely employed, have limitations in coping with large and complex systems, making it challenging to scale them up in line with the complexity of driving environments. A substantial number of states and conditions must be considered to extend these methods to more general urban traffic scenarios. This entails an enormous workload and is prone to errors during the pre- and post-maintenance stages.^9–11

In contrast to finite state machines that rely entirely on determined state transfer and decision rules, the partially observable Markov decision process (POMDP) provides a framework to address dynamic decision problems in imperfectly observable states.⁶ The Karlsruhe Institute of Technology has been exploring the application of POMDPs in decision-making for autonomous driving in the post-DARPA era. In 2014, Brechtel et al. introduced a continuous POMDP-based decision process that incorporated the poses and speeds of relevant road users. The experimental results demonstrated successful merging of the ego vehicle into the target lane without severe occlusion of the perceptual system.¹² In 2017, Hubmann et al. proposed a unified POMDP decision framework that not only considered the intentions and predicted movements of surrounding vehicles, as in previous works, but also focused on intersections of any geometry with any number of traffic participants. By pre-planning routes and speeds prior to simulation, the authors reduced the action space for real-time planning.⁹ The exploration of POMDP at the Karlsruhe Institute of Technology has yielded promising results in simulating unsignalized intersections. However, applicable scenarios are currently limited, and the authors hope that future research will expand the scope to complex environments such as multiple lanes. In traffic environments, autonomous vehicles require frequent decision-making. However, the challenge with POMDP lies in the time-consuming computational process and the difficulty of real-time application in online driving environments unless the complexity of the action space or environment can be reduced.⁶

The end-to-end approach to automated driving can be traced back to the development of the ALVINN by Carnegie Mellon University in 1989.¹³ Inspired by ALVINN, the NVIDIA team introduced a convolutional neural network in 2016 that directly generated steering control commands for vehicles using raw images from three front-facing cameras. This network is extensive and features 20 million connections and 250,000 parameters. The experimental results demonstrated the ability of the test vehicle to successfully navigate various road environments and diverse weather conditions.¹⁴ In 2018, Bansal et al. developed the ChauffeurNet decision-making neural network. It uses a reconstructed top-down image generated through perception system processing as the input. This image encapsulates complex urban-environment information such as vehicles, roads, signals, and a predefined route. The image is first processed by FeatureNet to extract the features, which are then used by AgentRNN to generate driving paths. Finally, the controller converts these paths into steering-wheel angles and accelerations. The ChauffeurNet model successfully operates a real-world vehicle although improvements can be made when compared with traditional motion-planning methods.¹⁵ Chen et al. adopted a similar technical approach at the University of California, Berkeley. They developed a deep imitation learning trajectory planning module utilizing a convolutional neural network with a fully connected layer containing 1000 units connected after the VGGNet16. The output layer represents the relative coordinates of the target sequence. Despite encountering errors in roundabouts, unstructured areas, and two-lane roads, the model demonstrated the ability to follow designated routes in multiple road environments. It responded to surrounding objects and adjusted to different traffic signal states, achieving results using only 120,000 data frames.¹⁶

Some studies argue that the behavior planner, tasked with making high-level decisions, and the trajectory planner, responsible for generating feasible trajectories, are commonly independently developed, lacking a shared objective. This separation changes the behavior planner, adversely affecting the fine-tuned trajectory planner, thus failing to fully exploit their strong coupling. Consequently, studies have attempted to address this issue by integrating behavior planning with motion planning. Uber’s Sadat et al. proposed an approach featuring an interpretable cost function and a joint learning algorithm that learns a shared cost function employed by both their behavior and trajectory components. Experiments have demonstrated that the jointly learned planner performs better in terms of both similarity to human driving and other safety metrics.¹⁷ In another study, Singh et al. proposed a bi-level optimization that can simultaneously search for the optimal higher-level behavioral decisions along with the lower-level trajectories necessary for executing them. Extensive simulations demonstrated that their approach outperforms state-of-the-art model predictive control and reinforcement learning approaches in terms of collision rate while remaining competitive in driving efficiency.¹⁸

Although safety and ethical concerns persist,¹⁹ both the end-to-end approach and other AI-based decision-making methods have achieved significant advancements. Current automated driving decision methods rely predominantly on deep neural networks and fall under the connectionist approach of cognitive science. These models often process data in a black-box manner and lack interpretability. Although these models yield accurate prediction results, explaining their underlying logic and decision-making processes has become challenging. In the event of problems during simulation tests, the interpretability of autonomous vehicle decisions is crucial for analyzing the triggering factors of accidents and promptly troubleshooting the tested autonomous driving system.^20,21

Decision-making model

General structure

The decision-making model presented in this paper was inspired by the CLARION²² and MicroPsi²³ cognitive architectures. Figure 1 illustrates this model following a driving-need-driven approach. It comprises three modules: needs assessment, motivation generation, and behavior selection. The needs assessment module continuously monitors real-time needs. When an autonomous vehicle experiences an urge for one or more needs in its current environment, the needs assessment module sends the urge to the motivation-generation module. The motivation-generation module compares the priorities of the urges received from the needs-assessment module. Urges with higher levels are assigned higher priority and become the driving force. Even if an action selected at a lower level is already in progress, it is immediately terminated to relinquish decision dominance. Finally, the behavior selection module makes further assessments and determines the appropriate behavioral action that aligns with motivation, including the goal of the action and the timeframe to achieve it.

Figure 1.

General structure of the decision-making model. It comprises three modules: needs assessment, motivation generation, and behavior selection.

Needs assessment

Real-time evaluation and monitoring of the urge $u_{d_{i}}$ for each need $d_{i}$ were conducted using the needs assessment module. When an autonomous vehicle experiences urges for one or more needs in a given environment, the needs assessment module promptly transmits these urges to the motivation generation module. Table 1 illustrates the five driving requirements: safety $d_{sfty}$ , dominance $d_{dom}$ , achievement $d_{achv}$ , order $d_{ord}$ , and relatedness $d_{rel}$ . These needs are derived from various human needs theories and through extensive research on the psychological motivations of drivers. The corresponding urges for each requirement are described below.

Table 1.

Correspondence of the five driving needs in different theories.^24–32

Theory	Safety	Dominance	Achievement	Order	Relatedness
Instinct Theory²²	Flight				Pugnacity
	Repulsion				Gregariousness
Theory of Personology²³	Harmavoidance	Dominance	Achievement	Blamavoidance	Abasement
					Rejection
	Defendance				Narcism
					Affiliation
					Succorance
Achievement Motivation Theory²⁴		Power	Achievement		Affiliative motives
Hierarchy of Needs⁴	Safety		Self-actualization		Belonging
ERG Theory²⁵	Existence		Growth		Relatedness
16 Basic Desires Theory²⁶	Tranquility	Power		Honor	Acceptance
					Social contact
Self-Determination Theory (SDT)²⁷		Competence			Relatedness
Driving Cost and Benefit²⁸	Life endangerment	Sense of control
Multiple Comfort Zone Model³³	Safety margin		Good progress of trip
Road User Needs³⁶		Speed control	Reaching the destination	Rule compliance	Traffic tasks
Interacting Mental Models²⁹	Risk or safety			Compliance to social norm
CLARION³⁰	Avoiding physical dangers			Honor	Affiliation and belongingness
	Avoiding unpleasant stimuli				Affiliation and belongingness
MicroPsi⁴⁰	Physiological needs	Competence			Affiliation

Urge for safety

Safety is a fundamental requirement for survival, which compels individuals to avoid risks and potential harm when faced with external threats. The significance of safety needs in driving is evident from the fact that, as of 2008, approximately 10% of the articles published in the journal Human Factors focused on driving and driving safety.³³ Meeting safety needs is crucial for drivers because it allows them to feel secure and comfortable while driving, minimizing excessive psychological loads. Drivers are compelled to maintain an adequate distance from stationary entities, such as road boundaries and obstacles, and dynamic entities, such as vehicles and pedestrians. Summala’s “multiple comfort zone” model introduces the concept of a “safety margin,” which defines safety requirements in terms of time and space. Once an entity enters the driver’s subjectively constructed “safety margin,” the driver experiences discomfort and fear, prompting them to take appropriate actions, such as braking or steering, to pull away and mitigate potential risks.^34,35

The urge for safety $u_{sfty}$ was exclusively determined by the safety zone $zon e_{sfty}$ , which is a virtual rectangular area surrounding an autonomous vehicle. This region represents the minimum safe distance that an autonomous vehicle aims to maintain from other dynamic entities. Its dimensions are influenced by the size, speed, and acceleration of the vehicle. When any other dynamic entity $entit y_{dyn}$ enters this region, the safety need $d_{sfty}$ triggers a safety urge $u_{sfty}$ , as indicated by equation (1).

\begin{matrix} u_{sfty} = {\begin{matrix} 0, zon e_{sfty} \cap entit y_{dyn} = \emptyset \\ 1, zon e_{sfty} \cap entit y_{dyn} \neq \emptyset \end{matrix} \end{matrix}

(1)

Urge for dominance

Dominance refers to the desire to exert influence over the environment and fulfill one’s aspirations. In the context of driving, the need for dominance is evident through a driver’s active control of speed and direction. Summala suggests that whether driving at a normal speed or pushing the limits for excitement, drivers experience a sense of “driving pleasure,” which arises from fulfilling their desire for control over the vehicle.³⁶ The discretionary lane changes observed in the microscopic traffic-flow models can be attributed to the need for dominance.³⁷ Drivers can change lanes to adjacent lanes to maintain their desired speed without being impeded by slower traffic in the same lane, thereby ensuring control over the driving situation.

The need for dominance $d_{dom}$ encompasses the driver’s desire to control the vehicle’s direction and speed, particularly in the pursuit of higher speeds. It generates an urge $u_{dom}$ in two specific situations: when there is another dynamic entity $entit y_{dyn}$ within a certain distance ahead, and when there is a discrepancy between the current $v_{curr}$ and desired speeds $v_{expt}$ .

\begin{matrix} u_{dom} = {\begin{matrix} 0, u_{do m_{d} yn} + u_{do m_{v}} = 0 \\ 1, u_{do m_{d} yn} + u_{do m_{v}} \neq 0 \end{matrix} \end{matrix}

(2)

where $u_{dom_dyn}$ is the impulse related to the front vehicle during the control urge as expressed in equation (3) and $u_{dom_v}$ is the urge related to vehicle speed, as shown in equation (4).

\begin{matrix} u_{do m_{d} yn} = {\begin{matrix} 0, entit y_{dyn} = None \\ 1, entit y_{dyn} \neq None \end{matrix} \end{matrix}

(3)

\begin{matrix} u_{do m_{v}} = {\begin{matrix} 0, | v_{curr} - v_{expt} | \leq ϵ \\ 1, | v_{curr} - v_{expt} | > ϵ \end{matrix} \end{matrix}

(4)

Urge for achievement

This requirement for achievement is typically purposeful. Accomplishing a goal or task is necessary to achieve ambition.³⁸ Summala’s “good or expected driving progress” concept captures the driver’s need for achievement.³⁴ In microscopic traffic models, mandatory lane changing is typical behavior driven by the need for achievement, particularly when reaching a destination.³⁵ However, Vechione et al. present slightly biased situations to justify mandatory lane changes, such as unsustainability in the current lane or the need to cooperate with intersection turns.³⁹

Nonetheless, these situations only partially encompass the range of mandatory lane changes. For example, if a driver intends to travel straight and the left-turn lane is not part of the planned route, a mandatory lane change is necessary if the driver is in the left-turn lane at that point. Therefore, a mandatory lane change occurs when the current lane deviates from the planned route. Conversely, when drivers are already in a lane that aligns with the planned route, they would refrain from making a lane change to fulfill their driving task, driven by the need for achievement. When establishing a planned route from the initial point to the destination, it is imperative that autonomous vehicles adhere to this route. Hence, throughout the journey, an autonomous vehicle must continuously assess whether the current lane aligns with the planned route. This evaluation involves determining whether a lane change is required. Specifically, if the estimated distance for a lane change $ds t_{curr \to dest}$ exceeds the feasible lane change distance $dst'_{rmng}$ , the need for achievement $d_{achv}$ will generate an urge for achievement $u_{achv}$ . Conversely, if the estimated distance for a lane change $ds t_{curr \to dest}$ falls short of the executable lane-change distance $dst'_{rmng}$ , the need for achievement $d_{achv}$ will not generate the urge for achievement $u_{achv}$ , as illustrated in equation (5).

\begin{matrix} u_{achv} = {\begin{matrix} 0, ds t_{curr \to dest} - dst'_{rmng} \leq 0 \\ 1, ds t_{curr \to dest} - dst'_{rmng} > 0 \end{matrix} \end{matrix}

(5)

Before determining the achievement urge, the nearest candidate lane $lan e_{dest}$ that aligns with the predetermined route $Route$ to the current lane $lan e_{curr}$ must be identified. Assuming that several lanes are available within the current section of the road that lead to the preplanned route, the set comprising these lanes can be represented by equation (6).

\begin{matrix} CandidateLaneSet = {lane, lane \in Route \cap Sec t_{curr}} \end{matrix}

(6)

The lanes nearest the current lane from the set $CandidateLaneSet$ are selected, and the function $ID (x)$ is used to obtain the lane number.

\begin{matrix} TargetLaneSet = \\ {lane \in CandidateLaneSet | argmin (dstID (lane))}, \end{matrix}

(7)

where

\begin{matrix} dstID (lane) = abs (ID (lane) - ID (lan e_{curr})) \end{matrix}

(8)

Because more than one candidate target lane closest to the current lane $lan e_{curr}$ can exist, all candidate target lanes are represented as a set $TargetLaneSet$ , and one lane is randomly selected as the target lane $lan e_{dest}$ . Next, the estimated lane-change distance $ds t_{curr \to dest}$ from the current lane to the target lane is calculated. Assume the following:

\begin{matrix} 0 < ID (lan e_{curr}) \leq ID (lan e_{dest}) \end{matrix}

(9)

Using the average speed ${\overset{↼}{v}}_{i}$ of each lane $lan e_{i}$ in section $Sec t_{curr}$ , we estimate the distance to travel from the current lane $lan e_{curr}$ to the target lane $lan e_{dest}$ in the longitudinal direction $dis t_{curr \to dest}$ .

\begin{matrix} ds t_{curr \to dest} = \sum_{i = curr}^{t - 1} ({\overset{↼}{v}}_{i} τ_{w} + \frac{{\overset{↼}{v}}_{i} + {\overset{↼}{v}}_{i + 1}}{2} τ_{c}), \end{matrix}

(10)

where $τ_{w}$ is the estimated lane-change waiting time and $τ_{c}$ is the estimated lane-change duration. The executable lane change distance $dst'_{rmng}$ can be calculated using equation (11).

\begin{matrix} dst'_{rmng} = ds t_{rmng} - ds t_{0}, \end{matrix}

(11)

where $dis t_{0}$ is the reserved distance of the road section and $dis t_{rmng}$ is the actual remaining distance from the current position to the end of the road.

Urge for order

The need for an order pertains to adhering to regulations and avoiding penalties. In the context of driving, these regulations include formal traffic laws,^36,40 such as obeying traffic signals, staying within designated lanes, and adhering to speed limits, and informal rules.⁴¹ Adhering to these rules helps drivers avoid accidents and penalties,³³ enhances the overall efficiency driving, minimizes congestion and accidents, and saves time and expenses for both drivers and other road users. A straightforward example is when approaching a signalized intersection that follows the instructions displayed by traffic signals, either coming to a stop or proceeding, depending on the signal’s indication. In the absence of traffic signals or police instructions, drivers follow rules applicable to unsignalized intersections, such as yielding to straight-through traffic, giving way to vehicles on the main road at a junction, and prioritizing right turns.

While the need for order is relevant throughout the journey, certain actions, such as staying within lanes and adhering to traffic regulations, are already implicit in the motion planning process. Therefore, the current model explicitly incorporates the need for order as autonomous vehicles approach intersections. As an autonomous vehicle approaches an intersection, the need for an order $d_{ord}$ triggers the order urge $u_{ord}$ , as described in equation (12).

\begin{matrix} u_{ord} = {\begin{matrix} 0, ds t_{x} \geq ds t_{rsv} \\ 1, ds t_{x} < ds t_{rsv} \end{matrix} \end{matrix}

(12)

An order urge is generated when the distance $ds t_{x}$ between the vehicle and intersection falls below the desired distance $ds t_{rsv}$ . Conversely, no order urge is generated when the distance from the intersection exceeds the desired distance.

Urge for relatedness

The need for relatedness encompasses the desire to establish connections with others, manifesting as positive experiences, such as acceptance, respect, and reciprocity, or negative experiences, such as refusal to cooperate, rejection, and confrontation. While Summala’s “multiple comfort zones” model primarily focuses on individual needs, it does not extensively address interpersonal dynamics.^33,34 Rumar defines relatedness as a traffic objective that involves interacting with other traffic participants in a manner that maintains mobility while avoiding collisions.³⁶ However, achieving this goal in practice is challenging. For instance, when a vehicle in an adjacent lane merges into the driver’s lane, the driver may adjust the speed and increase the following distance to facilitate merging if they accept or agree with the other vehicle’s behavior. Conversely, if the driver rejects or opposes merging, they may maintain or slightly increase their speed while reducing the following distance to prevent the other vehicle from merging. It is evident that maneuverability is compromised in the former scenario, whereas collision risk is heightened rather than mitigated in the latter scenario.

The need for relatedness encompasses the desire to cooperate with other traffic participants, seek approval and recognition, and express refusal. In the context of autonomous vehicles, other dynamic entities $entit y_{dyn}$ actively communicate their requests to the vehicle when affected by its actions. An autonomous vehicle generates a relatedness urge upon receiving a request $RE Q_{dyn}$ , as expressed in equation (13).

\begin{matrix} u_{rel} = {\begin{matrix} 0, RE Q_{dyn} = True \\ 1, RE Q_{dyn} = False \end{matrix} \end{matrix}

(13)

Motivation generation

Psi’s theory proposes that all goal-directed actions stem from motivation.⁴² Reproducing these motivations enhances the comprehensiveness of the decision model and provides a deeper understanding of the underlying mental processes. Once the environment triggers the generation of urges corresponding to needs, the motivation generation module compares the priorities of all available urges. The urge for higher priority is the prevailing motivation. Even if an action planned at a lower level is currently being executed, it must be terminated immediately to relinquish the dominance of the decision. The priority order of each urge within the motivation generation module is outlined below.

\begin{matrix} p_{sfty} > p_{ord} > p_{achv} > p_{rel} > p_{dom} \end{matrix}

(14)

In the above equation, $p_{i}$ denotes the priority of the urge $u_{i}$ . Safety urge had the highest priority, followed by order, achievement, relatedness, and dominance. In the absence of an urge, the decision-making process enters an unmotivated behavioral phase.

Behavior selection

Alternative behaviors and actions

This section confines the actions that the driver can perform $B$ to changing the speed $B_{changeSpeed}$ , changing the lane $B_{changeLane}$ , and maintaining the current state $B_{unchange}$ , as shown in equation (15).

\begin{matrix} B = {B_{changeSpeed}, B_{changeLane}, B_{unchange}} \end{matrix}

(15)

Changing speed

The changing speed $B_{changeSpeed}$ refers to the behavior of adjusting the velocity of the vehicle, either by increasing or decreasing the speed to reach a specific target. This behavior includes maintaining distance $a_{keep_dst}^{cs}$ , emergency braking $a_{emg_brk}^{cs}$ , adjusting speed $a_{adj_lmt}^{cs}$ , slowly stopping $a_{stop_slw}^{cs}$ , and yielding $a_{plt_dec}^{cs}$ , as shown in equation (16).

\begin{matrix} B_{changeSpeed} = {a_{keep_dst}^{cs}, a_{emg_brk}^{cs}, a_{adj_lmt}^{cs}, a_{stop_slw}^{cs}, a_{plt_dec}^{cs}} \end{matrix}

(16)

Both action $a_{keep_dst}^{cs}$ and action $a_{plt_dec}$ involve maintaining a specific distance from the preceding vehicle; however, they differ in their target objects. $a_{keep_dst}^{cs}$ aims to maintain a safe distance from the vehicle directly ahead in the current lane, whereas $a_{plt_dec}$ targets the vehicle ahead that is approaching to merge into the current lane. The completion time $t_{f}$ of the two actions is the headway time distance $t_{pre_h}$ of the driver preference. The target states are expressed using equation (17), where $x_{f}, v_{f}, and a_{f}$ denote the target position, target speed, and target acceleration at the end of the maneuver, respectively; $x_{ve h_{fnt}}, v_{ve h_{fnt}}, and a_{ve h_{fnt}}$ are the position, speed, and acceleration of the target vehicle, respectively.

\begin{matrix} {\begin{matrix} x_{f} = x_{ve h_{fnt}} \\ v_{f} = v_{ve h_{fnt}} \\ a_{f} = a_{ve h_{fnt}} \\ t_{f} = t_{pr e_{h}} \end{matrix} \end{matrix}

(17)

$a_{emg_brk}^{cs}$ and $a_{stop_slw}^{cs}$ aim to bring the vehicle to a stop with a target speed of zero. However, they differ in execution characteristics. $a_{emg_brk}^{cs}$ focuses on stopping the vehicle as quickly as possible, resulting in shorter execution time without specifying the target location. By contrast, $a_{stop_slw}^{cs}$ has a predetermined target location for the vehicle to stop.

Finally, $a_{adj_lmt}^{cs}$ is designed to enable the vehicle to reach the target speed within a specific timeframe. The target speed in this action is determined by the driver’s desired speed $v_{\exp}$ considering the road conditions. The execution time is influenced by driver comfort preferences and vehicle performance.

Changing lane

Changing lane $B_{changeLane}$ refers to the transition from one lane to another. This behavior includes four actions: mandatory lane changing $a_{dest_\ln}^{cl}$ , discretionary lane changing $a_{obs_\ln}^{cl}$ , random lane changing $a_{rnd_\ln}^{cl}$ , and aborting lane changing $a_{abt_\ln}^{cl}$ , as shown in equation (18).

\begin{matrix} B_{changeLane} = {a_{dest_\ln}^{cl}, a_{obs_\ln}^{cl}, a_{rnd_\ln}^{cl}, a_{abt_\ln}^{cl}} \end{matrix}

(18)

Mandatory lane changing

$a_{dest_\ln}^{cl}$ involves changing lanes toward the nearest target lane. While multiple lanes may separate the current lane $lan e_{curr}$ and target lane, this action only executes a single lane change, as illustrated in Figure 2. In contrast to the random lane selection utilized to generate the achievement urge for distance estimation, the target lane for mandatory lane changing and the lane for $a_{dest_\ln}^{cl}$ action can be determined through the following processes:

Figure 2.

Interrelationships between lanes. Current lane $lan e_{curr}$ , target lane for mandatory lane-changing $lan e_{dest}$ and lane for the $a_{dest_\ln}^{cl}$ action $lan e_{t}$ .

After determining the set $TargetLaneSet$ , the set of lanes for a single lane change $TempLaneSet$ is the lane that is one unit away from the current lane $lan e_{curr}$ in the direction of the mandatory target lane.

\begin{matrix} TempLaneSet = \\ {lane \in Sec t_{curr} | dstID (lane) \in LaneStepSet}, \end{matrix}

(19)

where

\begin{matrix} LaneStepSet = \\ {sgn (ID (lane) - ID (lan e_{curr})), lane \in TargetLaneSet} . \end{matrix}

(20)

We then check if the lanes in $TempLaneSet$ have sufficient space to support a lane change.

\begin{matrix} AvailableLaneSet = \\ {lane \in TempLaneSet | spareSpace (lane) = true}, \end{matrix}

(21)

where $spareSpace (\cdot)$ is a function that determines whether space exists in a lane. If more than one lane still exists in the set $AvailableLaneSet$ , then any lane can be considered as the single lane-changing target $lan e_{t}$ of action $a_{dest_\ln}^{cl}$ , and $lan e_{dest}$ is the corresponding target lane for mandatory lane changing.

Discretionary lane changing

$a_{obs_\ln}^{cl}$ changes lanes to adjacent lanes, whose average speed is faster than that of the current lane. The set of adjacent lanes $AdjacentLaneSet$ is determined using equation (22).

\begin{matrix} AdjacentLaneSet = {lane, abs (ID (lane) - ID (lan e_{curr})) = 1} \end{matrix}

(22)

We then filter adjacent lanes using average speed to retain lanes with faster average speeds.

\begin{matrix} FastLaneSet = {lane \in AdjacentLaneSet | {\overset{↼}{v}}_{lane} > {\overset{↼}{v}}_{curr}}, \end{matrix}

(23)

where denotes the current lane average speed and ${\overset{↼}{v}}_{lane}$ denotes the lane $lane$ average speed. We check whether the lanes in $FastLaneSet$ have sufficient space to support lane changes.

\begin{matrix} AvailableLaneSet = \\ {lane \in FastLaneSet | spareSpace (lane) = true} \end{matrix}

(24)

If set $AvailableLaneSet$ still has two lanes, we select any lane from it as the target lane for this action.

Random lane changing

$a_{rnd_\ln}^{cl}$ randomly selects adjacent lanes for lane changes. After the set of adjacent lanes, $AdjacentLaneSet$ is determined using equation (22), the lanes are identified to determine whether there is sufficient space to support lane changes.

\begin{matrix} AvailableLaneSet = \\ {lane \in AdjacentLaneSet | spareSpace (lane) = true} \end{matrix}

(25)

If there are still two lanes in set $AvailableLaneSet$ , any lane from it is taken as the target lane for this action.

Aborting lane changing

During the execution of any of the aforementioned lane-changing actions, $a_{abt_\ln}^{cl}$ can interrupt the ongoing lane-changing and revert to the lane before initiating the lane-changing action.

Maintaining unchanged

$B_{unchanage}$ involves neither lane changing nor altering the speed of the vehicle, and instead maintains its current speed $v_{curr}$ and continues along the current lane $lan e_{curr}$ . This set of behaviors encompasses both the normal maintenance action, $a_{mnt_unch}^{cn}$ , and the alert-and-maintain action, $a_{alert}^{cn}$ . It is essential to highlight that $a_{alert}^{cn}$ emits a cooperative signal to other traffic participants who will respond to the signal based on their own needs and preferences.

\begin{matrix} B_{unchange} = {a_{mnt_unch}^{cn}, a_{alert}^{cn}} \end{matrix}

(26)

Behavioral decision for each motivation

In this study, the behavioral decisions for each motivation were represented using individual behavior trees. Behavior trees are widely adopted for designing and implementing intelligent behaviors in various systems. They offer a hierarchical structure that enables the creation of complex and adaptive behaviors by organizing smaller behavioral units known as nodes.

The execution of a behavior tree commences at the root, which sends ticks to its child nodes at a predefined frequency. A tick serves as an enabling signal to execute a child node. When a node in the behavior tree is eligible for execution, it returns a status to its parent, indicating whether it is “running” if its execution is ongoing, “success” if it has achieved its goal, or “failure” if it has not.

This study utilized three control nodes (sequence, selector, and random selector), two execution nodes (action and condition), and one decorator node (inverter). The sequence node (Figure 3(a)) seeks and executes the first child who has not yet succeeded. The selector node (Figure 3(b)) identifies and executes the first child that does not fail. The random selector node (Figure 3(c)) randomly selects and executes one of its child nodes. The condition node (Figure 3(d)) assesses a specific condition and returns a success or failure status based on the evaluation. An action node (Figure 3(e)) represents a specific action or behavior that the agent or system should perform. The inverter node (Figure 3(f)) inverts the result of its child node.

Figure 3.

Nodes in the behavior tree. This study utilizes three control nodes ((a) sequence, (b) selector, and (c) random selector), two execution nodes ((d) condition and (e) action), and one decorator node ((f) inverter).

Actions for safety motivation

The behavioral decisions driven by safety motivation are shown in Figure 4. It begins by determining whether the hazard originates in the longitudinal or the lateral direction. Suppose that the dynamic entity ahead $entit y_{dyn}$ approaches the autonomous vehicle’s safety zone too closely, triggering the safety motive $m_{sfty}$ , the ego vehicle initiates the action $a_{keep_dst}^{cs}$ to adjust the distance between the two vehicles. If this action fails to eliminate the imminent danger, and $entit y_{dyn}$ remains within the safety zone $zon e_{sfty}$ , $a_{emg_brk}^{cs}$ is activated to rapidly reduce the speed of the vehicle, with the aim of creating a more significant gap between the vehicles until the hazard is resolved or the vehicle stops.

Figure 4.

Behavior tree for safety motivation.

An additional emergency braking action, $a_{emg_brk}^{cs}$ , is included in the sequence node to account for potential motion planning failures in $a_{keep_dst}^{cs}$ . A selector connects the two parts.

Suppose that the ego vehicle is in the process of changing lanes and becomes too close to a lateral adjacent dynamic entity. In this case, the lane-change action must be promptly aborted and the vehicle should be returned to the original lane via $a_{abt_\ln}^{cl}$ . If the autonomous vehicle is not engaged in a lane-change process, it decelerates using $a_{emg_brk}^{cs}$ and swiftly moves away from the longitudinal direction to avoid a collision with the adjacent lateral entity.

Actions for dominance motivation

When a dominance motive is present, it is essential to differentiate whether it arises from an urge associated with the preceding vehicle $u_{ctl_dyn}$ or from an urge related to the speed of the ego vehicle $u_{ctl_v}$ .

If the preceding vehicle enters the ego vehicle’s field of view and its speed is slower than the ego vehicle’s current speed, the motive is related to the preceding vehicle. In this case, the ego vehicle can maintain its current motion and signal the preceding vehicle via $a_{alert}^{cn}$ , to either appropriately increase its speed or change lanes to faster adjacent lanes.

If the current lane is unobstructed by the preceding vehicle, or if there is no preceding vehicle at all and the ego vehicle’s speed is lower than the desired speed, it suggests that the dominance motive is speed-dependent. In this scenario, the ego vehicle can engage in a speed adjustment action $a_{adj_lmt}^{cs}$ . The behavioral decisions driven by dominance motivation are shown in Figure 5.

Figure 5.

Behavior tree for dominance motivation.

Actions for achievement motivation

The behavioral decision process driven by the achievement motive begins by assessing whether the current lane $lan e_{curr}$ is among the set $CandidateLaneSet$ . If it is not considered a candidate target lane, $a_{dest_\ln}^{cl}$ is required to transition toward $lan e_{dest}$ . Conversely, if the current lane corresponds to the target lane, the driving state remains unchanged via $a_{mnt_unch}^{cn}$ because the remaining distance is deemed insufficient at this stage. The behavioral decisions driven by achievement motivation are shown in Figure 6.

Figure 6.

Behavior tree for achievement motivation.

Actions for order motivation

Behavioral decisions driven by the order motive are currently governed by traffic rules applicable to unsignalized intersections that dictate that vehicles proceed sequentially based on their arrival order at the intersection. If there are preceding vehicles traversing the intersection, vehicles approaching the intersection must stop at the designated stop line via $a_{stop_slw}^{cs}$ and must maintain their stationary state until it is their turn to proceed via $a_{mnt_unch}^{cn}$ . The behavioral decisions driven by order motivation are depicted in Figure 7.

Figure 7.

Behavior tree for order motivation.

Actions for relatedness motivation

The behavioral decision process driven by the relatedness motive is divided into three distinct scenarios. First, suppose that the ego vehicle is undergoing a lane change when it receives a request from the vehicle behind it. In this case, it has the option to either proceed with the ongoing lane-change action or abort the lane change and return to the original lane via $a_{abt_\ln}^{cl}$ , based on a predefined preference probability. Given the ambiguity of the source of the lane-change action at this point, a proxy action $a_{delegate}^{cl}$ is employed to represent the ongoing lane-change action originating from a lower-level motive.

Second, suppose that the ego vehicle is in a normal driving state and does not undergo a lane change when it receives a request. In this case, it can respond by selecting one of three actions: adjusting the speed $a_{adj_lmt}^{cs}$ , maintaining the current motion $a_{mnt_unch}^{cn}$ , or randomly changing lanes to an adjacent lane $a_{rnd_\ln}^{cl}$ .

Third, when the ego vehicle receives a request from the vehicle ahead, a possible scenario arises in which a vehicle in an adjacent lane intends to merge into the ego vehicle’s lane. In response, the ego vehicle can choose to either accept the request and ensure a safe distance by slowing down and yielding via $a_{plt_dec}^{cs}$ or can decline the request and maintain its current state to prevent lane change via $a_{mnt_unch}^{cn}$ . The behavioral decisions driven by the relatedness motivation are shown in Figure 8.

Figure 8.

Behavior tree for relatedness motivation.

Unmotivated

When none of the motives are fulfilled, the autonomous vehicle resorts to selecting one of the two actions: $a_{mnt_unch}^{cn}$ and $a_{rnd_\ln}^{cl}$ . To mitigate the risk of motion planning failure associated with action $a_{rnd_\ln}^{cl}$ , $a_{mnt_unch}^{cn}$ was incorporated outside the random selector node. This inclusion ensures reliable functioning of the behavioral decision process. The behavioral decisions of the unmotivated state are shown in Figure 9.

Figure 9.

Behavior tree for the unmotivated state.

Experiments

The autonomous driving model employed for the experimental validation in this study adopts Michon’s three-level driver behavior model, which includes strategic, maneuvering, and control levels.⁴³ At the strategic level, path planning was accomplished using the A* algorithm based on the road network. The maneuvering level was further divided into behavioral and motion planning. The decision-making process utilizes the model proposed in this study, whereas the motion planning aspect employs the analytical calculation method outlined by Li.⁴⁴ Lastly, the control level assumes an ideal “unit 1” scenario, where vehicle control and dynamics are optimized. In other words, the ego vehicle precisely follows the trajectory generated by the planning module.

The experiments were performed on a MacBook Pro (16-inch, 2019) with a 2.3 GHz Intel Core i9 processor, 32 GB 2667 MHz DDR4 memory, and AMD Radeon Pro 5500M with 8 GB of graphics memory. The software employed in this study was a custom-developed 2D simulator built on Unity 2022.2.0b12,⁴⁵ and all programming tasks were implemented using the C# language. Each vehicle within the simulation utilized the same decision-making model, but with distinct preference settings. Given the multifaceted nature of the model, the evaluation process involved conducting three separate experiments to validate the different features. The experiments were conducted as follows:

Completion of the assigned driving task: In this experiment, a vehicle traveled from its designated starting point to a predefined destination. The objective was to assess the ability of the model to navigate predetermined routes effectively.

Handling unsignalized traffic junctions: Four vehicles converge at a junction sequentially, and each vehicle must wait for the preceding vehicles to exit the junction before entering. This experiment focused on the capability of the model to navigate through unsignalized junctions.

Weaving ability in multilane traffic: A vehicle is expected to maneuver skillfully through groups of vehicles from the rear of a multilane highway, allowing it to increase its speed. This experiment evaluated the weaving ability of the model in multilane traffic situations.

Completion of assigned driving task

The autonomous vehicle embarked on its journey from lane 1 on Road 2392, with the ultimate destination being Road 2452. The planned route, indicated by the green lines in Figure 10, encompassed road IDs 2392, 2391, 2431, 2432, 2289, 2655, 2446, 2444, 2365, 2361, 2367, and 2452. Figure 11(a) visually depicts the vehicle’s location at each moment throughout the driving task.

Figure 10.

Planned route for the assigned driving task. The planned route, indicated by the green lines, encompassed by road IDs 2392, 2391, 2431, 2432, 2289, 2655, 2446, 2444, 2365, 2361, 2367, and 2452.

Figure 11.

Completion of the assigned driving task. From top to bottom, the five panels (a–e) represent the road ID, longitudinal displacement, motivation, action, and longitudinal velocity versus time of the autonomous vehicle, respectively.

Beginning at $t = 0$ s, the vehicle operates under dominant motivation, selecting action $a_{adj_lmt}^{cs}$ to increase its speed to approximately $10.8$ m/s. Subsequently, it transitioned to an unmotivated state and opted for the action $a_{mnt_unch}^{cn}$ . At $t = 86.94$ s, as the vehicle reached the end of Road 2392, it entered the order motivation and decelerated to a speed of $0$ m/s using action $a_{stop_slw}^{cs}$ . Given the absence of other vehicles at the intersection of Road 2391, the vehicle returned to dominance motivation, initiating acceleration through action $a_{adj_lmt}^{cs}$ . There was an approximately $0.4$ s during which the vehicle entered the achievement motivation. However, upon entering Road 2431, it reverted to an unmotivated state and continued driving via action $a_{mnt_unch}^{cn}$ . Because no other vehicles were involved in the scenario and only one autonomous vehicle was present, this cycle was repeated until the vehicle reached its destination, Road 2452. The entire journey lasted for approximately 11.5 min. In Figure 11, from top to bottom, the five panels represent the road ID, longitudinal displacement, motivation, action, and longitudinal velocity versus time of the autonomous vehicle, respectively.

Handling unsignalized traffic junctions

Four autonomous vehicles, namely $ve h_{1000}, ve h_{1001}, ve h_{1002},$ and $ve h_{1003},$ approached the intersection on different roads, as depicted in Figure 12. The routes followed by each vehicle are as follows:

- $ve h_{1000}$ departed from Road 5277, proceeded straight through the intersection, and entered Road 5154;

- $ve h_{1001}$ departed from Road 5271, turned right at the intersection, and entered Road 5277;

- $ve h_{1002}$ departed from Road 5154, turned left at the intersection, and entered Road 5165; and

- $ve h_{1003}$ departed from Road 5165, turned right at the intersection, and continued on Road 5154.

Figure 12.

Routes followed by each vehicle. $ve h_{1000}$ departed from Road 5277, proceeded straight through the intersection, entered Road 5154, $ve h_{1001}$ departed from Road 5271, turned right at the intersection, and entered Road 5277, $ve h_{1002}$ departed from Road 5154, turned left at the intersection, and entered Road 5165; and $ve h_{1003}$ departed from Road 5165, turned right at the intersection, and continued onto Road 5154.

Each of the four vehicles began their journey at 4 s intervals. Upon starting, each vehicle entered the dominance motivation, increasing its speed to approximately $10 m / s$ using the action $a_{adj_lmt}^{cs}$ . Subsequently, it transitioned into an unmotivated state and performed the action $a_{mnt_unch}^{cn}$ .

At $t = 20.32 s$ , $ve h_{1002}$ entered order motivation and initiated deceleration using $a_{sto p_{s} lw}^{cs}$ . Because no other vehicles were present at the intersection, $ve h_{1002}$ transitioned into dominance motivation and accelerated using $a_{adj_lmt}^{cs}$ . Concurrently, $ve h_{1001}$ approached the intersection and entered the order motivation, starting to decelerate with $a_{stop_slw}^{cs}$ . At $t = 28.98 s$ , $ve h_{1002}$ exited the intersection onto Road 5165, allowing $ve h_{1001}$ to transition into dominance motivation, increase its speed, and enter the intersection. At $t = 28.52 s$ , $ve h_{1000}$ entered the order motivation and began to slow down, reaching a complete stop at $t = 33.08 s$ , awaiting the departure of $ve h_{1001}$ from the intersection, as illustrated in Figure 13 $(a)$ and $(b)$ .

Figure 13.

Top-view screenshots of (a) $ve h_{1000}$ , (b) $ve h_{1001}$ , (c) $ve h_{1002}$ , and (d) $ve h_{1003}$ . $ve h_{1000}$ was waiting the departure of $ve h_{1001}$ from within the intersection.

Subsequently, at $t = 33.1 s$ , $ve h_{1000}$ commenced accelerating and entered the intersection. After a delay of approximately $2.5 s$ , $ve h_{1003}$ arrived at the intersection. It entered the order motivation and initiated deceleration using $a_{stop_slw}^{cs}$ , whereas $ve h_{1000}$ proceeded straight through the intersection.

By $t = 39.52 s$ , $ve h_{1000}$ transitioned to dominance motivation, accelerated using $a_{adj_lmt}^{cs}$ , and turned right onto Road 5307. At this point, the remaining vehicles entered an unmotivated state and continued their motion using $a_{mnt_unch}^{cn}$ .

From top to bottom, the five panels in Figure 14 show the road ID, longitudinal displacement, motivation, action, and longitudinal velocity versus time for the four vehicles, respectively. The blue dash-dot-dotted, orange dashed, green dotted, and red dash-dotted lines represent $ve h_{1000}$ , $ve h_{1001}$ , $ve h_{1002}$ , and $ve h_{1003}$ , respectively.

Figure 14.

Handling unsignalized traffic junctions. From top to bottom, the five panels (a–e) represent road ID, longitudinal displacement, motivation, action, and longitudinal velocity versus time for the four vehicles. The blue dash-dot-dotted, orange dashed, green dotted, and red dash-dotted lines represent $ve h_{1000}$ , $ve h_{1001}$ , $ve h_{1002}$ and $ve h_{1003}$ , respectively.

Weaving in multilane traffic

From top to bottom, the seven panels in Figure 15 show the longitudinal displacement, motivation, action, longitudinal velocity, lane ID, lateral displacement, and lateral velocity versus time for the six vehicles, respectively. The blue dash-dot-dotted, orange dashed, green dotted, red dash-dotted, purple dense dash-dotted, and pink solid lines represent $ve h_{1000}$ , $ve h_{1001}$ , $ve h_{1002}$ , $ve h_{1003}$ , $ve h_{1004}$ , and $ve h_{1005}$ , respectively. On the multi-lane highway (road 1030), vehicles $ve h_{1000}, ve h_{1001}, ve h_{1002}, ve h_{1003},$ and $ve h_{1004}$ started from different lanes every $6 s$ . During this period, they were in the dominance motivation and accelerated states using $a_{adj_lmt}^{cs}$ . Once the vehicles reached a speed of approximately $10 m / s$ , they transitioned to an unmotivated state and continued their motion by executing action $a_{mn t_{unch}}^{cn}$ .

Figure 15.

Weaving in multilane traffic. From top to bottom, the seven panels (a–g) respectively represent the longitudinal displacement, motivation, action, longitudinal velocity, lane ID, lateral displacement, and lateral velocity versus time for six vehicles. The blue dash-dot-dotted, orange dashed, green dotted, red dash-dotted, purple dense dash-dotted, and pink solid lines represent $ve h_{1000}$ , $ve h_{1001}$ , $ve h_{1002}$ , $ve h_{1003}$ , $ve h_{1004}$ , and $ve h_{1005}$ , respectively.

At $t = 30.04 s$ , $ve h_{1005}$ departed from the $- 3$ lane and entered dominance motivation, accelerating through $a_{adj_lmt}^{cs}$ . At $t = 33.52 s$ , $ve h_{1005}$ transitioned to an unmotivated state, maintaining its motion for only $0.18 s$ before noticing $ve h_{1004}$ ahead in the same lane. It then reverted to the dominance motivation and changed lanes from the $- 3$ lane to the $- 2$ lane using the action $a_{obs_\ln}^{cl}$ . After adjusting its speed with $a_{adj_lmt}^{cs}$ , $ve h_{1005}$ entered an unmotivated state and continued its movement.

At $t = 46.4 s$ , $ve h_{1005}$ noticed $ve h_{1003}$ ahead and entered dominance motivation, expecting to alert the other vehicle to increase its speed through action $a_{alert}^{cn}$ . In response, $ve h_{1003}$ entered the relatedness motivation and increased its speed from $9.68 m / s$ to $9.96 m / s$ using $a_{ad j_{l} mt}^{cs}$ . However, the speed increase was not sufficiently significant, prompting $ve h_{1005}$ to decide to change lanes again. Upon detecting that $ve h_{1005}$ intended to switch to its lane, $ve h_{1004}$ entered relatedness motivation and executed the yielding action $a_{plt_dec}^{cs}$ to decrease its speed and maintain a safe distance. At $t = 53.2 s$ , $ve h_{1005}$ successfully changed lanes back to the $- 3$ lane, followed by $ve h_{1004}$ , which resumed its speed to $10.5 m / s$ and returned to the unmotivated state.

At $t = 65.52 s$ , $ve h_{1005}$ caught up with $ve h_{1001}$ ahead and, in dominance motivation, attempted to alert $ve h_{1001}$ to increase its speed through action $a_{alert}^{cn}$ . $ve h_{1001}$ immediately entered relatedness motivation and increased its speed using $a_{adj_lmt}^{cs}$ , although the increase in speed was not significant. $ve h_{1005}$ then utilized the action $a_{obs_\ln}^{cl}$ to change lanes from the $- 3$ lane back to the $- 2$ lane, as illustrated in Figure 16 $(g)$ . At $t = 78.26 s$ , $ve h_{1005}$ caught up with the first departing vehicle $ve h_{1000}$ . Without alerting the preceding vehicle, $ve h_{1005}$ directly changed lanes from the $- 2$ lane back to the $- 3$ lane using action $a_{obs_\ln}^{cl}$ . $ve h_{1001}$ , located behind, noticed the upcoming merge of $ve h_{1005}$ and entered the relatedness motivation, executing the deceleration yielding action $a_{plt_dec}^{cs}$ to maintain distance from $ve h_{1005}$ . At $t = 80.28 s$ , $ve h_{1001}$ entered dominance motivation again and resumed acceleration. By $t = 83.08 s$ , $ve h_{1005}$ completed the lane change, whereas $ve h_{1001}$ returned to a speed of $9.56 m / s$ by $t = 83.42 s$ .

Figure 16.

Top view screenshots of (a) all vehicles, (b) $ve h_{1000}$ , (c) $ve h_{1001}$ , (d) $ve h_{1002}$ , (e) $ve h_{1003}$ , (f) $ve h_{1004}$ , and (g) $ve h_{1005}$ . Green arrow in (a) indicates the direction of all the vehicles’ movement. $ve h_{1005}$ was changing lane from the $- 3$ lane back to the $- 2$ lane.

Conclusion

We summarize and analyze the five fundamental driving needs underlying driver behavior, namely, safety, dominance, achievement, order, and relatedness, based on various psychological needs theories and research on driving psychology. Subsequently, an interpretable autonomous driving decision-making model is proposed. The model consists of three modules: needs assessment, motivation generation, and behavior selection. The needs assessment module continuously monitors the levels of various needs in real-time. The motivation generation module compares the priority of urges sent by the needs assessment module and determines the urge with the highest priority as the motivation. Finally, the action behavior selection module determines the actions that align with motivation. The experiments were conducted using a self-developed 2D simulator based on the Unity software. All vehicles in the simulation were equipped with the same decision-making model, but with different preferences. The results provide intuitive observations of the motives and behaviors of autonomous vehicles and demonstrate their effectiveness in various driving tasks, including single-lane driving, intersections, and multi-vehicle interactions.

However, the existing approach for behavior selection is based on a behavior tree, which is a relatively rigid planning method. Once the structure is established, the method of behavior selection remains unchanged. Future research endeavors may explore alternative methods that offer greater flexibility, introducing a higher level of uncertainty in behavior selection.

Footnotes

Handling Editor: Sharmili Pandian

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Yanfeng Li

References

Sun

Zhan

, et al. Interpretable modelling of driving behaviors in interactive driving scenarios based on cumulative prospect theory. In: 2019 IEEE intelligent transportation systems conference (ITSC), Auckland, New Zealand, 27–30 October 2019, pp.4329–4335. New York: IEEE.

Wang

Gao

Yuan

, et al. Interpretable decision-making for autonomous vehicles at highway on-ramps with latent space reinforcement learning. IEEE Trans Veh Technol 2021; 70: 8707–8719.

Salvucci

. Modeling driver behavior in a cognitive architecture. Hum Factors 2006; 48: 362–380.

Maslow

. A theory of human motivation. Psychol Rev 1943; 50: 370.

Reiss

. Multifaceted nature of intrinsic motivation: the theory of 16 basic desires. Rev Gen Psychol 2004; 8: 179–193.

Schwarting

Alonso-Mora

Rus

. Planning and decision-making for autonomous vehicles. Annu Rev Control Robot Auton Syst 2018; 1: 187–210.

Leonard

How

Teller

, et al. A perception-driven autonomous urban vehicle. J Field Robot 2008; 25: 727–774.

Urmson

Anhalt

Bagnell

, et al. Autonomous driving in urban environments: boss and the urban challenge. J Field Robot 2008; 25: 425–466.

Hubmann

Becker

Althoff

, et al. Decision making for autonomous driving considering interaction and uncertain prediction of surrounding vehicles. In: 2017 IEEE intelligent vehicles symposium (IV), Los Angeles, CA, USA, 11–14 June 2017, pp.1671–1678. New York: IEEE.

10.

Hoel

Driggs-Campbell

Wolff

, et al. Combining planning and deep reinforcement learning in tactical decision making for autonomous driving. IEEE Trans Intell Veh 2019; 5: 294–305.

11.

Deshpande

Vaufreydaz

Spalanzani

. Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network. In: 2020 16th international conference on control, automation, robotics and vision (ICARCV), Shenzhen, China, 13–15 December 2020, pp.428–433. New York: IEEE.

12.

Brechtel

Gindele

Dillmann

. Probabilistic decision-making under uncertainty for autonomous driving using continuous POMDPs. In: 17th international IEEE conference on intelligent transportation systems (ITSC), Qingdao, China, 8–11 October 2014, pp.392–399. New York: IEEE.

13.

Pomerleau

. Alvinn: an autonomous land vehicle in a neural network. Adv Neural Inf Process Syst 1988; 1: 305–313.

14.

Bojarski

Del Testa

Dworakowski

, et al. End to end learning for self-driving cars. arXiv preprint arXiv:1604.07316, 2016.

15.

Bansal

Krizhevsky

Ogale

. Chauffeurnet: learning to drive by imitating the best and synthesizing the worst. arXiv preprint arXiv:1812.03079, 2018.

16.

Chen

Yuan

Tomizuka

. Deep imitation learning for autonomous driving in generic urban scenarios with enhanced safety. In: 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS), Macau, China, 3–8 November 2019, pp.2884–2890. New York: IEEE.

17.

Sadat

Ren

Pokrovsky

, et al. Jointly learnable behavior and trajectory planning for self-driving vehicles. In: 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS), Macau, China, 3–8 November 2019, pp.3949–3956. New York: IEEE.

18.

Singh

Shrestha

Albarella

. Bi-level optimization augmented with conditional variational autoencoder for autonomous driving in dense traffic. In: 2023 IEEE 19th international conference on automation science and engineering (CASE), Auckland, New Zealand, 26–30 August 2023, pp.3949–3956. New York: IEEE.

19.

Wang

Khajepour

Cao

, et al. Ethical decision making in autonomous vehicles: challenges and research progress. IEEE Intell Transp Syst Mag 2020; 14: 6–17.

20.

Claussmann

Revilloud

Glaser

, et al. A study on al-based approaches for high-level decision making in highway autonomous driving. In: 2017 IEEE international conference on systems, man, and cybernetics (SMC), Banff, AB, Canada, 5–8 October 2017, pp.3671–3676. New York: IEEE.

21.

Grigorescu

Trasnea

Cocias

, et al. A survey of deep learning techniques for autonomous driving. J Field Robot 2020; 37: 362–386.

22.

Sun

. The motivational and metacognitive control in CLARION. In: Gray

(ed.) Integrated models of cognitive systems. Oxford: Oxford University Press, 2007, pp.63–75.

23.

Bach

. Principles of synthetic intelligence PSI: an architecture of motivated cognition. Vol. 4. New York, NY: Oxford University Press, 2009.

24.

McDougall

An introduction to social psychology. London: Psychology Press, 2015.

25.

Murray

McAdams

. Explorations in personality. Oxford: Oxford University Press, 2007.

26.

McClelland

. Human motivation. Cambridge: CUP Archive, 1987.

27.

Alderfer

. An empirical test of a new theory of human needs. Organ Behav Hum Perform 1969; 4: 142–175.

28.

Reiss

. Who am I? The 16 basic desires that motivate our behavior and our personality. New York, NY: Penguin Putnam, 2000.

29.

Ryan

Deci

. Self-determination theory and the facilitation of intrinsic motivation, social development, and well-being. Am Psychol 2000; 55: 68.

30.

Taubman-Ben-Ari

. Motivational sources of driving and their associations with reckless driving cognitions and behavior. Eur Rev Appl Psychol 2008; 58: 51–64.

31.

Boer

Hoedemaeker

. Modeling driver behavior with different degrees of automation: a hierarchical decision framework of interacting mental models. In: Proceedings of the 17th European annual conference on human decision making and manual control, 14–16 December 1998, pp.63–72.

32.

Sun

Motivational representations within a computational cognitive architecture. Cognit Comput 2009; 1: 91–103.

33.

Lee

. Fifty years of driving safety research. Hum Factors 2008; 50: 521–528.

34.

Näätänen

Summala

. A model for the role of motivational factors in drivers’ decision-making*. Accid Anal Prev 1974; 6: 243–261.

35.

Summala

. Towards understanding motivational and emotional factors in driver behaviour: comfort through satisficing. In: Cacciabue

(ed.) Modelling driver behaviour in automotive environments: critical issues in driver interactions with intelligent transport systems. London: Springer, 2007, pp.189–207.

36.

Summala

. Traffic psychology theories: towards understanding driving behaviour and safety factors. In: International conference of traffic and transport psychology, Nottingham, England, 2005.

37.

Erdmann

. SUMO’s lane-changing model. In: Behrisch

Weber

(eds) Modeling mobility with open data: 2nd SUMO conference 2014Berlin, Germany, May 15–16, 2014. Cham: Springer International Publishing, 2015, pp.105–123.

38.

Parkes

Franzen

. Driving future vehicles. London: CRC Press, 1993.

39.

Vechione

Balal

Cheu

. Comparisons of mandatory and discretionary lane changing behavior on freeways. Int J Transp Sci Technol 2018; 7: 124–136.

40.

Varet

Granié

Carnis

, et al. The role of perceived legitimacy in understanding traffic rule compliance: a scoping review. Accid Anal Prev 2021; 159: 106299.

41.

Björklund

Åberg

. Driver behaviour in intersections: formal and informal traffic rules. Transp Res Part F Traffic Psychol Behav 2005; 8: 239–253.

42.

Bach

. Modeling motivation in MicroPsi 2. In: Bieger

Goertzel

Potapov

(eds) Artificial general intelligence: 8th international conference, AGI 2015, AGI 2015, Berlin, Germany, July 22–25, 2015, Proceedings 8. Cham: Springer International Publishing, 2015, pp.3–13.

43.

Michon

. A critical view of driver behavior models: what do we know, what should we do? In: Evans

Schwing

(eds) Human behavior and traffic safety. Boston, MA: Springer, 1985, pp.485–524.

44.

. Motion planning for dynamic scenario vehicles in autonomous-driving simulations. IEEE Access 2023; 11: 2035–2047.

45.

Juliani

Berges

Teng

, et al. Unity: a general platform for intelligent agents. arXiv preprint arXiv:1809.02627, 2018.

An interpretable decision-making model for autonomous driving

Abstract

Keywords

Introduction

Related work

Decision-making model

General structure

Needs assessment

Urge for safety

Urge for dominance

Urge for achievement

Urge for order

Urge for relatedness

Motivation generation

Behavior selection

Alternative behaviors and actions

Changing speed

Changing lane

Mandatory lane changing

Discretionary lane changing

Random lane changing

Aborting lane changing

Maintaining unchanged

Behavioral decision for each motivation

Actions for safety motivation

Actions for dominance motivation

Actions for achievement motivation

Actions for order motivation

Actions for relatedness motivation

Unmotivated

Experiments

Completion of assigned driving task

Handling unsignalized traffic junctions

Weaving in multilane traffic

Conclusion

Footnotes

Declaration of conflicting interests

Funding

ORCID iD

References