Abstract
Objective
To develop and validate a computational framework that infers individualized attention strategies and latent distraction states to support personalized modeling of multitasking behavior and intervention.
Background
Driver distraction from in-vehicle systems is a growing safety concern. However, the level of distraction is often latent and varies significantly across individuals. Existing models typically overlook these differences, limiting their effective use for personalized interventions.
Method
We introduce a Partially Observable Semi-Markov Decision Process (POSMDP) to model hidden attentional dynamics and attention allocation decisions. Using behavioral data, including glance behavior, velocity, and pupillometry, from a high-fidelity driving simulator with 18 participants, we estimate personalized reward functions that reflect each driver’s subjective valuation of secondary task utility versus safety cost.
Results
The method accurately infers distraction states and recovers participant-specific utility weights governing the trade-off between secondary task benefit and driving cost. Compared to a well-established 2-s glance rule, it improves detection of distraction events and reveals individual variability in attention strategies. Some drivers exhibit highly conservative profiles, while others assign greater value to secondary tasks, even under high distraction. Counterfactual simulations show how perceived task importance could modulate visual attention behavior across individuals.
Conclusion
Our POSMDP-based framework provides an interpretable and individualized account of driver attention allocation, capturing both latent states and behavioral variability.
Application
This model enables the development of personalized, risk-sensitive driver assistance systems that adapt to individual attention strategies, enhancing road safety through context-aware, graded interventions.
Introduction
Despite advances in vehicle automation, human drivers are expected to continue playing a key supervisory role in automated systems for the foreseeable future (Kashevnik et al., 2021). The growing integration of technology in in-vehicle information systems (IVIS) has raised concerns about driver distraction, which is increasingly recognized as a major contributor to vehicle accidents and fatalities (Zhong et al., 2024). According to multiple resource theory (Wickens, 2008), driving and secondary (non-driving related) tasks compete for limited cognitive resources across visual, auditory, manual, and cognitive domains. Distraction increases when both tasks draw from the same resource pool, such as visual input for driving and screen-based IVIS interactions. In Salvucci and Taatgen (2008), the authors conceptualize multitasking as threads of cognitive activity that compete for bottleneck resources (e.g., declarative memory and procedural execution). This theory provides an account of how drivers are able to manage multiple tasks at the expense of some performance degradation.
In this context, understanding the driver’s internal attentional state, whether fully engaged with the driving task or partially distracted, is critical for evaluating its impact on driving performance and safety. We specifically focus on visual–manual distraction arising from interactions with IVIS, rather than auditory or purely cognitive distraction. Studies have shown that drivers differ in how they choose to engage with IVIS. For example, some drivers adopt a cautious approach, while others interact with systems impulsively or without much deliberation (Lerner et al., 2008). Sometimes, drivers may even prioritize secondary tasks during driving, particularly when these tasks (e.g., responding to a work email) seem more engaging or time-sensitive (Lee, 2014). These behavioral differences raise important questions for human factors research: How do drivers assess the cost or risk of engaging in secondary tasks while driving? And do drivers with different decision-making styles exhibit distinct patterns of distraction or task-switching behavior? Quantifying driver decisions can help develop personalized and targeted distraction mitigation strategies, such as optimizing user interface design, tailoring public education and driver training programs, and determining appropriate criteria for warnings (Lerner et al., 2008).
Answering these questions requires accurately measuring distraction; however, this is inherently difficult due to the latent and context-dependent nature of attentional states during driving. For instance, not all off-road glances are distracting, mirror checks or speedometer glances are necessary for safe driving and should not be misclassified as distraction (Schmitt et al., 2016). When multitasking, drivers often fail to recognize their attentional state due to reduced self-monitoring, hindering both real-time adjustments and learning about their multitasking limits (Sanbonmatsu et al., 2016; Starkey & Charlton, 2020; Watson & Strayer, 2010). These observations highlight that drivers may not be fully aware of their distraction state. Instead, drivers rely on their own beliefs about their attentional state, inferred from indirect cues such as changes in driving performance, to guide their decisions regarding engagement in secondary tasks.
In this paper we develop a Bayesian model of subjective perception of hidden/latent (or partially observable) attentional states that evolve according to a controlled Semi-Markov process. The proposed model captures how drivers allocate attention and manage multitasking under varying levels of distraction. Using noisy behavioral signals, including glance behavior, vehicle dynamics, and pupillometry, we develop a two-stage structural estimation procedure to infer latent distraction levels and driver-specific decision policies, and to estimate how perceived rewards associated with secondary task engagement evolve as distraction increases. To test the model, we designed a driving simulator-based experiment involving human participants to observe interactions with IVIS under controlled task conditions. With the experiment data, we verify that the proposed model (i) provides a quantitative account of how latent attentional states evolve and (ii) exhibits improved detection performance when compared with the widely accepted baseline. More importantly, the proposed model helps to understand individual differences by estimating driver-specific utility weights that reflect how each participant weighs the benefits of secondary task engagement against the cost to driving safety. Finally, to confirm that our findings from participant-level estimation are robust, we also estimated a Hierarchical Bayesian version of the model. This analysis corroborates the main results, confirming the same systematic trade-offs and behavioral patterns at the population level. These utility profiles enable simulation of behavioral sensitivity, demonstrating that increases in perceived secondary task importance lead to prolonged off-road glances. Given that individual differences in multitasking strategies are well-established (Jokinen et al., 2021) and that driver assistance technologies increasingly emphasize personalization (Yi et al., 2019), our approach offers a framework for capturing individualized patterns of attention allocation and informing targeted, graded interventions based on driver profiles.
This paper is structured as follows. We begin by reviewing related computational models of driver distraction. Next, we introduce the architecture and computational principles of our proposed model. We then detail the experimental design and data collection procedures used to empirically validate the model with human participants in a simulated driving environment. Finally, we demonstrate the model’s capability to assess behavioral sensitivity through simulations exploring how drivers adjust attention allocation in response to changing task demands and perceived task importance.
Background and Related Work
Driver Distraction
The Task-Capability-Interface (TCI) model (Fuller, 2005) and Intelligent Driver Model (IDM) (Treiber et al., 2000) are widely used in driver distraction research. TCI examines the dynamics between the driving demands and driver capabilities, while IDM models the effects of distractions on car-following and traffic flow. These models are often combined to study driver distraction (Hoogendoorn, Van Arem, & Hoogendoorn, 2013; Saifuzzaman et al., 2015). Another school of driver models use task-independent cognitive architectures to model human perception, cognition, and motor responses. ACT-R-based models, such as Distract-R (Salvucci, 2005), predict distraction effects, while the Queueing Network–Model Human Processor (QN-MHP) quantifies cognitive workload and visual attention allocation (Feng et al., 2017). These models assess distraction impacts on perceptual-motor (Dario D. Salvucci, 2001) and cognitive tasks (Salvucci, 2019) but do not provide a structural account of how attentional states vary over time.
Several data-driven approaches based on machine learning (ML) have been employed to predict driver visual-manual distraction. Common algorithms include supervised learning models, such as support vector machines (Ersal et al., 2010; Liang et al., 2007; Li et al., 2017), decision trees (McDonald et al., 2020; Zhang et al., 2004), hidden Markov models (Kanaan et al., 2019), Bayesian networks (Liang & Lee, 2014), and deep-learning architectures (Huang et al., 2021; Masood et al., 2020; Ou & Karray, 2019; Tran et al., 2018; Wollmer et al., 2011; Xing et al., 2019). While effective, they do not formally model decision-making processes in driving scenarios where uncertainty arises from limited attention. This is a crucial consideration, especially when multiple tasks vie for the same cognitive resources. In what follows, we discuss the frameworks to study such decision-making processes.
Partially Observable Markov Decision Processes
The Partially Observable Markov Decision Process (POMDP) framework (e.g., Krishnamurthy, 2016) models a forward-looking utility-maximizing agent operating in an environment with unobservable states, where the evolution of these states follows a hidden Markov model that is determined (or controlled) by the agent’s actions. Because the state is not directly observable, the agent maintains a belief, a probability distribution over hidden state space, that serves as a sufficient statistic for decision-making under uncertainty. With observed data in the form of sequences of observations and actions, the structure of the POMDP (i.e., reward function, state transition probabilities, and observation probabilities) can be estimated (Chang et al., 2023). POMDPs have been applied to infer driver perception during lane-keeping with secondary tasks (Schmitt et al., 2016), model multitasking behavior in hierarchical reinforcement learning frameworks (Jokinen et al., 2021), and support AV-related challenges such as driver attentiveness during perception hand-offs (Pant et al., 2022), motion planning, and intent prediction (Sadigh et al., 2016; Li, Zhao, & Wang, 2022; Danesh et al., 2023). More recently, active inference formulations of POMDPs have been introduced to model adaptive human driving and automated vehicle takeovers, demonstrating how goal-directed and uncertainty-resolving behaviors emerge from the principle of expected free energy minimization (Engström et al., 2024; Wei et al., 2025). While a POMDP model accounts for unobserved states that affect decision making, it only allows for memoryless dynamics: that is, the distribution of the sojourn time (the number of consecutive time steps spent in a state before transitioning to another) is independent of the time spent in that state. However, attentional state dynamics in a driving task are not memoryless. For instance, driver inattention persists for varying durations influenced by secondary task demands and external stimuli (Arvin & Khattak, 2020). In this paper, we use a semi-Markov model to realistically represent driver behavior, where semi-Markov state dynamics allows general state sojourn time distributions. The semi-Markov model has been used to describe latent human psychological state dynamics to model mental fatigue (Wang et al., 2022), to describe driver response time to alerts (Hwang et al., 2020) or waiting periods when interacting with pedestrians (Zhang & Fricker, 2020). Given these advantages, we incorporate semi-Markov unobserved state dynamics into POMDP framework, resulting in a so-called Partially Observable Semi-Markov Decision Process (POSMDP) to better capture the temporal variability of driver attentional states and decision making in dual-task scenarios.
A Model of Distraction and Attention Dynamics
Standing Assumptions
To capture key behavioral regimes, we simplify the hidden driver state of attention to low distraction (
The action
We make three standing assumptions (formal definitions provided in Appendix A):
(Conditional Independence): Given the current state, duration, and action, the next attentional state, duration, and observations are conditionally independent.
(Sojourn time distribution): The time a driver remains in a given attentional state follows a binomial distribution, capturing discrete durations consistent with our experimental sampling.
(Observation probabilities): Observable variables (velocity and PCPS) are assumed to follow a state-dependent multivariate normal distribution. For tractability, we assume independence and set the covariance terms to zero because we examined the empirical correlation between velocity and PCPS using the Pearson correlation coefficient and found a statistically significant but weak negative correlation (
Thus, the intertemporal transition of both the attentional state and its associated duration is governed by transition probabilities, expressed as
A two-state model (low distraction vs. high distraction) was adopted for both theoretical and empirical reasons. Conceptually, the binary representation aligns with how visual–manual distraction is characterized in the human factors literature, as transitions between attentive and distracted modes of control (He et al., 2023; Jokinen et al., 2021). Experimentally, the secondary task (map search) was designed to induce distinct episodes of distraction rather than graded engagement levels, making a two-state formulation consistent with the task structure. Moreover, preliminary analyses with a three-state variant produced an additional state reflecting minor fluctuations in driving behavior, but this state was not stable or interpretable across participants. For parsimony and interpretability, we therefore retained the two-state model.
Model of Distracted Driver: Bayesian Inference
To model how a driver interprets incoming information and updates their understanding of their attentional state, we adopt a Bayesian framework. In this view, the driver is treated as a Bayesian agent who maintains an internal belief about whether they are focused or distracted, based on the history of behavior and observations. Let
Intuitively, equation (1) describes how the driver integrates past information with the most recent observation to update their internal belief of attentional focus. For instance, after a brief off-road glance (
Model of Distracted Driver: Attention Reward vs Cost Tradeoffs
After updating their belief about the current attentional state, the driver selects an action based on that belief, deciding how much attention to allocate to the primary driving task versus the concurrent secondary task. When a secondary task appears (e.g., visual search prompt on the in-vehicle system), the driver must divide attention between the primary driving task (e.g., maintaining a safe following distance) and the secondary task. Given that visual attention has a limited capacity (Reimer & Schubert, 2020), the driver is modeled as a rational agent aiming to maximize expected reward by balancing the potential benefits of completing the secondary task against the cost and risks of compromised driving performance (see Jokinen et al., 2021). We thus define the one-step reward
Model of Distracted Driver: Attention Allocation Policies
Based on the driver’s preferences captured by the reward function
To summarize, the driver chooses an action based on the decision policy in equation (5), which maximizes the expected total discounted reward for allocating visual attention resources given the competing demands of driving safely and managing other tasks or information sources.
Estimation
We estimated the model parameters using maximum likelihood estimation (MLE). Specifically, the goal was to determine the parameter sets governing the dynamics of driver distraction,
The detailed likelihood formulation is provided in Appendix B. The estimation is performed in two stages. In the first stage, we estimated the parameters
Experiment Design
Experimental Apparatus and Environment
Driving Simulator
The study utilized a high-fidelity driving simulator (Figure 1) by Realtime Technologies, Inc. The simulator featured a 300° field of view, a four camera video capturing system and driving behavior data was captured at a sampling rate of 60 Hz. SimCreator DX software was employed to design the scenarios. To simulate interactions with IVIS, participants were required to perform non-driving related tasks on an iPad. Specifically, the Google Maps application was used to replicate typical IVIS interactions and potential driver distractions. The experimental apparatus and environment.
Eye Tracking System
To collect driver pupil data, the study employed a Pupil-core eye tracking system manufactured by Pupil Labs, Germany (Figure 1). This system comprised three key components: a world camera and two eye cameras. The eye cameras utilized 3-D models to detect and track pupil movements with high precision. Calibration of the pupil tracking was achieved using Apriltag markers (Figure 2). These markers were positioned on the central driving screen, enabling the system to distinguish between on-path and off-path glances. Experimental setup showcasing construction zone and eye-tracking markers.
Environment Design
The experimental scenario featured a four-lane highway, with two lanes allocated for each direction of travel. The total simulated driving distance spanned approximately 15 miles and approximately 12 min to complete. Ambient vehicles were added in the inner lane to simulate realistic traffic conditions. Traffic density was set to represent stable flow conditions, where drivers are influenced by the presence of other vehicles, but desired speeds remain unaffected.
To enhance engagement and realism, the route incorporated a construction zone setting for 15% of the route (Figure 2). The route also included two curved sections to reflect typical highway geometry.
Primary Driving Task
The study’s primary task was designed to simulate realistic highway driving conditions, focusing on vehicle following behavior. Participants were instructed to follow a lead vehicle traveling at 70 mph in the outer lane, while maintaining a safe relative distance throughout the experiment (Figure 1). An auditory alert system ensured adherence to a 1.75 s headway threshold (
Secondary Task: Simulated In-Vehicle System Interaction
To simulate in-vehicle distractions, participants performed intermittent map search tasks requiring extended glances away from the road. These tasks, designed to require high-attention and be cluttered (Pankok & Kaber, 2018), involved both listening to routine navigation questions and interacting with the system to find answers (Figure 3). Questions included: (1) “Can you find the gas station with the lowest gas price?” (2) “Can you find the CVS pharmacy that is open now?” (3) “Can you find the closest gas station?” (defined as the gas station with the shortest time deviation from the current route, shown in minutes in Figure 3(b)). Secondary task interface for map search tasks.

The procedure involved three steps (see Figure 4): first, participants clicked on the search icon to access options (Step 1). Options include categories such as gas stations, pharmacies, and restaurants, similar to those in common navigation apps. Next, they selected the relevant category, either “gas station” for questions 1 and 3 or “pharmacy” for question 2 (Step 2). Finally, they clicked on the correct choice from the available options to complete the task (Step 3). The IVIS task was limited to the three questions listed above. These same three questions were used in both practice and experimental drives, with randomized order within each drive. IVIS content was updated dynamically so the correct answer for each question varied across drives and across repeated presentations of the same question. Secondary tasks were presented randomly, with a minimum interval of 60 s between tasks. Studies have shown that the effects of distraction can persist for 10–40 s after the distracting task ends (Bowden et al., 2019; Strayer et al., 2016). Our 60-s interval was chosen conservatively to ensure adequate recovery time across various individual differences and distraction types. To introduce variability in when tasks appeared, we added a random delay (0–5 s) to the 60-s base interval. Secondary task operations: (1) Participants click on the search icon, (2) they select the relevant category, either “gas station” for questions 1, 3 or “pharmacy” for question 2, and (3) they click on the correct choice from the options provided to complete the task. Step 3 varies based on the specific question being asked, as illustrated in Figure 3.
Based on trials of our secondary task setup, we found the average task completion time was 12 s. To standardize the experiment, we set a 15-s time limit per task, measured from the question onset. An auditory notification marked the end of the task, prompting participants to disengage from the IVIS and refocus on driving.
Procedure
The experiment began with participants completing an informed consent form. A 2-min baseline pupil size measurement was collected using an eye tracker. Participants then trained on the simulator and completed a trial run of the secondary task. The main experiment included four 12-min driving sessions, with 5–10-min breaks to reduce fatigue. Participants were instructed to treat the scenario as real-world driving, with no prioritization guidance for driving or secondary tasks. After the sessions, a final 2-min baseline measurement was taken, followed by a questionnaire collecting demographics. The average of the two baselines was used to calculate percent changes in pupil size during driving tasks (Zahabi et al., 2023).
Participants
We collected repeated driving sessions from each participant, providing the detailed participant-specific trajectories needed for our computational model and allowing us to recover participant-specific behavioral profiles. This focus aligns with recent POMDP computational modeling frameworks that rely on detailed individual trajectories and modest sample sizes (Jokinen et al., 2021; Schmitt et al., 2016). We also conducted an a priori power analysis in G*Power software (Faul et al., 2007) to determine the required sample size, using
Estimation Results
Data Pre-Processing
Each driving session lasted 12 min and was segmented into three trajectories, using road curves as natural boundaries. Data was not collected during curves to avoid curvature-induced driving variations (Rosas-López et al., 2021; Xu et al., 2022). Data from all sources were synchronized using timestamps and smoothed with a 1-s rolling average. Eye-tracking data with confidence below 95% were filtered out, and trajectories with over 20% missing or corrupted data were excluded from analysis. For model inference, a trajectory corresponded to one straight road segment and consisted of a continuous sequence of 1-s decision epochs, where each epoch contained the driver’s observed variables and actions. On average, for each participant, each trajectory consists of around 250 discrete decision epochs.
We examined participants’ driving and secondary-task performance to provide behavioral context for the subsequent modeling analyses. During baseline (non-IVIS) driving, participants maintained stable car-following performance, with lane offset exceeding the 0.9 m threshold in approximately 5% of samples and time headway exceeding the 1.75 s threshold in 3% of samples on average. No near-crash events were observed, indicating safe and consistent vehicle control. When engaged in the IVIS task, performance degraded moderately, lane-offset exceedances increased to about 16% on average (range: 1–41%), reflecting individual variability in lateral control during distraction. Nevertheless, participants achieved an average IVIS task success rate of 82%, suggesting that they could generally manage both tasks, albeit with measurable costs to vehicle control.
To verify that the observed heterogeneity in our estimates reflects true individual differences rather than estimation noise, we employed two complementary estimation approaches. First, we estimated participant-specific parameters by fitting the POSMDP model independently to each driver’s data. Second, to assess the robustness of these patterns, we implemented a hierarchical Bayesian extension that pools information across drivers to jointly estimate group-level and participant-level parameters (see Appendix E for details). In the sections that follow, we present the results from both approaches side-by-side to demonstrate that the core behavioral structure persists even after hierarchical shrinkage.
Attention Dynamics
In general, partially observable Semi-Markov Decision Processes (POSMDPs) allow the process to transition from a hidden state back to itself at specific transition epochs, that is, when Summary of estimated state transition probabilities across participants.
On-Road Glances and Brief Saccades (
)
Overall, the transitions associated with
Short Off-Road Glances (
)
The transitions associated with
Long Off-Road Glances (
)
The transitions associated with
Observation Distributions
Figure 6(a) visualizes the difference in the estimated mean velocity between the low-distraction and high-distraction states. The mean velocity for Estimated mean velocity for each participant under the low- and high-distraction latent states.
Figure 7(a) illustrates the difference in estimated mean PCPS between the high-distraction and low-distraction states. The low-distraction state ( Estimated mean PCPS for each participant under the low- and high-distraction latent states.
Sojourn Time Distributions
The sojourn time estimates are visualized in Figure 8(a). For the low-distraction state, we have Estimated sojourn time parameters (
Perceived Utility of Attention Allocation
In our framework, drivers are assumed to make attention allocation decisions based on perceived utilities and costs associated with various actions. As formalized earlier (equation (3)), the reward function denotes the trade-off between the benefit
Subjective Ratings of Cost and Benefit for Different Off-Road Glance Durations.
For attention-sensitive tasks, such as reading a long message, entering navigation information, or interpreting a complex visual display, the benefit of sustained attention is high (Strayer et al., 2019), making the relative benefit structure,
Figure 9(a) summarizes the central empirical finding of drivers’ attention strategies. Drivers perceive greater utility in off-road attention when distracted ( State-dependent weights assigned to off-road glances for each participant.
Model Evaluation
Unlike machine learning models that focus on prediction using high-dimensional data, our structural model explains the latent evolution of attentional states, prioritizing interpretability and enabling counterfactual reasoning—features typically lacking in black-box approaches. Given these fundamental differences in goals and design, we do not compare against ML approaches, as, to our knowledge, no existing models offer the same explanatory framework. Instead, we adopt a widely accepted behavioral benchmark, off-road glances exceeding 2 s (Kircher & Ahlström, 2017; Klauer et al., 2006) as a natural and interpretable baseline to evaluate our model.
Comparison of Distraction Detection Performance Between the POSMDP Model and the 2-S Off-Road Glance Baseline, Evaluated on Held-Out Test Data.
Note. Detection rates are reported for secondary task engagements and lane deviations. “Secondary task engagements” denote IVIS interaction periods, and “lane deviations” indicate cases where IVIS interaction coincided with lateral offsets exceeding 0.9–1.0 m. The POSMDP model consistently outperforms the baseline across both event types.
In addition to detection accuracy, we also estimated the lead time, that is, how much earlier the POSMDP model identified distraction events compared to the baseline 2 s rule (for example, see Figure 10). On average, across all test data, our model detected distraction 1.30 s earlier than the baseline. This advance prediction window is critical for enabling timely interventions and mitigating potential risks. While a shorter baseline threshold (e.g., 1.5 s) could detect distraction earlier, such heuristic rules are fixed. In contrast, our model learns a data-driven representation that adapts to the driver’s behavioral context. Illustrative example of lead time estimation.
Counterfactual Simulations for Task Prioritization and Risk Sensitivity
Drivers vary in risk sensitivity, attention allocation strategies, and multitasking capacity, highlighting the need for personalized distraction interventions (Yi et al., 2019). Figure 11 simulates how a driver’s effort allocation would change in response to the increased perceived importance of a secondary task Predicted effect of increasing driver’s perceived importance of secondary task (w).
As the weight increases toward the midrange
At high values of
Additionally, counterfactual simulations can be used to explore how drivers may adjust their attention in response to specific IVIS tasks with varying visual demands. As noted earlier, tasks like reading messages or entering navigation require sustained attention and high The x-axis represents 
The surface increases monotonically along both the
Irrespective of the underlying cause, a driver profile characterized by high weight for secondary task benefit is concerning. Prolonged off-road glances are linked to double the risk of safety-critical events, increased lane deviation, slower braking responses, and reduced situational awareness (Dingus et al., 1989; Klauer et al., 2006; Liang et al., 2012; Victor et al., 2015). Existing research has established mechanisms such as graded warning systems, intervention strategies that escalate in intensity based on the inferred severity of distraction or safety risk (Lee et al., 2004). Our framework lends itself naturally to the development of graded warning systems. For individuals with low priority for secondary task engagement, the system could deliver subtle cues such as a visual prompt or a brief audio tone. Conversely, for drivers with a high
Discussions
Contributions and Relation to Existing Computational Models of Distraction
The goal of this paper was to develop a computational account of how drivers allocate attention between the roadway and secondary IVIS tasks when their level of distraction is uncertain. This perspective differs fundamentally from prior computational models of driver distraction, which have largely focused on cognitive architectures and queuing network formulations of workload. The Distract-R system and the related general executive models (Salvucci, 2005, 2013) adopt a generative, production-rule approach in which a normative “average” agent interleaves tasks based on a goal queue. Although this approach formalizes how task constraints influence multitasking, it relies on simple, fixed rules that fail to explain how adaptive strategies emerge. Similarly, Queuing Network–Model Human Processor (QN-MHP) models distraction and warning responses as a problem of network congestion and information routing (Zhang et al., 2022; Wu & Liu, 2007). In QN-MHP, performance decrements arise when neural entities compete for processing servers, and task urgency is handled via priority classes (derived from lead time thresholds). This approach focuses on capacity limits rather than strategic choices. Consequently, it cannot predict how drivers dynamically adjust their task-interleaving behavior in response to evolving uncertainty.
Our POSMDP framework extends this literature in three key ways. First, we view the driver not merely as a processor subject to network congestion but as a rational agent who optimizes a utility function balancing safety and secondary-task benefit. Second, rather than simulating a normative agent, we estimate the specific reward weights (
Our framework shares conceptual similarities with active inference approaches (Engström et al., 2024; Wei et al., 2025) in its emphasis on inference under uncertainty, but differs in both objective and application. However, the work of Wei et al., 2025 is closest to ours. That study fits an active inference model of decision making using observational data on AV takeovers together with self-reported measures of fatigue, trust, and situation awareness. A key distinction in our experimental design is that we directly manipulate distraction through a controlled secondary task. This manipulation provides an objective proxy for attentional load and thus a form of ground-truth supervision that is not available in purely observational settings, enabling sharper identification and validation of the inferred latent attentional state. In Engström et al., 2024, human driving maneuvers are modeled within an active inference framework as uncertainty-reducing actions, highlighting the role of epistemic objectives in shaping control behavior. The work does not address the task of estimating the parameters of such model. The estimation of an active inference model for car-following task is studied in Wei, Garcia, et al., 2025.
Finally, our model enables counterfactual simulations of how perceived task importance influences glance behavior. The findings have direct implications for the design of intelligent vehicle systems aimed at mitigating distraction. By inferring latent states and individual attention strategies, the framework supports adaptive, human-centered driver assistance systems that deliver graded, context-aware warnings and dynamically manage visually demanding IVIS interactions when distraction risk is high. Individualized utility profiles further allow tailoring interface complexity, feedback, and automation to driver-specific tendencies, improving both the timeliness and acceptance of interventions. Together, these contributions establish an interpretable and personalized modeling framework for understanding driver attention under multitasking.
Limitations and Future Work
We make simplifying assumptions to ensure model tractability, notably by assuming independence between observations (velocity and PCPS) by setting covariance terms to zero. A more sophisticated extension could exploit these interdependencies. Additionally, the current setup relies on velocity and PCPS as indicators of distraction and workload based on the available experimental design. While our model can be extended to incorporate other driving performance measures, those indicators are not significant in our current experimental setup. For example, reaction time is typically assessed using sudden-onset stimuli (e.g., braking lead vehicle) (Strayer et al., 2013). However, our experiment does not have such critical events. Future study could incorporate additional observations to provide a more comprehensive view of driver state. Future work could extend the POSMDP model to capture auditory and/or cognitive distraction by modifying the observation model. All participants in the present study were young male drivers. Although gender was not restricted during recruitment, only male volunteers enrolled and completed the study. It would be valuable and interesting for future research to explore how attention allocation strategies and multitasking behaviors may differ across diverse demographic groups, including various ages, genders, and cultural backgrounds, thus broadening the understanding of driver distraction. Environmental variation, such as differing traffic conditions or road types, could also be integrated to better understand attentional resource allocation in more realistic driving contexts.
An interesting avenue for further research is to infer a larger number of latent driver states, such as fatigue or drowsiness, offering deeper insights into driver behavior. Pursuing these extensions would require experimental protocols tailored to reliably evoke and distinguish among these additional states.
Conclusions
We present a computational framework for modeling and estimating driver attention and multitasking behavior under distraction. By integrating semi-Markov dynamics into a decision-theoretic framework, our model accounts for the temporal variability of attentional states, overcoming the limitations in standard POMDPs. Our framework not only infers the hidden state of the driver but also quantifies the perceived costs and benefits of secondary task completion that evolve with the state of the driver. The results highlight our ability to capture individual differences in multitasking strategies, and structured inference enables the design of context-sensitive, graded interventions that enhance driver safety.
Key Points
• We introduce a personalized driver distraction model using a Partially Observable Semi-Markov Decision Process (POSMDP). • Model estimation recovers individual utility weights that capture how drivers trade off secondary task benefits against driving safety. • POSMDP model outperforms traditional glance-duration thresholds in detecting distraction events and lane deviations. • Our structural model enables counterfactual simulations to predict attention behavior under varying task demands and driver profiles.
Supplemental Material
Supplemental Material - Inferring Hidden Attentional States in Driving: A Bayesian Approach to Modeling Distraction and Secondary Task Engagement
Supplemental Material for Inferring Hidden Attentional States in Driving: A Bayesian Approach to Modeling Distraction and Secondary Task Engagement by Lekhapriya Dheeraj Kashyap, Zhide Wang, Yanling Chang, Maryam Zahabi, and Alfredo Garcia in Human Factors.
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was partially supported by National Science Foundation awards #2048395, #2236477, and Army Research Office (ARO) under grant W911NF-22-1-0213.
Data Availability Statement
Supplemental Material
Supplemental material for this article is available online.
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
