Abstract

This paper presents a real-time driver characterisation algorithm that models the driver based on 10 parameters governing their control of vehicle longitudinal speed and acceleration. It utilises an open-loop longitudinal model in conjunction with an Unscented Kalman Filter (UKF). The algorithm can operate in real-time using velocity and acceleration measurements, in addition to a-priori knowledge of the path curvature and road features such as speed limit locations. The UKF also enables a system identification process to estimate average response of each driver. These identified parameters are subsequently fitted to independent measures of fuel economy and safety to demonstrate possible uses for the characterisations. In addition to the comprehensive characterisation of the drivers’ behaviour the work is also novel in its unique approach to identifying time-based parameters within a time-varying dynamic system (UKF). Possible applications include insurance black box assessments of driver aggressiveness and safety, and in autonomous driving algorithms by providing a model to closely replicate the specific driving styles of various users.

Keywords

Driving modelling, system identification, driver characterisation, parameter estimation, longitudinal speed control

Introduction

Driver behaviour is challenging to model as it is a function of many factors. These include mood, cognitive load, physical health, alertness level, aggressiveness and relative safety as factors specific to the driver, as well as external factors such as weather and traffic conditions. Some of these factors are constant or slowly varying, whereas others vary within each journey.

MacAdam’s significant work in modelling and understanding the human driver placed the greatest importance of the driving control task on the visual and vestibular sensory channels.¹ The former accounts for over 90% of the control task, while in some scenarios, such as during cornering or in conditions of heavy crosswind, the vestibular channel is dominant. This is also supported by other studies,^2,3 and is consistent with the general consensus of a requirement for driver preview and an internal model of the vehicle’s response, as explored in MacAdam,⁴ Ungoren and Peng⁵ and Zhang et al.⁶

The task of driving requires simultaneous lateral and longitudinal control actions; multiple algorithms have been presented for lateral control,^5,7–9 but relatively few for longitudinal control. Most of those lateral control works concentrate on accurate path-following or modelling near-limit race driving, whereas the average driver on the road neither uses the limit nor aims to accurately follow the path centre line. Most papers considering longitudinal control, on the other hand, tend to focus on car following and motorway driving, which is quite different to the natural speed control behaviour of people on country roads in the absence of traffic.

In Kiencke et al.,⁹ the authors split the model into several modes – straight line driving, cornering and the deceleration and acceleration events pertaining to them. While they go into great detail of modelling the discrete events that make up the task of longitudinal driving, they do not provide a means of characterising drivers. With regards to driver speed choice, there are multiple studies that have shown drivers rely on their vestibular system to limit lateral acceleration levels in corners of significant curvature, and that this relationship is nonlinear.¹⁰ Another study¹¹ found that drivers tend to slow down earlier for corners of higher curvature. This paper will present a new approach to modelling these events and also go on to demonstrate how the parameters governing this model can be correlated to over-arching (global) characteristics of the driver.

There are a number of factors that can be used to predict driver behaviour that have been identified (e.g. in Weir and Chao¹²); however, only attitude/personality and experience are considered to be ‘constant’ in the long-term, as opposed to driver state, task demand and situational awareness, which can change even within the same journey. Furthermore, multiple studies^12,13 show that there are inter- and intra-personal differences between drivers and their behaviour.

In terms of driver characterisation, there are two main areas of focus – incident prevention and emissions reduction. Both of these are heavily related to identifying aggressive driver behaviour, and multiple approaches (along with their combinations) have been proposed: anomaly detection, acceleration threshold and machine learning methods.

Identifying driver behaviour is typically done by counting the number of ‘aggressive’ events per drive or per unit distance.¹⁴ Event detection can be achieved through various statistical (Partial Least Squares Regression, Gaussian Mixture Model or K Nearest Neighbors¹⁵) or machine learning methods^16,17 that identify patterns in longitudinal acceleration or find outliers that can be classified as anomalous. Threshold-based methods, on the other hand, make use of fixed or variable acceleration thresholds to detect and count aggressive driving events, for example harsh braking or aggressive turns. However, these are very dependent on road type, traffic conditions, traffic flow and weather.

Aggressive driving by definition involves higher speeds and accelerations, which also require more energy and, hence, fuel. Consequently, in Edwards,¹⁸ the author uses a specific dynamics metric in an attempt to quantify driver aggressiveness. The author highlights that real-world driving emissions differ from drive-cycle test emissions for a variety of reasons, and presents this as a need to develop a measure of ‘aggressiveness’ and to match it between real drives and test cycles.

The algorithm given here is based on a naturalistic driving study, designed to characterise drivers in terms of their speed control in the specific environment of driving on single carriageway, winding country roads in the absence of other traffic. The focus is deliberately narrow; this environment is unique in exposing differences in driving style that are entirely driver-dependent. The purpose is to capture those aspects of vehicle speed control that best characterise differences such as driver confidence, aggression (as expressed by energy wastefulness) and attitude to safety (factors that are difficult to determine in other more constrained environments such as driving in traffic or on clear highways). The method provides continuous identification of the driver, so there are numerous benefits: it can be used to infer over-arching performance, for example in terms of driver safety (for insurance purposes) and energy efficiency, and to determine significant changes of behaviour over time. It also allows for future autonomous systems to better replicate speed control behaviour that the driver finds agreeable.

The paper considers collection of test data and the formulation of a model for forward speed and acceleration in Section 1. Section 2 provides a Kalman filter structure for real-time evolution of those model parameters. Section 3 then considers the use of the same filter for system identification, by repeated iteration over the recorded data, to provide an average characterisation of each driver over an hour of driving. Finally, Section 4 illustrates the capacity of the collected parameters to estimate two examples of over-arching driver performance using a formal regression technique.

Test data

Twenty-two drivers took part in two drives on separate days, with each session lasting around 90 min, in a Ford Focus RS with manual gearbox. A wide range of driver demographics was captured, with ages ranging from 20 to 85; the drivers are ordered by age in Appendix 1. The route concentrated on B class country roads around the village of Bagworth in Leicestershire, UK. These are relatively narrow bidirectional single carriageway roads of between 5 and 6.5 m overall width, with the majority (about 75%) de-restricted to the national speed limit of 60 mph. Significantly, the road visibility and frequency of corners prevent drivers from driving at such a high speed, so we observe a range of maximum speeds and cornering acceleration behaviours. Roads through villages had speed limits of 30 or 40 mph. The route has a diversity of road features, with corners of varying curvature and straights of different length, in addition to narrower and wider sections. A complete map is shown in Figure 1.

Figure 1.

Complete map of routes traversed.

The drivers were asked to drive freely, as they would if they were alone, with the data logger seated in the back and communication restricted to the provision of directions when necessary. Recording was restricted to free driving with no car following, though this information was purposefully kept from the driver to avoid influence on their behaviour. Position, velocity and acceleration data were collected using an OXTS 3200 INS, and steer angle was collected from the vehicle CAN.

The longitudinal model

A time-varying (situation-dependent) open-loop particle model is adopted to represent speed and acceleration events that the driver performs while driving on straight sections, through corners of varying severity, and under the influence of speed limits. The aim is for the model to capture the significant timings and magnitudes of acceleration events, along with straight-line speed, thereby identifying the driving style in a set of well conditioned parameters that can be adapted using a real-time Unscented Kalman Filter. To do this it is assumed that the curvature of the path ahead and the positions of speed limits are known to the model a-priori, and also that the instantaneous position of the vehicle on (and its direction along) the path is known continuously. Current GPS mapping algorithms could be used for the latter, or the position on the path could be estimated by combining the longitudinal model of this paper with the lateral model of Best.¹⁹

For clarity in the discussion below, the model is split into separate accelerations A₁– A₄ based on situational context, and the parameters that will be adapted by the UKF are printed in bold.

Cornering, A₁

The main behaviour that is both characteristic and necessary of all drivers is deceleration on the approach to a corner, combined with acceleration out of it. All but the most novice drivers have developed an internal model of the vehicle response, which combines with their personal driving experience and style to instinctively target a speed through the corner, based on their perception of corner severity, that results in a level of lateral acceleration that they are comfortable with.

The observation in Best²⁰ is that most drivers target a consistent lateral acceleration, usually in the range of 2–4 m/s². This makes for a good means of parametrising driving style. However, the magnitudes and timings of the deceleration and acceleration manoeuvres before and after the corner apex also give insight into the urgency and style of driving.

Figure 2 illustrates raw longitudinal acceleration in grey, along with the chosen model in black. Ignoring the red section of the model for now, we can see that the structure of deceleration and acceleration either side of the corner apex at t = 0 s is approximated by a sequence of three specifically timed hyperbolic tangent functions. Perfect replication of the deceleration/acceleration sequence is impractical – for example, although the driver instigates a distinct deceleration at −5.5 s they decrease the deceleration magnitude in the lead-in to the corner; we can also see a gear-change in the acceleration phase at around +2.5 s. By examination of many instances of driver and corner, the selected model captures the core components that the driver controls, that is, the acceleration timing, the magnitude of deceleration needed to achieve a comfortable lateral acceleration in the corner, and the subsequent acceleration event. Interestingly, we see drivers perform both stages, including the acceleration event, even when corners rapidly follow on from one another, which strongly suggests an open-loop plan of behaviour.

Figure 2.

Cornering behaviour acceleration of test data versus model.

Equation (1) describes the pair of deceleration/acceleration actions, illustrated in black in Figure 2:

A_{1} = \frac{1}{2} (a_{d} + a_{i}) \tanh (η (t_{tc} - t_{d})) + \frac{1}{2} a_{i} + \frac{1}{2} (a_{a} - a_{d}) \tanh (η (t_{tc} - t_{a})) - \frac{1}{2} (a_{a} - a_{f}) \tanh (η (t_{tc} - t_{cr}))

(1)

Although apparently complicated, this formulation simply quantifies the timing and magnitudes. The parameter $t_{tc}$ is the instantaneous time to the next corner apex, computed from the difference between the vehicle and corner apex positions divided by the current vehicle speed. This does not provide a perfectly accurate time to the corner, as that would vary depending on upcoming braking, but the UKF absorbs this error within the adaptation process. The model transitions from the known initial acceleration $a_{i}$, through deceleration $a_{d}$ and acceleration $a_{a}$ phases, returning to a final cruising value $a_{f}$ that is usually close to zero. The timing is quantified by parameters $t_{d}$, $t_{a}$ and $t_{cr}$ (illustrated by the dashed lines in Figure 2), and the transition speed parameter $η$ is fixed at $η = 3$ so that each transition takes approximately 2 s. The tanh shape provides time periods where the accelerations are modelled as constant and time periods where there is transition. This temporal separation of each parameter’s influence allows the model to have multiple parameters that are also well conditioned when they are adapted by the UKF. A novel feature of this approach is that it allows timing parameters to be identified within the time-domain UKF, a particularly challenging task.
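The single tanh building block that (1) chains together can be illustrated with a minimal Python sketch. The function name `tanh_step`, its arguments and the sign convention (the time variable `tau` increases through the transition) are assumptions made for this illustration and are not taken from the authors' implementation; only the value $η = 3$ comes from the text.

```python
import math


def tanh_step(tau, tau0, a_before, a_after, eta=3.0):
    """Smooth step from a_before to a_after, centred on tau = tau0.

    Assumed convention: tau increases with time, so the output is
    a_before well before tau0 and a_after well after it.
    """
    blend = math.tanh(eta * (tau - tau0))  # close to -1 before, +1 after
    return 0.5 * (a_after - a_before) * blend + 0.5 * (a_after + a_before)


# Hypothetical values: cruising at 0 m/s^2, then braking at -2 m/s^2.
for tau in (-1.0, 0.0, 1.0):
    print(tau, round(tanh_step(tau, 0.0, 0.0, -2.0), 3))
```

Since tanh(3) ≈ 0.995, the blend is essentially complete one second either side of `tau0`, which is in keeping with a transition time of approximately 2 s for $η = 3$.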

To give further meaning to the deceleration phase, this is then quantified in terms of the comfortable lateral acceleration magnitude, $a_{y}$, that the deceleration impulse affords, via

a_{y} = {u_{c}}^{2} c

(2)

u_{c} = \sqrt{\frac{a_{y}}{c}}

(3)

u_{s} - u_{c} = \int_{t_{d}}^{t_{a}} A_{1}

(4)

where $u_{c}$ is the speed that the driver instinctively targets for the upcoming apex of the corner (with peak curvature c), and $u_{s}$ is the driver’s speed immediately prior to the deceleration phase.
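As a worked illustration of (2) and (3), the following Python sketch computes the apex speed implied by a given comfortable lateral acceleration. The function names, the assumption that curvature is expressed in 1/m, and the numerical values are hypothetical and chosen only for this example.

```python
import math


def target_corner_speed(a_y, curvature):
    """Apex speed u_c in m/s from equation (3): u_c = sqrt(a_y / c)."""
    if curvature <= 0:
        raise ValueError("curvature must be positive")
    return math.sqrt(a_y / curvature)


def required_speed_drop(u_s, a_y, curvature):
    """Speed to shed before the apex, u_s - u_c.

    Floored at zero on the assumption that no braking is needed
    when the approach speed is already below the target.
    """
    return max(0.0, u_s - target_corner_speed(a_y, curvature))


# Hypothetical corner of 50 m radius (c = 0.02 1/m), comfort level 3 m/s^2.
u_c = target_corner_speed(3.0, 0.02)
print(round(u_c, 2), "m/s at the apex")
print(round(required_speed_drop(20.0, 3.0, 0.02), 2), "m/s to lose from 20 m/s")
```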

Coasting, A₂

When approaching a corner that is visible from some distance, most drivers release the acceleration pedal to let the vehicle coast under engine braking before they initiate the core deceleration phase described above. This behaviour is considered a sub-mode of cornering, illustrated by the initial red section of the model in Figure 2, which is modelled in a similar way:

A_{2} = \frac{1}{2} a_{coast} \tanh (η_{cd} (t_{c} - t_{cd})) + \frac{1}{2} a_{coast}

(5)

As with $A_{1}$, the variation of acceleration here is a function of $t_{c}$, and $a_{b}$ is the acceleration before the onset of coasting, with $t_{cd}$ quantifying the timing of the switch to coasting acceleration.

The parameters $a_{i}$, $a_{f}$ and $a_{b}$ are obtained from the last known or predicted future value of the model for the respective driving mode; that is, the last value of $a_{cd}$ will become $a_{i}$ at the start of cornering.

Straight-line speed, A₃

Although roads have continuously varying curvature, the known path can easily be divided into discretely identified, ‘significant’ corners, with the remainder of the path below a threshold curvature being considered essentially straight ($c < 0.001$). On these sections the driver behaviour is modelled simply as maintaining a constant longitudinal speed, which is simulated in the model as fixed:

A_{3} = 0

(6)

u = u_{cr}

(7)

This speed varies for each driver and may also vary within the same journey depending on factors such as cognitive load, mood, weather, etc. It is converted to a deviation, $d_{cr}$, from the derestricted speed limit $u_{dr}$,

u_{cr} = u_{dr} + d_{cr}

(8)
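The division of the path into straights and ‘significant’ corners described at the start of this subsection could be performed along the following lines. This Python sketch assumes the curvature is available as a sampled array along the path; the function name `find_corners` and the rule of grouping contiguous above-threshold samples into one corner, with its apex at peak curvature magnitude, are assumptions for illustration. The text specifies only the threshold of 0.001.

```python
import numpy as np


def find_corners(curvature, threshold=0.001):
    """Return (start, apex, end) index triples for runs where |c| >= threshold."""
    c = np.abs(np.asarray(curvature, dtype=float))
    is_corner = c >= threshold
    # Pad with False so every run has a detectable rising and falling edge.
    padded = np.concatenate(([False], is_corner, [False]))
    edges = np.diff(padded.astype(int))
    starts = np.flatnonzero(edges == 1)
    ends = np.flatnonzero(edges == -1) - 1  # inclusive end index
    corners = []
    for s, e in zip(starts, ends):
        apex = int(s) + int(np.argmax(c[s:e + 1]))  # sample of peak curvature
        corners.append((int(s), apex, int(e)))
    return corners


# Hypothetical curvature samples: two corners separated by a straight.
c = [0.0, 0.0005, 0.002, 0.01, 0.004, 0.0002, 0.0, -0.003, -0.02]
corners = find_corners(c)
print(corners)
```

Samples outside the returned index ranges would then be treated as straight sections and handled by (6)–(8).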

Speed limited behaviour, A₄

Driver behaviour for speed limited sections is separated into two stages – approaching the speed limit and driving within the speed limit. The first stage is modelled in the same way as the coasting behaviour,

A_{4} = \frac{1}{2} (a_{sl} - a_{i}) \tanh (η_{sl} (t_{tsl} - t_{sl})) + \frac{1}{2} (a_{sl} + a_{i})

(9)

which captures the time before the next speed limit, $t_{sl}$, at which the driver begins to decelerate, and quantifies the deceleration magnitude assuming that the speed limit will be achieved within the forthcoming impulse:

u_{s} - u_{sl} = \int_{t_{sl}}^{0} A_{4}

(10)

In (9), $t_{tsl}$ is the instantaneous time to the next speed limit start location, similar to $t_{tc}$ in (1).

The second stage replicates the straight line driving equations (6)–(7), replacing $u_{cr}$ with $u_{csl}$ – the speed that the driver targets while inside the limit, and to avoid the need for separate parameters for each speed limit (30, 40 mph) the parameter of interest is again taken as the deviation, $d_{sl}$ ,

u_{csl} = u_{sl} + d_{sl}

(11)

Model implementation

The open-loop longitudinal controller is applied through the particle model for the appropriate situation

\overset{\cdot}{u} = A_{i}

(12)

where $A_{i}$ takes the appropriate value based on the driving scenario determined by the current position on the known path. The final set of 10 independent parameters of interest is:

{(\begin{matrix} d_{cr} & a_{y} & a_{a} & t_{d} & t_{a} & t_{cr} & t_{cd} & a_{coast} & d_{sl} & t_{sl} \end{matrix})}^{T}

(13)

The controller logic is provided in full detail with the use of flowcharts in Appendix 2.

Unscented Kalman Filter

Kalman filters are widely used for real-time state estimation of nonlinear systems and they can also be applied to identify the parameters of a nonlinear model, for example in Refs.^20–22 The adaptation of model parameters can be achieved either alongside or independently from the estimated states, through the use of either an Extended Kalman Filter (EKF) or an Unscented Kalman Filter (UKF). Ma et al.¹⁷ conclude that either filter can identify parameters equally effectively; the UKF is preferred over the EKF here as it eliminates the need for computing system Jacobians. Other techniques (such as Bayesian filtering) could of course be used to optimise the parameter values, and they are likely to function equally well for system identification; the preference for the Kalman filter here comes from its flexibility to be used both in real-time and for identification, in a computationally efficient way.

The definition of the UKF depends on a nonlinear system model, $f$, and output model, $h$, in continuous form. For inputs $i$, under the influence of parameters $p$, we can assume there will exist errors $ω$, due to modelling simplifications in $f$, and errors $υ$, due to the combination of modelling errors and measurement noise in the sensor measurement of the output, $y$.²³

\overset{\cdot}{x} = f (x, i, p) + ω

(14)

y = h (x, i, p) + υ

(15)

The magnitude of the errors is represented through covariances

Q = E (ω ω^{T})

(16)

R = E (υ υ^{T})

(17)

A step by step process of implementing the UKF has been presented by Best.²⁰ The UKF computes state estimates $\hat{x}$ along with estimates of state error covariance:

P = E ([x - \hat{x}] {[x - \hat{x}]}^{T})

(18)

This is done using a ‘cloud’ of $(2 n + 1)$ sigma points distributed around the $n$ -th order state vector at each instant, $k$ :

χ_{0 k} = {\hat{x}}_{k}

(19)

χ_{ik} = {\hat{x}}_{k} + {\sqrt{(n + κ) P_{k}}}_{i}

(20)

χ_{(i + n) k} = {\hat{x}}_{k} - {\sqrt{(n + κ) P_{k}}}_{i}

(21)

where ${\sqrt{(n + κ) P_{k}}}_{i}$ is the $i$ th column of the matrix square root of $(n + κ) P_{k}$ , obtained using Cholesky decomposition.

Using Euler integration with time step $T$, the sigma points are propagated by:

χ_{i (k + 1)} = χ_{ik} + T f (χ_{ik}, u_{k}, θ)

(22)

and intermediate estimates for the propagated state and state error covariance matrix are computed by weighted averages as

{\hat{x}}_{k + 1}^{*} = \sum_{i = 0}^{2 n} W_{i} χ_{i (k + 1)}

(23)

P_{k + 1}^{*} = \sum_{i = 0}^{2 n} W_{i} [χ_{i (k + 1)} - {\hat{x}}_{k + 1}^{*}] {[χ_{i (k + 1)} - {\hat{x}}_{k + 1}^{*}]}^{T} + T Q

(24)

with $W_{0} = κ / (n + κ)$ and, for all other $i$, $W_{i} = 1 / (2 (n + κ))$.
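The sigma point construction (19)–(21) and the weights above can be sketched in Python with NumPy as follows. The function name `sigma_points` and the numerical values are hypothetical; `np.linalg.cholesky` returns a lower-triangular factor, whose columns are used here as the columns of the matrix square root.

```python
import numpy as np


def sigma_points(x_hat, P, kappa):
    """Return the (2n + 1) sigma points (one per row) and their weights."""
    x_hat = np.asarray(x_hat, dtype=float)
    n = x_hat.size
    # Lower-triangular L with L @ L.T == (n + kappa) * P.
    L = np.linalg.cholesky((n + kappa) * np.asarray(P, dtype=float))
    chi = np.empty((2 * n + 1, n))
    chi[0] = x_hat                           # equation (19)
    for i in range(n):
        chi[1 + i] = x_hat + L[:, i]         # equation (20)
        chi[1 + n + i] = x_hat - L[:, i]     # equation (21)
    W = np.full(2 * n + 1, 1.0 / (2.0 * (n + kappa)))
    W[0] = kappa / (n + kappa)
    return chi, W


# Hypothetical two-state example.
x_hat = np.array([1.0, 2.0])
P = np.array([[2.0, 0.5], [0.5, 1.0]])
chi, W = sigma_points(x_hat, P, kappa=1.0)
mean = W @ chi
cov = (W[:, None] * (chi - mean)).T @ (chi - mean)
print(np.allclose(mean, x_hat), np.allclose(cov, P))
```

For any symmetric positive definite $P$, the weighted mean and covariance of these points reproduce $\hat{x}$ and $P$, which is a convenient check on an implementation.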

Weighted estimates of the outputs are then obtained from the output model:

Y_{i (k + 1)} = h (χ_{ik}, u_{k}, θ)

(25)

{\hat{y}}_{k + 1} = \sum_{i = 0}^{2 n} W_{i} Y_{i (k + 1)}

(26)

The UKF propagates output error covariance, $P_{yy}$, according to the transformed sigma points, which, together with the cross-correlation estimate, $P_{xy}$, give the Kalman gain at each discrete time-step:

P_{yy} = \sum_{i = 0}^{2 n} W_{i} [Y_{i (k + 1)} - {\hat{y}}_{k + 1}] {[Y_{i (k + 1)} - {\hat{y}}_{k + 1}]}^{T} + R

(27)

P_{xy} = \sum_{i = 0}^{2 n} W_{i} [χ_{i (k + 1)} - {\hat{x}}_{k + 1}^{*}] {[Y_{i (k + 1)} - {\hat{y}}_{k + 1}]}^{T}

(28)

K_{k + 1} = P_{xy} {P_{yy}}^{- 1}

(29)

P_{k + 1} = P_{k + 1}^{*} - K_{k + 1} P_{yy} K_{k + 1}^{T}

(30)

{\hat{x}}_{k + 1} = {\hat{x}}_{k + 1}^{*} + K_{k + 1} (y_{k + 1} - {\hat{y}}_{(k + 1)})

(31)
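The measurement update (26)–(31) maps directly onto a few array operations. The following Python sketch assumes that the propagated sigma points, their outputs and the weights are already available as NumPy arrays; the function name `ukf_update` and all argument names are hypothetical, and this is an illustration rather than the implementation used in this work.

```python
import numpy as np


def ukf_update(chi_pred, Y_pred, W, x_star, P_star, y_meas, R):
    """One UKF measurement update.

    chi_pred : (2n+1, n) propagated sigma points, one per row
    Y_pred   : (2n+1, m) outputs of the sigma points, one per row
    W        : (2n+1,)   sigma point weights
    x_star, P_star : intermediate state estimate and covariance
    y_meas   : (m,) measurement, R : (m, m) measurement covariance
    """
    y_hat = W @ Y_pred                           # equation (26)
    dx = chi_pred - x_star
    dy = Y_pred - y_hat
    P_yy = (W[:, None] * dy).T @ dy + R          # equation (27)
    P_xy = (W[:, None] * dx).T @ dy              # equation (28)
    # Equation (29): K = P_xy P_yy^{-1}, solved without an explicit inverse.
    K = np.linalg.solve(P_yy.T, P_xy.T).T
    P_new = P_star - K @ P_yy @ K.T              # equation (30)
    x_new = x_star + K @ (y_meas - y_hat)        # equation (31)
    return x_new, P_new, K
```

When the output model is linear, this update reduces to the standard Kalman filter update, which gives a simple way of testing such a function.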

Adapting for parameter estimation

With minor modifications to the process described above, the UKF can be adapted to enable parameter estimation in addition to, or indeed in place of, state estimation. In this case we can simply define the state vector as the set of parameters of the longitudinal model:

x = {(\begin{matrix} d_{cr} & a_{y} & a_{a} & t_{d} & t_{a} & t_{cr} & t_{cd} & a_{coast} & d_{sl} & t_{sl} \end{matrix})}^{T}

(32)

And since there are no actual states in the parameter set, and over any reasonably small time period we would expect parameters to remain constant, the filter’s model for these will be zero:

\overset{\cdot}{x} = 0

(33)

By doing this, we reduce (14) and (16) to a simplified form that excludes process error altogether.

The sensor outputs are longitudinal velocity and longitudinal acceleration, as measured from the vehicle CAN. Both kinematics are included so that the filter can be driven by single time-step variations in speed (a sensor with a low noise to signal ratio) as well as approximation of the measured acceleration (which has a high noise to signal ratio, as illustrated in Figure 2). By including the Euler integration within the model we eliminate the need to include any input in the state vector; this simplifies the filter and also defines the model as a process to approximate parameters to fit u as a sensor value only. There are two separate modes – for cornering:

h_{k + 1} = u_{k} + T a_{k + 1}

(34)

{\hat{y}}_{k + 1} = {(\begin{matrix} h_{k + 1} & a_{k + 1} \end{matrix})}^{T}

(35)

And for a derestricted/speed-limited straight line:

h_{k + 1} = u_{k}

(36)

{\hat{y}}_{k + 1} = {(\begin{matrix} u_{cr} & 0 \end{matrix})}^{T}

(37)

Since there are no variable states, the definition of $Q$ is now the error in the assumption that the parameters are constant. Higher $Q$ assumes the parameters should be varied at a higher rate by the UKF. If we reasonably assume the parameters are not cross correlated, and that they are of similar magnitude, we can set $Q$ to moderate the speed of parameter adaptation through a scalar tuning value $λ$:

Q = λ I

(38)

Furthermore, the filter is able to calculate its own measurement error covariance, $R$ , directly from a given time-history of the modelled and measured outputs:

{υ_{u}}_{k} = u_{k} - u_{k - 1} - T a_{k - 1}

(39)

{υ_{a}}_{k} = a_{sensor} - a_{k - 1}

(40)

R = \operatorname{cov} (\begin{matrix} {\underline{υ}}_{u} & {\underline{υ}}_{a} \end{matrix})

(41)
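Equations (39)–(41) amount to taking the sample covariance of two residual time histories. A minimal Python sketch follows; the function and argument names are hypothetical, and it is assumed here that $a_{sensor}$ in (40) is the measurement at step $k$ and that $a$ denotes the modelled acceleration.

```python
import numpy as np


def measurement_covariance(u_meas, a_meas, a_model, T):
    """Estimate the 2x2 covariance R from recorded and modelled time histories.

    u_meas  : measured speed at each step
    a_meas  : measured acceleration at each step
    a_model : modelled acceleration at each step
    T       : sample interval in seconds
    """
    u_meas = np.asarray(u_meas, dtype=float)
    a_meas = np.asarray(a_meas, dtype=float)
    a_model = np.asarray(a_model, dtype=float)
    # Equation (39): one-step speed prediction error for k = 1 .. N-1.
    v_u = u_meas[1:] - u_meas[:-1] - T * a_model[:-1]
    # Equation (40): measured acceleration against the previous model value.
    v_a = a_meas[1:] - a_model[:-1]
    # Equation (41): sample covariance, each row treated as one variable.
    return np.cov(np.vstack((v_u, v_a)))


# Hypothetical noise-free data: constant 1 m/s^2 sampled at 10 Hz.
T = 0.1
a_model = np.ones(50)
u_meas = T * np.arange(50)
R0 = measurement_covariance(u_meas, a_model, a_model, T)
print(R0.shape)
```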

It should also be noted that, due to the propagation of sigma points, the computational load of a UKF increases as 2n with the number n of estimated states (in this case parameters). For ease of nomenclature we pose the UKF here as one single filter with 10 states. However, as the acceleration model varies according to the path topography, it would be straightforward to instead design separate UKFs for each stage of the model, and switch between them, running only one at any given time. Further, the limited interdependence of parameters would allow the maximum number of parameters for any one of these UKFs to be as low as two, which significantly reduces processing requirements.

System identification

The UKF can be used in conjunction with the Longitudinal Model as a real-time process, to continuously capture variations in driver behaviour. As it runs, the error between the sensor set and the model is minimised, thus driving the parameters that govern the shape of the acceleration curve to values that match the sensor more accurately. The single parameter λ can be adjusted to alter the speed of parameter adaptation.

It can also be used as a system identification process to identify the average behaviour for each driver; this is achieved by setting λ to a low value (here λ = 10⁻⁷) and running the UKF for each driver over a set of their data repeatedly. This is illustrated for a single driver–corner combination in Figure 3, where the parameters were initialised at nominal sub-optimal values and adapted to the final values shown in Table 1.
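The effect of repeated passes with a small λ can be illustrated on a toy problem. The Python sketch below deliberately swaps the UKF for a scalar linear Kalman filter that estimates one constant parameter from noisy synthetic data, so it is not the identification procedure used in this work; the sample interval `T`, the noise level and all names are assumptions.

```python
import numpy as np


def identify_constant(y, p0, P0, R, lam=1e-7, T=0.01, passes=5):
    """Estimate a constant parameter by repeated passes over the same data.

    The parameter model is dp/dt = 0, so the time update only inflates
    the covariance by T * lam; a small lam gives slow, averaging adaptation.
    """
    p, P = float(p0), float(P0)
    history = []
    for _ in range(passes):
        for y_k in y:
            P = P + T * lam          # time update with Q = lam
            K = P / (P + R)          # scalar Kalman gain
            p = p + K * (y_k - p)    # move the estimate towards the data
            P = (1.0 - K) * P        # covariance update
        history.append(p)            # estimate at the end of each pass
    return p, history


# Synthetic measurements of a hypothetical parameter with true value 3.
rng = np.random.default_rng(0)
y = 3.0 + 0.5 * rng.standard_normal(2000)
p_final, history = identify_constant(y, p0=0.0, P0=1.0, R=0.25)
print(round(p_final, 2), round(float(y.mean()), 2))
```

The covariance and the estimate are carried over from one pass to the next, so each pass refines the previous result rather than restarting from the initial guess.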

Figure 3.

UKF system identification for a single corner.

Table 1.

Initial and final cornering related parameter values.

Parameter	Initial	Final
$d_{cr}$	0	−3.43
$a_{y}$	4	3.14
$a_{a}$	2	1.12
$t_{d}$	3	4.68
$t_{a}$	0	−0.25
$t_{cr}$	6	5.93
$t_{cd}$	8	9.41
$a_{coast}$	0.5	0.29

It is important to note that every driver will behave differently over a corner and individual acceleration curves will vary slightly from the model. For example, the portion past the corner apex includes a gear change, with a large momentary drop in acceleration, but the model still manages to capture the overall acceleration impulse and timing. The same is valid for the section before the corner, where the system identification process was purposefully initialised with a value for $a_{y}$ that yields a smaller deceleration magnitude for cornering (A₁) than for coasting (A₂) yet the process still manages to adapt both timings and magnitudes to suitably characterise the driver.

Figure 4 shows identification results for the same corner for several drivers. It is clear that each takes an individual approach, with some coasting significantly earlier than others, such as drivers T and W, or accelerating more gently over longer periods – drivers H and X. Driver E also stands out as a driver who decelerates late and hard into the corner.

Figure 4.

UKF system identification for a single corner for multiple drivers.

When the identification is conducted over a longer section of road, the cornering parameters will not exactly match each individual corner; that is, drivers do not decelerate at exactly the same time with exactly the same magnitude before a corner every time. However, this variation is mitigated to some extent by parameter $a_{y}$, as this sets deceleration magnitude according to corner curvature, and therefore corner severity, through equation (4). The average behaviour on a sequence of corners, shown in Figure 5, still adequately represents the acceleration variations. Note that the UKF is able to find optimal parameters even when the initial values are very inaccurate.

Figure 5.

UKF system identification acceleration over a full section DRIVER G.

The non-cornering related parameters associated with straight-line driving and speed limit behaviour are also adapted accurately to each individual. The average straight-line derestricted cruise section in Figure 6 (left), from 800 to 1190 m, matches the actual cruise speed. This is further illustrated by another section traversed by the same driver with a longer straight-line section, Figure 6 (right), with initial and final parameter values shown in Table 2. The restricted section on the left of Figure 6, from 1540 m onwards, also shows the correct adaptation of speed limit deviation for that driver.

Figure 6.

UKF system identification velocity over a full section DRIVER G.

Table 2.

Initial and final parameter values.

Parameter	Initial	Final
$d_{cr}$	0	−3.68
$a_{y}$	4	2.18
$a_{a}$	2	0.86
$t_{d}$	5	3.30
$t_{a}$	0	−0.30
$t_{cr}$	4	4.74
$t_{cd}$	9	5.51
$a_{coast}$	0.5	0.78
$d_{sl}$	0	−0.24
$t_{sl}$	5	8.08

Finally, the system identification was conducted on all sections that each driver traversed over a 90 min drive, to find values that are representative of the overall average behaviour of that driver on that day. Figure 7 demonstrates the parameter convergence for this deliberately slow identification process.

Figure 7.

Parameter convergence throughout system identification.

The final mean values of each parameter for each driver, found from the last iteration of the identification process, define the characterisation of that driver (on that day) and are shown in Figure 8. These results present a wide spread of behaviour and some interesting observations in the extremes. For example, driver E is consistently late to react to road features such as corners and speed limits, as is evident from his low values for $t_{d}$ and $t_{sl}$. This correlates with his high tolerance for lateral acceleration inside a corner, which suggests he is a more aggressive driver. He is also in the top 3 drivers in terms of high comfortable cruise speed.

Figure 8.

System identification final parameter values.

It is also curious to note that only two drivers – L and P – consistently accelerate ahead of the corner apex, which is also indicative of a dynamic driver. Furthermore, driver P also maintains the fastest average cruise speed.

At the other end of the spectrum, driver Y shows a relaxed style, with a low derestricted cruise speed and lateral acceleration tolerance, and also coasts down and decelerates earlier than the rest of the sample. The whole sample presents a good mix of behaviours in non-extreme parts of the spectrum, with substantial variations in each of the parameters suggesting that individual characteristics of each driver are captured. There does not seem to be a strong or simple correlation with age in any of the parameters, though it is interesting that those over 70 (V-Y) all have a preference for low lateral acceleration through corners.

Mapping over-arching driver metrics

Finally, we explore the ability of an identified set of parameters to explain a single overall indicator of driver behaviour, such as their skill, safety, or fuel efficiency. Such ‘global’ metrics are hard to define, but sensible choices can be made in a number of logical ways. Fuel efficiency is easily and directly quantified since engine speed and torque can be used to calculate the total energy used by each driver. Other indicators, such as ‘skill’ and ‘safety’ are much harder to objectively quantify, but inferences can be made to relate more abstract concepts to measurable values. For example, the fuel efficiency of each driver might also be expected to correlate well with how aggressively they drive.

In addition to fuel efficiency, here we will consider a metric for safety by measuring driver speed on a particularly narrow, straight section of road where peripheral vision is obscured by trees and hedges, as shown in Figure 9.

Figure 9.

Depiction of an obscured narrow road.

At this point on the road, the driver is not close enough to the upcoming corner to be directly influenced by it, so their speed simply represents the overall level of risk they are willing to take while driving on a straight but narrow road. We do not claim this to be a sophisticated or complete metric for safety – it is just used as an indicator; other more complex objective measurements could be used. The purpose is to demonstrate the ability of the information in the driver parameters to model other appropriately separate metrics.

As we have a relatively large number of parameters (10) to map the metrics of a reasonably small sample of drivers (22), it is statistically prudent to find a reduced sub-set of parameters for this regression, and also to test the regression model in an independent way. A sub-set of the identified parameters can be used as the regressors in an Ordinary Least Squares (OLS) linear regression,

A \underline{θ} = \underline{b} + ε

(42)

where, for the right-hand side, $\underline{b}$, we use the global metric – either fuel efficiency or narrow road speed. The set of coefficients $\underline{θ}$ that minimises the norm of $\underline{ε}$ for any given n × m array A, made from the m utilised parameters of the n = 22 drivers, is easily computed as

\hat{θ} = {(A^{T} A)}^{- 1} A^{T} b

(43)
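The least-squares solution in equation (43) can be sketched numerically as follows. The data here are synthetic and the function name is hypothetical; `numpy.linalg.lstsq` is used in place of the explicit inverse because it gives the same solution for a full-rank A while being better behaved numerically.

```python
import numpy as np


def fit_ols(A, b):
    """Return the OLS coefficients and the residual of A @ theta = b + eps."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    theta_hat, *_ = np.linalg.lstsq(A, b, rcond=None)
    residual = A @ theta_hat - b  # eps as defined in equation (42)
    return theta_hat, residual


# Synthetic stand-in for 22 drivers and a 3-parameter sub-set (not the study data)
rng = np.random.default_rng(0)
A = rng.standard_normal((22, 3))
theta_true = np.array([1.0, -2.0, 0.5])
b = A @ theta_true

theta_hat, residual = fit_ols(A, b)
# The explicit normal-equation form of equation (43), for comparison
theta_normal = np.linalg.inv(A.T @ A) @ A.T @ b
```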

Using all 10 parameters to fit the metric for just 22 drivers presents problems in terms of statistical significance, since noise in the parameters can explain b by chance. The case is only statistically compelling if a relatively small sub-set of parameters achieves a good fit.

A method of Singular Value Decomposition (SVD) presented in Tuplin²⁴ can be used to find the rank order of significance of the parameters. First, b and each of the 10 columns of the full candidate set of A are normalised by removing the mean and dividing by the vector norm. The single parameter that will have the greatest contribution to explain b can then be obtained from

p = \max (A^{T} b)

(44)

We can then define a new $A^{*}$ (initially just a single vector) using only the most significant parameter(s). Equation (42), with $A$ replaced by the SVD of $A^{*}$, becomes:

US V^{T} θ = b + ε

(45)

where U and V are orthonormal rotation matrices, so the equation can be rewritten:

S θ^{'} = b^{'} + ε

(46)

with $θ^{'} = V^{T} θ$ and $b^{'} = U^{T} b$

In the first iteration, $A^{*}$ is a single vector, so S will be scalar and $b^{'}$ then contains all of the correlation information in its first element. By then setting this first element of $b^{'}$ to $0$ we define a new vector ${b^{'}}_{e}$ comprising all of the information in b that is not fitted by $A^{*}$. By reversing the rotation on ${b^{'}}_{e}$ we can then identify the next best single vector that will explain the remaining error,

p = \max (A'^{T} U {b^{'}}_{e})

(47)

where $A'$ is the remaining set of $A$ with vector(s) $A^{*}$ removed. Using this method repeatedly (and zeroing the first m elements of $b^{'}$ as $A^{*}$ builds from 1 to m factors), it is possible to identify regression models with a progressively larger number of factors that represent the best possible choice for any m-element sub-set of factors, without computing θ for any of the models in advance. Further, the condition number for each m-factor model is given from the singular values in S, by

C = \frac{S_{1}}{S_{m}}

(48)
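The selection procedure of equations (44) to (48) can be sketched as below. This is an illustrative reading of the steps described above, not the implementation from Tuplin²⁴: the function names are hypothetical, the data are synthetic, and taking the absolute value of the correlations (so that strongly negative correlations also rank highly) is an assumption, since equation (44) is written simply as a maximum.

```python
import numpy as np


def normalise(x):
    """Remove the mean and divide by the vector norm (column-wise for arrays)."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean(axis=0)
    return x / np.linalg.norm(x, axis=0)


def rank_parameters(A, b):
    """Greedy SVD-based ranking of the columns of A for explaining b.

    Returns the column indices in order of selection, the model error
    |eps| after each addition, and the condition number S_1 / S_m.
    """
    A = normalise(A)
    b = normalise(b)
    remaining = list(range(A.shape[1]))
    selected, errors, conditions = [], [], []
    unexplained = b.copy()
    while remaining:
        # Equations (44)/(47): correlate the remaining columns with the unexplained part of b
        scores = np.abs(A[:, remaining].T @ unexplained)
        best = remaining[int(np.argmax(scores))]
        selected.append(best)
        remaining.remove(best)
        # Equation (45): SVD of the current sub-set A*
        U, S, Vt = np.linalg.svd(A[:, selected], full_matrices=True)
        b_rot = U.T @ b                  # b' in equation (46)
        b_rot[:len(selected)] = 0.0      # zero the fitted elements to obtain b'_e
        unexplained = U @ b_rot          # reverse the rotation
        errors.append(float(np.linalg.norm(b_rot)))
        conditions.append(float(S[0] / S[-1]))  # equation (48)
    return selected, errors, conditions


# Synthetic stand-in: 22 drivers, 10 parameters, metric dominated by column 4
rng = np.random.default_rng(0)
A = rng.standard_normal((22, 10))
b = 5.0 * A[:, 4] + A[:, 7]
selected, errors, conditions = rank_parameters(A, b)
```

Because rotation by U preserves the vector norm, the norm of the zeroed ${b^{'}}_{e}$ gives the model error directly, which is why θ never needs to be computed during the ranking.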

Figure 10 shows the corresponding model error (| $ε$ |) for each candidate set of factors, building from 1 to 10 (left) along with the condition number in each case (right).

Figure 10.

Model error and condition number for an increasing set of parameters in the fit.

It is immediately clear that a subset of just five factors for the safety metric provides model errors that are almost as low as for 10 factors, together with good conditioning. It is less clear whether a subset of three or seven parameters is appropriate for the energy metric. It is therefore interesting to also consider the rank order of significance of the parameters, which is shown in Table 3 for both indicators.

Table 3.

Parameter order of significance for both indicators (1 being the most significant parameter in the regression).

Rank	Energy	Safety
1	$d_{cr}$	$d_{cr}$
2	$t_{a}$	$t_{a}$
3	$t_{d}$	$t_{d}$
4	$t_{cd}$	$t_{sl}$
5	$a_{coast}$	$t_{cr}$
6	$t_{cr}$	$d_{sl}$
7	$t_{sl}$	$t_{cd}$
8	$a_{y}$	$a_{coast}$
9	$d_{sl}$	$a_{a}$
10	$a_{a}$	$a_{y}$

In both models, the most significant parameters are $d_{cr}$, $t_{a}$ and $t_{d}$. This is not unexpected, as these are the parameters that represent how fast a driver generally drives, as well as how early or late they decelerate into and accelerate out of a corner. It is easy to see how these would correlate with both energy and safety.

For the energy model, the next two parameters pertain to coastdown behaviour. This makes sense as more gradual velocity changes use less energy. Following that are the remaining temporal parameters. The ranking of $a_{y}$ as an unimportant parameter in the energy model is curious, as a more aggressive driver is likely to maintain a higher lateral acceleration within a corner. On the other hand, the exclusion of speed limit related behaviour in the same model is expected, as there will be comparatively little variation between drivers in their behaviour when driving through a speed limit.

For the safety model, it is interesting that the temporal parameters are of greatest significance. With regard to $t_{d}$ and $t_{sl}$, a late reaction will leave less margin for error and could be indicative of a driver’s safety factor. In terms of $t_{a}$, the second most significant parameter, earlier acceleration out of the corner leaves less time to react to unexpected obstacles or other road users.

To test the efficacy of the final three-factor energy and five-factor safety models in an unbiased way, a validation is run by excluding one driver from the fitting process and examining the result obtained when the model $\underline{θ}$ obtained from all other drivers is applied to that driver. This is then repeated for all drivers, giving the results shown in Figure 11.

Figure 11.

Model accuracy comparison for driver excluded from the fit.
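The exclude-one-driver validation described above can be sketched as a leave-one-out loop. The function name and the synthetic data are assumptions for illustration only; the text does not state whether normalisation is recomputed for each excluded driver, so none is applied here.

```python
import numpy as np


def leave_one_out_predictions(A, b):
    """Predict each driver's metric from a model fitted to all other drivers."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = A.shape[0]
    predictions = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i  # mask that excludes driver i from the fit
        theta, *_ = np.linalg.lstsq(A[keep], b[keep], rcond=None)
        predictions[i] = A[i] @ theta
    return predictions


# Synthetic stand-in for 22 drivers and a three-factor model (not the study data)
rng = np.random.default_rng(1)
A = rng.standard_normal((22, 3))
b = A @ np.array([0.8, -0.3, 0.5]) + 0.05 * rng.standard_normal(22)

predictions = leave_one_out_predictions(A, b)
rms_error = float(np.sqrt(np.mean((predictions - b) ** 2)))
```

Comparing `predictions` with `b` driver by driver gives the kind of comparison summarised in Figure 11.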

Although not perfect for all drivers, in most cases the model fit is very good; this method is certainly reliable for deciding broadly whether a given driver is energy wasteful or not, and what broad range of ‘safety factor’ they have, from only the most significant adapted characterisation parameters.

Conclusion

This paper presents a means of fully characterising a driver in terms of a set of parameters that are estimated by a UKF used in combination with a dynamic model. Although at this stage the model has been tested using only one driving scenario, the structure could easily be expanded to other free-driving conditions.

By adopting an open-loop acceleration structure, the model captures routine, pre-planned aspects of driving which most readily characterise driving style. A significant novelty is in the estimation of time-based parameters, which turn out to be important indicators of driver behaviour when the model is correlated to other factors, such as energy wastefulness and a measure of safety.

The filter can operate in real-time using readily-available longitudinal velocity and acceleration signals; it is thus capable of capturing variations in driving behaviour caused by change of mood or other journey-specific conditions. The full characterisation has also been correlated against additional metrics of energy wastefulness and a safety factor. Although the safety factor cannot be considered in any way comprehensive, both metrics are suitably independent from the driver model parameters themselves. The ability of a relatively small sub-set of model parameters to map both metrics demonstrates that they appropriately span the behaviour range of the drivers included in the study.

Further development of this work will incorporate the steering model developed in Banerjee et al.¹³ to establish a combined lateral-longitudinal driver characterisation, and to consider a wider range of road types.

Footnotes

Appendix 1

Driver	Age*	Gender	Occupation
A	20.1	M	Student
B	20.2	F	Student
C	20.3	F	Student
D	21.1	M	Student
E	21.2	M	Student
F	24.1	F	Teacher
G	24.2	F	Junior Doctor
H	44	F	University Professor
J	48.1	F	Administrator
K	48.2	M	Engineering Manager
L	48.3	M	Business Manager
M	49	F	Housewife/Teacher
N	50	F	Foster Carer
P	51.1	M	Business Consultant
Q	51.2	M	IT Manager
R	58	M	Careers Advisor
S	59	F	University Lecturer
T	69	M	Retired
V	74.1	M	Retired
W	74.2	F	Retired
X	81	F	Retired
Y	85	M	Retired

*Decimals serve only to indicate the order of ages.

Appendix 2

The controller logic is displayed in the flowcharts in Figures A–C. It relies on setting specific flags for corner, speed limit and coastdown behaviour, and indices for the relevant corners and speed limits. The flowchart has been split into three significant blocks: one that governs which mode of behaviour should be relevant, and two that govern index advancement and flag setting for speed limits and corners respectively.

Additional conditions, shown below, address a niche scenario in which speed limit approach behaviour must take precedence over corner behaviour. They ensure that the vehicle is not on the approach to a speed limit while also being past the corner apex but before cruise behaviour:

(48)

(f_{sl} = 1 \ \& \ t_{tsl} < 0 \ \& \ t_{c} < t_{a}) = 0

The ‘early corner and speed limit approach provision’ in Figures 5 and 6 refers to the special situation where the vehicle is past the current corner apex and is either approaching the next corner sooner than expected or approaching a speed limit:

(48)

(t_{tc+1} > -(t_{d} + 0.8) \ | \ (f_{sl} = 1 \ \& \ t_{sl} < t_{tsl} < 0)) \ \& \ S_{vehicle} > S_{c}

In both cases, where the corner index is being increased in Figures 5 and 6, we will check that the next upcoming corner is at least 10 s further away than the current coastdown switch time:

(48)

f_{cd} = 1 \text{ if } t_{c+1} > t_{cd} + 10
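To make the operator grouping in the three conditions above explicit, they can be transliterated into boolean functions as in the sketch below. The function and argument names are hypothetical, the meaning attached to each symbol is inferred from its subscript, and returning 0 when the coastdown check fails is an assumption, since the text only states when the flag is set to 1.

```python
def corner_behaviour_allowed(f_sl, t_tsl, t_c, t_a):
    """First condition: the bracketed conjunction must evaluate to 0 (false)."""
    return not (f_sl == 1 and t_tsl < 0 and t_c < t_a)


def early_approach_provision(t_tc_next, t_d, f_sl, t_sl, t_tsl, s_vehicle, s_c):
    """Second condition: early next corner OR speed limit approach, AND past S_c."""
    next_corner_early = t_tc_next > -(t_d + 0.8)
    speed_limit_approach = f_sl == 1 and t_sl < t_tsl < 0
    return (next_corner_early or speed_limit_approach) and s_vehicle > s_c


def coastdown_flag(t_c_next, t_cd):
    """Third condition: set f_cd only if the next corner is more than 10 s
    beyond the coastdown switch time (the value 0 otherwise is an assumption)."""
    return 1 if t_c_next > t_cd + 10 else 0


# Illustrative values only (not taken from the study)
allowed = corner_behaviour_allowed(f_sl=0, t_tsl=-2.0, t_c=0.5, t_a=1.0)
early = early_approach_provision(-1.0, 1.5, 0, -3.0, -5.0, 120.0, 100.0)
f_cd = coastdown_flag(t_c_next=25.0, t_cd=10.0)
```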

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Emanuil Mladenov

References

1. MacAdam CC. Understanding and modeling the human driver. Veh Syst Dyn 2003; 40(1–3): 101–134.

2. Riemersma JB. Visual control during straight road driving. Acta Psychol 1981; 48: 215–225.

3. MacAdam, Sayers, Pointer, et al. Crosswind sensitivity of passenger cars and the influence of chassis and aerodynamic properties on driver preferences. Veh Syst Dyn 1990; 19: 201–236.

4. MacAdam CC. Application of an optimal preview control for simulation of closed-loop automobile driving. IEEE Trans Syst Man Cybern 1981; 11: 393–399.

5. Ungoren, Peng. An adaptive lateral preview driver model. Veh Syst Dyn 2005; 43(4): 245–259.

6. Zhang, Lin, Chin YKS. A pattern-recognition approach for driving skill characterization. IEEE Trans Intell Transp Syst 2010; 11(4): 905–916.

7. Horiuchi, Yuhara. An analytical approach to the prediction of handling qualities of vehicles with advanced steering control system using multi-input driver model. J Dyn Syst Meas Control 2000; 122(3): 490–497.

8. Symonds, Sharp, Casanova. A mathematical model for driver steering control, with design, tuning and performance results. Veh Syst Dyn 2000; 33(5): 289–326.

9. Kiencke, Majjad, Kramer. Modeling and performance analysis of a hybrid driver model. Control Eng Pract 1999; 7(8): 985–991.

10. Montella, Pariota, Galante, et al. Prediction of drivers’ speed behavior on rural motorways based on an instrumented vehicle study. Transp Res Rec 2014; 2434: 52–62.

11. Malaghan, Pawar DS. A short-term naturalistic driving study on predicting comfort thresholds for horizontal curves on two-lane rural highways. J Transp Eng A Syst 2022; 148(8). DOI: 10.1061/jtepbs.0000703

12. Weir, Chao. Review of control theory models for directional and speed control. In: Carlo Cacciabue (ed.) Modelling driver behaviour in automotive environments: critical issues in driver interactions with intelligent transport systems. London: Springer, 2007, pp.293–311.

13. Banerjee, Chowdhury, Chakravarty. Mydrive: Drive behavior analytics method and platform. In: WPA 2016 - Proc. 3rd Int. Work. Phys. Anal. co-located with MobiSys 2016, June 2016, pp.7–12. DOI: 10.1145/2935651.2935652.

14. Wang, et al. Modeling aggressive driving behavior based on graph construction. Transp Res Part C Emerg Technol 2022; 138: 103654.

15. Monselise, Yang CC. Detecting aggressive driving patterns in drivers using vehicle sensor data. Transp Res Interdiscip Perspect 2022; 14: 100625.

16. Naji HAH, Xue, Lyu, et al. Risk levels classification of near-crashes in Naturalistic driving data. Sustainability 2022; 14(10): 6032.

17. Zhang, Chen, et al. A comparative study of aggressive driving behavior recognition algorithms based on vehicle motion data. IEEE Access 2019; 7: 8028–8038.

18. Edwards AJ. Aggressive-dynamics metrics for drive-cycle characterization. Transp Res Interdiscip Perspect 2022; 14: 100592.

19. Best MC. A simple realistic driver model. Avec 2012; 12: 9–12.

20. Best MC. Real-time characterisation of driver steering behaviour. Veh Syst Dyn 2019; 57(1): 64–85.

21. Best, Gordon, Dixon PJ. An extended adaptive Kalman filter for real-time state estimation of vehicle handling dynamics. Veh Syst Dyn 2000; 34(1): 57–75.

22. Best MC. Parametric identification of vehicle handling using an extended Kalman filter. Int J Veh Auton Syst 2007; 5(3/4): 256.

23. Julier, Uhlmann JK. New extension of the Kalman filter to nonlinear systems. Signal Process Sens Fusion Target Recognit VI 1997; 3068: 182.

24. Tuplin. Adaptive electronic throttle control of road vehicles, Loughborough University, 2006.

Characterisation of driver longitudinal behaviour using an Unscented Kalman Filter
