Touchscreens for Aircraft Navigation Tasks: Comparing Accuracy and Throughput of Three Flight Deck Interfaces Using Fitts’ Law

Abstract

Objective

Use Fitts’ law to compare accuracy and throughput of three flight deck interfaces for navigation.

Background

Industry is proposing touch-based solutions to modernize the flight management system. However, research evaluating touchscreen eﬀectiveness for navigation tasks in terms of accuracy and throughput on the flight deck is lacking.

Method

An experiment was conducted with 14 participants in a flight simulator, aimed at creating Fitts’ law accuracy and throughput models of three different flight deck interfaces used for navigation: the mode control panel, control display unit, and a touch-based navigation display. The former two constitute the conventional interface between the pilot and the flight management system, and the latter represents the industry-proposed solution for the future.

Results

Results indicate less accurate performance with the touchscreen navigation display compared to the other two interfaces and the throughput was lowest with the mode control panel. The control display unit was better in both accuracy and throughput, which is found to be largely attributed to the tactile and physical nature of the interface.

Conclusion

Although performance in terms of accuracy and throughput was better with the control display unit, a question remains whether, when used during a more realistic navigation task, performance is still better compared to a touch-based interface.

Application

This paper complements previous studies in the usage of aircraft touchscreens with new empirical insights into their accuracy and throughput, compared to conventional flight deck interfaces, using Fitts’ law.

Keywords

touchscreens interface evaluation human performance modeling coordinated action flight displays Fitts’ law

Introduction

The modern-day flight management system (FMS) was introduced on the Boeing 767 in 1982 (Bulfer, 1991) to assist pilots in both lateral navigation (LNAV) and vertical navigation (VNAV). As an interface to the FMS, the control display unit (CDU) was introduced and remains the industry standard to date. For example, when the CDU on the Boeing 787 was replaced with a digital copy, the look and feel remained the same.

However, looking ahead at future developments in LNAV procedures, the necessity to modernize the FMS interface becomes evident. The SESAR Joint Undertaking expects the number of flights in European airspace to have increased by 52% in 2035 compared to 2012 (SESAR Joint Undertaking, 2014). As a result, Huisman, Verhoeven, Van Houten, and Flohr (1997) expect an increased frequency of en-route route adjustments. Van Marwijk, Borst, Mulder, Mulder, and Van Paassen (2011) call for “a redesign of the navigation planning interface [due to] increasing punctuality in, [amongst others,] European SESAR concepts, [which will] make airborne flight plan amendment increasingly complex.”

Touchscreens have the potential to reduce cognitive workload and increase situation awareness due to their “intuitive” way of interaction and their flexibility in displaying additional task-relevant information, respectively (Dodd et al., 2014; Hutchins, Hollan, & Norman, 1985; Kaminani, 2011; Rogers, Fisk, McLaughlin, & Pak, 2005; Shneiderman, 1982). As such, aircraft and equipment manufacturers have been proposing touchscreens on their newest flight decks in anticipation of increased complexity in future navigation tasks. However, concerns have been voiced about the loss of tactile feedback, usability in dynamic environments (e.g., turbulence), and physical fatigue of operation (Degani, Palmer, & Bauersfeld, 1992; Dodd et al., 2014; Kaminani, 2011; Stuyven, Damveld, & Borst, 2012).

Previous research has been done evaluating touchscreen interfaces in general and comparing them to less direct interfaces such as trackballs, trackpads, and rotary controllers (Ballas, Heitmeyer, & Pérez-Quiñones, 1992; Bjørneseth, Dunlop, & Hornecker, 2012; Degani et al., 1992; Forlines, Wigdor, Shen, & Balakrishnan, 2007; Stanton, Harvey, Plant, & Bolton, 2013). In the aviation domain, Dodd et al. (2014) found increased task execution time, error rates, and subjective workload for touchscreen usage in turbulence and at specific cockpit positions. However, a truly comparative study between a touchscreen and conventional flight deck interfaces on a fundamental input level, quantified in terms of input accuracy and information throughput as a function of task complexity, has not yet been carried out.

The goal of this research is to develop and compare accuracy and throughput models of three flight deck interfaces used during LNAV. These interfaces are the mode control panel (MCP), the CDU, and a touch-based navigation display (TND), illustrated in Figure 1. The models will be developed based on variations of Fitts’ law (Fitts, 1954). This law, first published in 1954, has been used by human–machine interaction researchers for analysis of the speed-accuracy trade-off and movement time (MT) in rapid aimed movement tasks (Jagacinski & Fisch, 1997; Jagacinski, Repperger, Ward, & Moran, 1980; Stoelen & Akin, 2010; Trudeau, Udtamadilok, Karlson, & Dennerlein, 2012), and as a valuable tool for human–machine interface design (Flach, Hagen, O’Brien, & Olson, 1990; Francis & Oxtoby, 2006; Gao & Sun, 2015; Jax, Rosenbaum, Vaughan, & Meulenbroek, 2003; MacKenzie, 1992; Soukoreff & MacKenzie, 2004). Fitts’ law models also enable quantitative comparison of the effectiveness of different interfaces based on their throughput (Jagacinski & Fisch, 1997; MacKenzie, 1992; Soukoreff & MacKenzie, 2004), describing how many bits of task difficulty, as defined by an index of difficulty (ID), an interface can handle per second.

Figure 1.

Three flight deck interfaces that are to be investigated: (a) heading control knob on the mode control panel (MCP), (b) control display unit (CDU), and (c) touch-based navigation display (TND).

Fitts’ Law

The complete and original Fitts’ law model (Fitts, 1954) that describes MT as a function of ID in a high-accuracy pointing task is presented in Equation 1. Here, a and b are empirical linear regression constants, A is the amplitude (distance to be traversed), and W_e is the effective width of the target. The latter is empirically calculated using the standard deviation of measured endpoint coordinates (Soukoreff & MacKenzie, 2004).

M T = a + b \cdot I D = a + b \log_{2} (\frac{A}{W_{e}} + 1) [seconds]

(1)

The usefulness of Fitts’ law in this study is twofold. First, it can help build models of task execution time for a particular interface. Second, it can provide a quantitative description of the FMS interface by comparing the throughput (TP) of individual interfaces. Equation 2 defines the throughput in bits per second, which is calculated by dividing the ID by the measured MT for each participant and experimental condition. The total numbers of conditions and participants are defined by x and y, respectively. $I D_{e_{i j}}$ defines the index of difficulty, adjusted using the effective width $W_{e}$ , and $M T_{i j}$ the movement time, both for a specific experimental condition and participant.

T P (Throughput) = \frac{1}{y} \sum_{i = 1}^{y} (\frac{1}{x} \sum_{j = 1}^{x} \frac{I D_{e_{i j}}}{M T_{i j}}) [bits / s]

(2)

Mode Control Panel (MCP)

The MCP is the standard interface between the pilot and the autopilot and uses, among others, a rotary heading control knob with which the horizontal flight direction (i.e., heading) can be changed. Research by Stoelen and Akin (2010) has shown that Fitts’ law can be extended to rotational input tasks by replacing the linear width and amplitude with an angular width ω and amplitude α, respectively. The effective angular width ω_e can be calculated based on the standard deviation in endpoints $σ_{ϕ}$ . Stoelen and Akin (2010) found a good model fit for a smooth, continuous rotational task. The heading control knob, however, uses detents, resulting in discrete motion inputs, and there is no literature regarding its effectiveness in terms of Fitts’ law. Despite potential inappropriateness of the model proposed by Stoelen and Akin (2010), shown in Equation 3, this research still used it in modeling the heading control knob on the MCP.

M T = a + b \log_{2} (\frac{α}{ω_{e}} + 1) where ω_{e} = \sqrt{2 π e} σ_{ϕ}

(3)

Control Display Unit (CDU)

The CDU is a keyboard-type input device by which pilots can change a planned flight route by entering or deleting waypoints. Research by MacKenzie and Buxton (1992) and Soukoreff and MacKenzie (1995) has shown that Fitts’ law can be extended to keyboard data-entry tasks. The model, shown in Equation 4, is based on an assumption that using either the minimum height H or width W of the target in the computation of the ID is sufficient. MacKenzie and Buxton (1992) have found this to provide adequate results. In the case of a key-repeat task, the amplitude is zero and thus the ID, namely $\log_{2} (0 + 1)$ , will equal zero. Therefore, Soukoreff and MacKenzie (1995) propose an averaged repeat movement time parameter $M T_{r e p e a t}$ for such tasks.

M T_{i j} = (\begin{array}{l} a + b \log_{2} (\frac{A_{i j} + \min (H_{j}, W_{j})}{\min (H_{j}, W_{j})}) & i f i \neq j \\ M T_{r e p e a t} & i f i = j \end{array}

(4)

Furthermore, due to the physical inability to measure movement endpoints on the keys, the computation of the effective width is troublesome. As such, an alternative approach was proposed by Soukoreff and MacKenzie (2004) based on error rate, as presented in (Equation 5) and used in this research. Here, $E r r$ is the error rate of a specific condition that equals the number of wrongly pressed keys over the total number of pressed keys, and $z (x)$ represents “the inverse of the standard normal cumulative distribution, or, the z-score that corresponds to the point where the area under the normal curve is $x %$ .” These accuracy adjustments must be performed for each individual condition and participant, given that $W_{e}$ describes the “within-participant variability,” and hence pooling endpoint information will not result in proper results (Soukoreff & MacKenzie, 2004).

\begin{array}{l} I D_{e} = \log_{2} (\frac{A_{i j}}{W_{i j_{e}}} + 1) \\ where \\ W_{i j_{e}} = (\begin{matrix} \min (H_{j}, W_{j}) \times \frac{2.006}{z (1 - E r r / 2)} & if E r r > 0.0049 % \\ \min (H_{j}, W_{j}) \times 0.5089 & otherwise \end{matrix} . \end{array}

(5)

Touch-Based Navigation Display (TND)

Research by Bi, Li, and Zhai (2013) has extended the original Fitts’ law to produce the Finger Fitts’ Law, shown in Equation 6. Their research proved effective in modeling finger input using touchscreens. Two new parameters are introduced: σ, the variation in movement endpoints, and σ_a, the variation in input device precision (e.g., finger width). The former is calculated using the distribution in endpoint coordinates during the task, where a bivariate standard deviation σ_xy is used for two-dimensional (2D) movements. The latter can be measured using a finger calibration task, where users are asked to repeatedly touch an identical (in size, not location) target; exact touch locations are used in this research to calculate the bivariate standard deviation σ_xy instead of σ.

\begin{array}{l} M T = a + b \log_{2} (\frac{A}{W_{e}} + 1) \\ where W_{e} = \sqrt{2 π e (σ_{x y}^{2} - σ_{a}^{2})} \end{array}

(6)

Method

The objective of the experiment was to develop and compare Fitts’ law models for each of the three interfaces using the respective models described earlier. The experiment consisted of three separate, but similar sub-experiments corresponding to the interfaces. The overarching design of the experiment is discussed here, followed by a brief discussion of each sub-experiment focusing on one interface. Each experiment explicitly measured the effect of ID on the observed MT for participants engaged in an aimed rapid movement task using the respective interface.

Participants

Given that the goal of the experiment was to describe human performance in performing a precision pointing task for a specific interface using Fitts’ law, prior experience with piloting aircraft and/or interacting with the interfaces was not relevant. The lack of previous encounters with either the MCP or CDU (for example by naive participants) was dealt with during a training phase, where each participant got sufficiently accustomed to the input device (see “General Procedure”). Right-handed participants were preferred given the positioning in the left seat and thus interface operation with the right hand. A total of 14 people participated in the experiment, of which a brief profile is given in Table 1. Note that one left-handed participant was invited in order to see the effect of handedness in using the TND.

Table 1

Profile of Participants

Profile	13 students, 1 professor
Gender	11 male, 3 female
Age	Ranging 21 to 49, averaging 24 years
Handedness	13 right-handed, 1 left-handed

Experiment Design

The experiment had a within-participants design. Figure 2 illustrates the different orders employed in presenting the conditions for 12 participants. Three groups of four participants (A, B, C) were administered the same interface order. The remaining two participants followed the order of the first two groups and of which one was left-handed. Given that each interface was different, different manipulations were required to achieve comparable indices of difficulty. The design was such that number of repetitions per unique ID per participant ranged between 10 and 12, similar to that found and recommended in literature (Accot & Zhai, 1997; Bi et al., 2013; Soukoreff & MacKenzie, 2004; Stoelen & Akin, 2010). The specific manipulations per interface condition will be detailed in the description of the interface conditions. Finally, the ranges of the evaluated inputs per interface condition were representative for a realistic LNAV re-routing task to avoid a weather cell (i.e., dialing in a heading with the MCP, inserting a new waypoint using the CDU, and finger dragging a waypoint using the TND).

Figure 2.

Schedule of the experiment per participant group.

Apparatus

The experiment was conducted in the SIMONA Research Simulator (SRS) at the Delft University of Technology, shown in Figure 3. Motion and outside visual capabilities were not utilized; however, the interior cabin provided a realistic look and feel to the interaction between participants and the three flight deck interfaces. Similar to a real flight deck, the locations and sizes of the interfaces, as well as the position of the participants in the left seat, and left of the interfaces, were fixed.

Figure 3.

Cabin of the SIMONA Research Simulator (SRS) showing each of the three flight deck interfaces.

Due to space confinements in the SIMONA Research Simulator, the touchscreen was located below the CDU (see Figure 3). As a result, to allow for a proper comparison, the participants were required to put their seat backwards when using the TND. Markers were installed on the cabin floor to ensure constant seat positioning. As such, the participant’s relative location to the touchscreen was comparable to that of the CDU and MCP.

General Procedure

This research complied with the tenets of the Declaration of Helsinki and was approved by the Human Research Ethics Committee of TU Delft. Informed consent was obtained from each participant. Participants received a briefing document a few days prior to the experiment. An introduction was given concerning the relevance of the experiment, the task to be conducted, and the expected time schedule. Prior to each interface condition, following a standardized procedure, a verbal briefing was given. Most importantly, and “essential for any Fitts’ law experiment” (Soukoreff & MacKenzie, 2004), the participant was requested to put specific emphasis on speed and accuracy in order to achieve an approximate 96% target hit-percentage with a smooth consistent input motion. Feedback on actual hit-rates was provided during all runs of the experiment. Training runs preceded data measurement and provided participants with time to master the speed-accuracy trade-off. When they reached the 96% target, they were considered to be sufficiently trained. More details on specific procedures per interface condition will be provided later.

Mode Control Panel (MCP) Condition

The MCP setup is presented in Figure 4. On the inboard screen, the navigation display (❶) is shown, on which the task information was presented. A magenta heading bug (see ❷) indicates the heading commanded on the MCP. At the start of each trial, the bug was reset to the north-up position. Two independent variables were used: the angular amplitude α and angular width $ω;$ together they determine the ID. The target was shown using two cyan lines (see ❸), the angular distance between which represents the width $ω$ . The angular distance between the starting position of the heading bug and the center of the target is the movement amplitude α. The choice of α and ω were such that they form a representative and realistic range of IDs for the MCP (e.g., when circumnavigating complex weather systems):

\begin{array}{l} α = [10, 20, 30, 40, 50, 60] [\deg] \\ ω = [2, 4, 6, 8] [\deg] \end{array}

These combinations resulted in an ID range of $[1.17, 4.95]$ . In total, participants were confronted with 24 $(6 \times 4)$ different combinations. Given that the variables α and ω are multiples of two, a total of 16 unique ID values existed. For example, the combinations $α = 10, ω = 2$ and $α = 20, ω = 4$ produce the same ID.

Figure 4.

Experiment procedure for mode control panel (MCP) experiment, showing an illustration of the navigation display and heading control knob.

Participants needed to use the course select rotary knob to hit the target ω at a certain amplitude α. The course select knob (illustrated by ❹) on the MCP is a standard rotary encoder with 24 “clicks” per full rotation. Note that for this study, the course knob was used due to a malfunction in the heading knob. Because both knobs operate in the same fashion (although they are used in a different navigation context) and initial hand movements toward the knob was not included in movement time measurements, this was not considered problematic. A small LCD display above the knob reflected the commanded heading. The movement time MT to hit the target was measured in milliseconds. In accordance with recommendations in literature (Soukoreff & MacKenzie, 2004) only the actual time the participant moved the heading knob was measured, thereby omitting engage, homing, dwell, and reaction times. Hereby, confounding factors such as cognitive effort required to understand the task and initial hand movements toward the interface were mitigated. Accuracy was measured by recording the physical endpoints of each individual movement. During the experiment the success rate in acquiring the target was displayed in the control room and communicated to the participant to provide feedback on their adherence to the speed-accuracy trade-off governing Fitts’ law.

The training phase for the MCP condition contained one full set of 24 combinations. The measurement phase constituted eight sets of 24 combinations, totaling 192 measurements runs.

CDU Condition

The experiment setup is shown in Figure 5. An illustration of the CDU including the display is shown in ❶. In a re-routing LNAV task, pilots use the CDU to insert new waypoints by entering their name in the scratchpad (see ❷) and inserting it in the list of waypoints through one of the line select keys (LSK; see ❸). The $CLR$ key could be used to backspace the scratchpad. The full content of the scratchpad could be inserted to any of the 12 line select keys (see ❸) by pushing the respective key. The text subsequently moved and the scratchpad was cleared.

Figure 5.

Experimental setup for control display unit (CDU) experiment, showing an illustration of the control display unit and location within the flight deck.

In this experiment, the variables A and W were defined by a set of words that needed to be entered and subsequently moved to target line select keys. Figure 5 shows an example where a participant is required to enter the word $AOW80$ before moving it to the top-left line select key. The amplitude A was characterized as the shortest distance between each key, and the width W was characterized as the minimum of either the height or width of the key (MacKenzie & Buxton, 1992). The participant was instructed to first search and find the necessary keys prior to initiating data entry in order to reduce cognitive effort. This meant that one five letter word that needed to be positioned at a specified line select key constituted five movements with respective A and W values, because the time needed to search for the first key was not recorded.

In order to complete the model, one word consisted of repeated keys in order to determine $M T_{r e p e a t}$ as shown in Equation 4. The set of words and LSKs were carefully chosen to encompass a representative range of indices of difficulty for a realistic LNAV task.

\begin{array}{l} Words = [KLM 19, AET 50, 47 MAY \\ SSSSS, DJS 73, ANW 80] [-] \\ Target LSK = [L 1, L 2, L 6, R 2, R 4, R 5] [-], \end{array}

An accurate technical drawing of the CDU used during the experiment was consulted to calculate A and W and resulted in an ID range of $[1.26, 5.17]$ .

The combinations of words and LSKs provided a total set of 36 different conditions, each of which consisted of five Fitts’ law movements. Therefore, a minimum of 180 Fitts’ law measurements could be made during one set of combinations.

Similar to the MCP experiment, the movement time MT, excluding homing, dwell, and reaction times, was measured in milliseconds. The accuracy, measured as the number of correct inputs divided by the total amount of keystrokes, was measured and used to provide as feedback to participants. Endpoint distributions of the inputs (i.e., finger locations) on the keys could not physically be measured, however. The training consisted of one block of all 36 conditions (in a random order) and the measurement phase featured two blocks of 36 conditions.

TND Condition

The experiment setup is shown in Figure 6. A large touchscreen was installed horizontally on the center pedestal of the SRS cockpit. An illustration of the display presented on the screen is shown in ❶. A white object was shown with a magenta crosshair at its center (see ❷), which could be moved around using touch-based input.

Figure 6.

Experimental apparatus for the touch-based navigation display (TND) experiment, showing an illustration of the touchscreen display and its location on the flight deck.

The target was depicted using a cyan circle (see ❸) with a black crosshair. The distance to be traversed, the amplitude A, and the diameter or width W of the circular target constituted the two variables that were manipulated. A representative and wide variety of variables A and W were selected. Finally, given that literature has found direction to be a confounding factor (Soukoreff & MacKenzie, 2004), a direction “heading” variable ϕ and display rotation variable θ were introduced, as illustrated in Figure 6. The rotation angle θ rotates the entire reference frame of the display.

\begin{array}{l} A = [45, 80, 115, 150] [mm] \\ W = [5, 15, 25, 35] [mm] \\ ϕ = [- 25, 0, 25] [\deg] \\ θ = [0, 90, 180, 360] [\deg] \end{array}

The choice in variables resulted in a total set of 192 different input combinations. However, only 16 $(4 \times 4)$ different combinations of A and W were present due to the use of directional variables. As a result, the ID range was $[1.19, 4.95]$ and thus comparable to the MCP and CDU indices of difficulty. Similar to the MCP and CDU conditions, the TND input combinations were designed to represent inputs that can be expected during a realistic LNAV re-routing task using a touchscreen device.

Consistent with the MCP and CDU experiments, the MT, excluding homing, dwell, and reaction times, was measured in milliseconds. Accuracy was the other dependent variable and was measured by recording physical endpoints of each individual movement. Given the 2D nature of the task, a bivariate endpoint standard deviation $(σ_{x y})$ was used, which has been found to better describe 2D Fitts’ law tasks (Wobbrock, Cutrell, Harada, & MacKenzie, 2008).

For the finger calibration task to calibrate $σ_{x y},$ a fixed diameter magenta target, slightly larger than a typical index finger, with a white crosshair was drawn at a random $(x, y)$ location on the display.

The task setup in both the training and measurement phases were equal. Once the participant was ready, a set of 192 conditions were loaded, and both the object and target were reset to their respective positions. Measurement started when the participant had successfully acquired the object and started to move it. Object acquisition was done by providing a touch input within a touch area equal in size and location of the object. During the experiment, the success rate in acquiring the target was displayed in the control room and communicated to the participant to provide valuable feedback on their adherence to the speed-accuracy trade-off governing Fitts’ law.

Training consisted of 192 runs containing all possible combinations, and the measurement phase features again 192 runs, albeit in a different (randomized) order.

Results

The numerical results of the three interface conditions are summarized in Table 2, the model fits are shown in Figure 7, and the distributions of accuracy values per interface are depicted in Figure 8. For the MCP condition, the proposed adjustment based on accuracy was done by computing the effective width W_e based on the actual distribution of movement endpoints per ID. Based on the effective width W_e, an effective index of difficulty $I D_{e}$ was calculated (circles in Figure 7). An analysis of variance (ANOVA) test revealed a significant effect of ID on MT, $F (4.33, 56.28) = 138.47, p < . 01$ . Compared with the other interfaces, the MCP has the lowest y-intercept and the lowest throughput. In terms of accuracy, an ANOVA reported a significant effect of the interface conditions $F (1.134, 14.74) = 5.623, p < . 05$ . Pairwise comparisons (adopting a Bonferroni correction) only reported a significant difference between the MCP and CDU. From Figure 8, it can be observed that the MCP accuracy scored between the CDU and the TND.

Table 2

The Fitts’ Law Model, Quality of Fit $(R^{2})$ , Mean Accuracy and Throughput of Each Interface

Interface	Fitts’ Law Model (ms)	$R^{2}$	Accuracy (%)	Throughput (Bits/s)
MCP	$M T = 154.8 + 494.7 \cdot I D_{e}$	0.969	96	1.80
CDU	$M T = 337.9 + 91.7 \cdot I D_{e}$	0.835	99	5.20
	$M T_{r e p e a t} = 267.9$
TND	$M T = 212.3 + 180.3 \cdot I D_{e}$	0.879	95	3.88

Note. MCP = mode control panel; CDU = control display unit; TND = touch-based navigation display.

Figure 7.

Final Fitts’ law models of each individual interface plotted on the same graph for comparative purposes.

Figure 8.

Observed accuracy scores per participant per experiment.

For the CDU condition, the proposed adjustment for accuracy was done by computing the effective width W_e based on the error percentages per ID. Based on this effective width W_e, an effective index of difficulty ID_e was calculated (plusses in Figure 7). An ANOVA test showed a significant effect of ID on mean MT, $F (3.76, 48.83) = 37.05, p < . 01$ . The CDU had the highest throughput as well as the highest accuracy. In Figure 8, it can also be seen that the variability in achieved accuracy is smallest compared with the other interfaces.

For the TND condition, the proposed adjustment for accuracy was done by computing the effective width W_e as shown in Equation 6. Based on W_e, an effective index of difficulty ID_e was calculated (crosses in Figure 7). An ANOVA test concluded that there was a significant effect of ID on MT, $F (2.67, 34.67) = 37.20, p < . 01$ . From the dedicated calibration tests, the finger calibration parameter $σ_{a}$ was $3.6 mm$ . According to Figure 8, the TND has the lowest average accuracy as well as the highest spread pattern in accuracy. Although these results were not found to be significant compared to the MCP and CDU, this does show that navigation-type inputs with the TND can be more error prone compared to the conventional flight deck interfaces. In throughput, however, the TND scores better than the MCP.

Discussion

The results of all three interface conditions show that the different variations of Fitts’ law, acquired from literature and introduced in this article, are adequate ways to develop and compare accuracy and throughput models for the MCP, CDU, and a TND. This is illustrated by Figure 7, and the $R^{2}$ fit qualities of 0.97, 0.84, and 0.88 for the aforementioned interfaces, respectively. The good fit for the MCP was a pleasant surprise, given that literature lacks a study looking at the applicability of Fitts’ law to a rotary controller providing discrete input signals.

Furthermore, when scrutinizing the y-intercept parameter (a) of each Fitts’ law model, the CDU indeed results in the largest expected movement time for tasks of zero difficulty, namely 338 ms. The TND follows with 212 ms. Interestingly, the $M T_{r e p e a t}$ of the CDU is different from its y-intercept, despite that both represent zero index of difficulty $(I D = 0)$ . This is, however, consistent with the findings of Soukoreff and MacKenzie (2002), who indicated that the y-intercept for keyboard-entry tasks is a “theoretical” movement time based on linearly regressing data containing inter-key movements ( $I D > 0$ bits) and may therefore not accurately describe realistic movement times for pressing the same key twice. During the CDU trials, it was also observed that a significant amount of force was required to successfully press the keys. Participants were even observed to occasionally continue toward a next key after unsuccessfully hitting the previous one. Furthermore, although participants were requested to search the necessary keys before initiating data entry, the cognitive effort required to find the required keys is expected to still affect the movement time. The lower y-intercepts for the TND and MCP do suggest that they are indeed more direct (and perhaps more intuitive) in their use.

In terms of accuracy, the MCP scores similar to the TND, but the variability is much smaller than for the heading control knob on the MCP. This finding is very similar to that of Stanton et al. (2013), who also compared a rotary controller with a touchscreen. Interesting to note is that even though data were only available from one left-handed participant, these scores were 70% on the TND compared to 96% and 99% on the MCP and CDU, respectively. This suggests that using the traditional interfaces with a non-dominant hand is easier than with a touchscreen. This finding is intriguing, given that the pilot position within the flight deck relative to interfaces is fixed and cannot easily be adjusted by the pilot. Further research on the effect of handedness on flight deck performance is therefore warranted. Following discussions with participants and observations made during the experiment, the accuracy results could also have been attributed to the tactile nature of and the fixed physical locations of the dials and buttons on the traditional interfaces. Due to the lack of tactile feedback and high freedom of movement with the touchscreen, precise inputs were sometimes more difficult to achieve.

Regarding throughput, scores were highest with CDU, followed by the TND and MCP, respectively. According to Figure 7, the TND and CDU result in similar movement times at low ID values (i.e., 1.5) as their Fitts’ law models converge. At higher ID values, however, the lines diverge and the TND is at a disadvantage compared to the CDU. Based on these results it can be said that for a given short time interval, the CDU can handle more difficult tasks compared to the other two interfaces. This may be explained by the calculation of ID, which is defined by the movement amplitude and target width. On the CDU the target width remained constant, given that the keys had a pre-defined size. Hence, the difficulty in movements was reflected in the distance to be moved. Thus, moving a larger distance was observed to be easier than acquiring a very narrow target, which is reflected by Equation 4. In addition, the physical keys on the CDU make it fairly easy to acquire the target successfully. On the contrary, with the MCP and TND, target difficulty varied both by amplitude and width. For the latter, it was observed on both interfaces that a very narrow target slowed down participants and required them to be more accurate. Finally, movement times were found to be substantially longer for the MCP than for the other two interfaces. This may be attributed to the latency and nonlinear movement of the heading control knob noted by several participants. Research by Stanton et al. (2013) also found that use of a rotary controller resulted in longer task times compared to a touchscreen interface.

Although scores with the CDU were highest on both accuracy and throughput, this does not imply that it is therefore the most optimal interface with the FMS. During the experiment, participants were asked to locate the necessary keys prior to key entry to keep cognitive effort at a minimum. Hence, good performance with the CDU reflects that the user is fully aware of the necessary steps to execute. However, during a more complex task, a substantial amount of cognitive effort is expected in determining the necessary actions with the CDU. Thus, the question remains whether, when used during a more realistic navigation task, the CDU is still better than a touch-based interface.

In addition, during a realistic LNAV task, for example, to avoid bad weather, pilots generally use both the CDU and MCP. In most cases, however, pilots will not use these interfaces concurrently. That is, they use the MCP to deviate from the planned route by dialing in a heading to fly around a weather cell and finally use the CDU to fly directly toward the nearest route waypoint when they cleared the weather cell. On one hand, it can be said that our results could shed light on the expected total task difficulty and completion times for realistic flight navigation tasks requiring combined inputs, given the current focus on modeling the accuracy and throughput of the interfaces in isolation. On the other hand, our results may not be as simple as summing the throughput values and task completion times. In combined inputs with multiple interfaces, time delays associated with re-directing hand movements, distributing visual attention over multiple interfaces, time to engage, homing, and dwell will also play important roles. How such combined interactions with two different interfaces at separate locations on the flight deck compare to a TND, and to what extent our obtained Fitts’ models can predict the results of such interactions, is therefore worth exploring further in a follow-on experiment.

Key Points

The accuracy and throughput characteristics of three flight deck interfaces, that is, the MCP, the CDU, and a TND, were accurately modeled with Fitts’ law.

The Fitts’ law analysis showed the CDU as most effective in both accuracy and throughput, which indicates that more difficult tasks can be handled better with the CDU within a short time frame.

Although the Fitts’ law models derived in this research described individual input movements, they may enable improved analysis and prediction of total task difficulty and completion times for realistic flight navigation tasks that would require a series of combined movements.

Footnotes

ORCID iD

Daan M. Pool

Author Biographies

Nout C. M. van Zon is a consultant at Simon-Kucher & Partners. He received his MSc degree (cum laude) in aerospace engineering in 2017 from Delft University of Technology.

Clark Borst is an assistant professor at Delft University of Technology, Faculty of Aerospace Engineering, Department of Control & Operations. He received his PhD degree in aerospace engineering in 2009 from Delft University of Technology.

Daan M. Pool is an assistant professor at Delft University of Technology, Faculty of Aerospace Engineering, Department of Control & Operations. He received his PhD degree (cum laude) in aerospace engineering in 2012 from Delft University of Technology.

Marinus M. van Paassen is an associate professor at Delft University of Technology, Faculty of Aerospace Engineering, Department of Control & Operations. He received his PhD degree in aerospace engineering in 1994 from Delft University of Technology.

References

Accot

Zhai

(1997). Beyond Fitts’ law: Models for trajectory-based HCI tasks. Proceedings of the ACM SIGCHI conference on human factors in computing systems (pp. 295–302). Atlanta, GA. doi:10.1145/258549.258760

Ballas

J. A.

Heitmeyer

C. L.

Pérez-Quiñones

M. A.

(1992). Evaluating two aspects of direct manipulation in advanced cockpits. Proceedings of the ACM SIGCHI conference on human factors in computing systems (pp. 127–134). Monterey, CA. doi:10.1145/142750.142770

Zhai

(2013). FFitts law: Modeling finger touch with Fitts’ law. Proceedings of the ACM SIGCHI conference on human factors in computing systems (pp. 1363–1372). Paris, France. doi:10.1145/2470654.2466180

Bjørneseth

F. B.

Dunlop

M. D.

Hornecker

(2012). Assessing the effectiveness of direct gesture interaction for a safety critical maritime application. International Journal of Human-Computer Studies, 70, 729–745. doi:10.1016/j.ijhcs.2012.06.001

Bulfer

(1991). Big Boeing FMC user’s guide. Kingwood, TX: Leading Edge.

Degani

Palmer

Bauersfeld

K. G.

(1992). “Soft” controls for hard displays: Still a challenge. Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 36, 52–56. doi:10.1177/154193129203600114

Dodd

S. R.

Lancaster

Grothe

DeMers

Rogers

Miranda

(2014). Touch on the flight deck: The impact of display location, size, touch technology & turbulence on pilot performance. Proceedings of the IEEE/AIAA 33rd Digital Avionics Systems Conference (DASC). doi:10.1109/DASC.2014.6979428

Fitts

P. M.

(1954). The information capacity of the human motor system in controlling the amplitude of movement. Journal of Experimental Psychology, 47, 381–391. doi:10.1037/h0055392

Flach

J. M.

Hagen

B. A.

O’Brien

Olson

W. A.

(1990). Alternative displays for discrete movement control. Human Factors, 32, 685–695. doi:10.1177/001872089003200606

10.

Forlines

Wigdor

Shen

Balakrishnan

(2007). Direct-touch vs. mouse input for tabletop displays. Proceedings of the ACM SIGCHI conference on human factors in computing systems (pp. 647–656). San Jose, CA. doi:10.1145/1240624.1240726

11.

Francis

Oxtoby

(2006). Building and testing optimized keyboards for specific text entry. Human Factors, 48, 279–287. doi:10.1518/001872006777724390

12.

Gao

Sun

(2015). Examining the usability of touch screen gestures for older and younger adults. Human Factors, 57, 835–863. doi:10.1177/0018720815581293

13.

Huisman

Verhoeven

R. P. M.

Van Houten

Y. A.

Flohr

E. L.

(1997). Crew interfaces for future ATM. Proceedings of the AIAA/IEEE digital avionics systems conference. Irvine, CA. doi:10.1109/DASC.1997.636189

14.

Hutchins

E. L.

Hollan

J. D.

Norman

D. A.

(1985). Direct manipulation interfaces. Human-Computer Interaction, 1, 311–338. doi:10.1207/s15327051hci0104_2

15.

Jagacinski

R. J.

Fisch

(1997). Information theory and Fitts’ law. In Flach

J. M.

Jagacinski

R. J.

(Eds.), Control theory for humans (pp. 17–26). Mahwah, NJ: Lawrence Erlbaum.

16.

Jagacinski

R. J.

Repperger

D. W.

Ward

S. L.

Moran

M. S.

(1980). A test of Fitts’ law with moving targets. Human Factors, 22, 225–233. doi:10.1177/001872088002200211

17.

Jax

S. A.

Rosenbaum

D. A.

Vaughan

Meulenbroek

R. G. J.

(2003). Computational motor control and human factors: Modeling movements in real and possible environments. Human Factors, 45, 5–27. doi:10.1518/hfes.45.1.5.27226

18.

Kaminani

(2011). Human computer interaction issues with touch screen interfaces in the flight deck. Proceedings of the AIAA/IEEE digital avionics systems conference. Seattle, WA. doi:10.1109/DASC.2011.6096098

19.

MacKenzie

S. I.

(1992). Fitts’ law as a research and design tool in human-computer interaction. Human-Computer Interaction, 7, 91–139. doi:10.1207/s15327051hci0701_3

20.

MacKenzie

S. I.

Buxton

(1992). Extending Fitts’ law to two-dimensional tasks. Proceedings of the ACM SIGCHI conference on human factors in computing systems (pp. 219–226). Monterey, CA. doi:10.1145/142750.142794

21.

Rogers

W. A.

Fisk

A. D.

McLaughlin

A. C.

Pak

(2005). Touch a screen or turn a knob: Choosing the best device for the job. Human Factors, 47, 271–288. doi:10.1518/0018720054679452

22.

SESAR Joint Undertaking. (2014). i4D and SESAR. Retrieved from https://www.sesarju.eu/taxonomy/term/66

23.

Shneiderman

(1982). The future of interactive systems and the emergence of direct manipulation. Behaviour & Information Technology, 1, 237–256. doi:10.1080/01449298208914450

24.

Soukoreff

R. W.

MacKenzie

S. I.

(1995). Theoretical upper and lower bounds on typing speed using a stylus and a soft keyboard. Behaviour & Information Technology, 14, 370–379. doi:10.1080/01449299508914656

25.

Soukoreff

R. W.

MacKenzie

S. I.

(2002, May). Using Fitts’ law to model key repeat time in text entry models. Poster presented at Graphics Interface—GI2002. Retrieved from http://soukoreff.com/academic/GI02-Poster.pdf

26.

Soukoreff

R. W.

MacKenzie

S. I.

(2004). Towards a standard for pointing device evaluation, perspectives on 27 years of Fitts’ law research in HCI. International Journal of Human-Computer Studies, 61, 751–789. doi:10.1016/j.ijhcs.2004.09.001

27.

Stanton

N. A.

Harvey

Plant

K. L.

Bolton

(2013). To twist, roll, stroke or poke? A study of input devices for menu navigation in the cockpit. Ergonomics, 56, 590–611. doi:10.1080/00140139.2012.751458

28.

Stoelen

M. F.

Akin

D. L.

(2010). Assessment of Fitts’ law for quantifying combined rotational and translational movements. Human Factors, 52, 63–77. doi:10.1177/0018720810366560

29.

Stuyven

Damveld

H. J.

Borst

(2012). Concept for an avionics multi touch flight deck. SAE International Journal of Aerospace, 5, 164–171. doi:10.4271/2012-01-2120

30.

Trudeau

M. B.

Udtamadilok

Karlson

A. K.

Dennerlein

J. T.

(2012). Thumb motor performance varies by movement orientation, direction, and device size during single-handed mobile phone use. Human Factors, 54, 52–59. doi:10.1177/0018720811423660

31.

van Marwijk

B. J. A.

Borst

Mulder

van Paassen

M. M

. (2011). Supporting 4D trajectory revisions on the flight deck: Design of a human–machine interface. The International Journal of Aviation Psychology, 21, 35–61. doi:10.1080/10508414.2011.537559

32.

Wobbrock

J. O.

Cutrell

Harada

MacKenzie

S. I.

(2008). An error model for pointing based on Fitts’ law. Proceedings of the ACM SIGCHI conference on human factors in computing systems (pp. 1613–1622). Florence, Italy. doi:10.1145/1357054.1357306