Sage Journals: Discover world-class research

Abstract

This work investigates using pupil size to track changes in physical fatigue during manufacturing tasks. Participants completed an automotive manufacturing task over three trials. Impulse and peak force were recorded as ground-truth metrics of physical fatigue. Pupil size and blink rate were also recorded by means of a head-mounted eye-tracker. Impulse and peak force decreased between trial 1 and trial 3 suggesting an increase in fatigue. Interestingly, this was accompanied by a significant reduction in pupil size. No difference in blink rate was found. Our study adds to the literature on fatigue assessment suggesting pupil size as a future possible alternative metric for fatigue assessment in manufacturing Ergonomics.

Keywords

pupil size cognitive load eye-tracking fatigue manufacturing ergonomics EMG physical cognitive

Introduction

Physical fatigue increases the risk of injury at work. It is estimated that 69% of the workforces in safety-critical jobs feel tired at work (NSC, 2018). In manufacturing, recent estimates suggest fatigue to increase the amount of task errors by 3 to 10 times (Yung et al., 2020).

Measuring muscle activation or force exertion during the completion of a physical task represent the golden standards for fatigue assessment. As the resources required to complete a sustained activity decrease over time, this typically results in a decline in muscle activity (Enoka & Duchateau, 2008; Ferguson et al., 2013; Tetteh et al., 2020) and a reduction in the force generated (Morin et al., 2011; Potvin, 2012; Vøllestad, 1997).

There is a limited literature that has used alternative metrics for fatigue assessment. For example, the work by Geacintov & Peavler (1974) used pupil size – which measures changes in the pupillary surface of the eye– to track telephone operators’ fatigue during an 8-hour shift. Significant changes were observed with pupil size decreasing as workers became more tired. Fadda and colleagues (2015) utilized a similar approach with heavy machinery workers. Workers were instructed to use a high-fidelity quay crane simulator in a task requiring sorting shipping containers. The operators’ physiological activation – recorded as average heart rate – declined over time, with pupil size also decreasing from the first to the second hour of the study. Xiao-Yan and colleagues conducted a fatigue study on excavator operators (Xiao-Yan et al., 2020). Participants were instructed to operate a real excavator for 2 hours. Oculomotor metrics and self-reported ratings of fatigue were recorded. Consistently with the findings by Geacintov and Peavler (1974), a significant reduction in pupil size was recorded at the end of the job.

The above evidence suggest that pupil size might be sensitive to changes in fatigue during workplace tasks. However, most Human Factors research has adopted this metrics to track changes in human operators’ cognitive workload. Recent work has showed greater pupil size following conditions of higher cognitive task demand in fields like driving, aviation and human-computer interaction (Biondi et al., 2021; Chen & Epps, 2014b; Matton et al., 2020; Pillai et al., 2020; Ramakrishnan et al., 2021). In a recent study, we employed this metric to track changes in workers’ cognitive workload during the completion of automotive assembling operations (Biondi et al., 2023). As cognitive task demand increased, this resulted in greater pupil size.

Our objective with this study is to further investigate using pupil size to track changes in physical fatigue but during the completion of automotive manufacturing tasks. The work by Geacintov & Peavler (1974), Fadda et al. (2015) and Xiao-Yan et al. (2020) had participants complete workplace activities requiring workers maintain a stationary position for the entire task duration. We aim to test the effectiveness of using pupil size to track physical fatigue by having participants complete repeated, full-body operations requiring a wider range of upper-body and lower-body motions. In our study, participants are instructed to complete traditional manufacturing operations wherein their level of fatigue is tracked using force exertion. Pupil size is also measured alongside blink rate using a head-mounted eye-tracker throughout the duration of the manufacturing task.

Methods

Participants

Twenty-four University of Windsor students (13 men, 11 women) were recruited and received a $10 Amazon gift card in exchange for their participation. Their age ranged between 19 and 31 years old (M= 22, SD=2.84) and had no history of severe musculoskeletal injuries (Cort et al., 2013). The research complied with American Psychological Association Code of Ethics and was approved by the University of Windsor Research Ethics Board (#19–065) and Research Safety Committee.

Design

We adopted a factorial design with manufacturing task trial being the only independent factor. Manufacturing task trial (3 levels: trials 1, 2, and 3) was manipulated by having participants complete the manufacturing task a total of three times. Dependent measures were: normalized pupil size (total area in millimeters); peak force (in %MVE); force impulse (in N-sec); blink rate (number of blinks per minute).

Equipment and Procedure

Force and maximum voluntary exertions (MVE)

Upon entering the laboratory, participants were instructed to complete the Standardized Nordic Questionnaire (López-Aragón et al., 2017) to screen for musculoskeletal injuries and provide their demographics information. Participants then began familiarizing with the physical task which consistent in pushing and pulling on a handle positioned at a height of 122cm and attached to a resistance cylinder (see figure 1). This task was chosen as it resembles common automotive manufacturing tasks (Cimino et al., 2009; Zare et al., 2020). During the experiment, participants completed this task ten times (ten pushes + ten pulls) per trial over a three trials.

Figure 1.
Experimental setup. A shows the handle for the push/pull task and the vertically-placed force plate. B shows the participant completing the push/pull task.

A force plate (AMTI-OR6-OP, Advanced Medical Technologies, Inc, Watertown MA, USA) oriented vertically and attached to the back of the resistance cylinder was used to measure the force exerted in each trial. Prior to performing any of the experimental conditions, each participant provided maximum voluntary exertions (MVE) that were recorded for handle push and pull efforts.

Eye-tracker and baseline recording

Participants were also instructed on how to use the Pupil Labs wearable eye-tracker (Pupil Labs GmbH, Berlin, Germany). The eye-tracker uses three cameras: two eye cameras (one for each eye with a 120Hz sampling rate), and one world camera recording from the participant’s perspective. The headset was connected to a desktop computer via a USB cable. A 9-point calibration was conducted by having participants look at a 27-inch Lenovo monitor located approximately 80 centimeters away from the participant. Pupil Capture (v. 3.1.16) was used for the data recording, and Pupil Player (v 3.1.16) was used for data extraction.

Experimental phase

During the experimental phase, participants completed the physical task over three trials. Each trial consisted in participants completing ten pushes and ten pulls on the handle using their maximal force – a practice commonly used in the ergonomics literature (Bailey et al., 2013; Tomezzoli et al., 2022). No rest was provided between trials. Each trial took approximately 42 seconds to complete.

Data processing and analysis

Force. Force signals were analog-to-digital (A/D) converted at a sampling rate of 1000 Hz (USB-6216, National Instruments, Austin, TX), and all digitally converted data were smoothed using a sixth-order Butterworth low-pass filter with a cut-off frequency of 10 Hz in custom LabVIEW software (National Instruments, Austin, TX). While 3 axes of force were collected, we chose to analyze only the force in the “intended direction”, i.e., along the z-axis (push/pull). Peak force was calculated as the percentage of force exerted during each trial relative to the maximum voluntary exertion recorded in the pre-experimental phase (in % MVE). Impulse (N s) was calculated as the force integral with respect to time for each effort, where each effort was determined as the period at which force recorded was greater than zero. RStudio (Racine, 2012) was used for data processing and statistical testing. Missing data were imputed using the Multiple Imputation by Chained Equations (MICE) R library. Unlike traditional means of data imputations, this approach imputes the missing data through an iterative series of predictive models.

Pupil size

Pupil Play (Pupil Labs, Berlin, Germany) was used for the processing of pupil size and blink rate. The detection algorithm uses a 3D model to estimate the size of the pupil. Pupil size was calculated as the diameter of the pupil in millimeters. Values smaller than 2 mm and greater than 8 mm were considered artifacts and removed from the analysis (Binda et al., 2013; Mathôt et al., 2018). The mean pupil size recorded in the baseline condition was used to normalize the values in each trial as follows: (x-μ)/σ, where x is the observed value, μ is the mean in the baseline condition, and σ is the standard deviation in the baseline condition. Mean normalized pupil size for each trial was then calculated.

Blink rate

For blink detection, we adopted a filter length of 0.2 seconds and a confidence threshold onset/offset between 0.5 and 0.3. The onset and offset thresholds are, respectively, the thresholds that the filter response must rise above or fall below to classify the onset and end of a blink (PupilLabs, n.d.). The filter length represents the time interval wherein the blink detector attempts to find confidence drops and gains. These parameters were agreed upon empirically after consulting with the manufacturer. Data also underwent a visual inspection by the research assistant to ensure that no visible anomalies were present. Blink rate was calculated for each trial as the number of blinks per minute. The research assistant also manually coded blink rate to ensure the validity of the PupilPlay output. Many participants showed unusually low blink rates during the baseline recording. With this in mind, and given that analyzing uncorrected blink frequency is common practice in literature (Chen & Epps, 2014a; Faure et al., 2016), we decided not to baseline-correct blink rate.

Statistics

Linear models were adopted. Repeated-measure analysis of variance (ANOVA) were run. Mauchly’s tests of sphericity were conducted to check if the assumption of sphericity was met on the data, in which case Greenhouse-Geisser were applied to ANOVA analyses. Bonferroni-corrected post hoc tests were run to explore differences between pairwise groups.

Results

Force impulse

A repeated-measure ANOVA was conducted on impulse (N-sec). The analysis revealed a significant effect of trial, F(1.28, 30.74)=20.52, p<.05. Post hoc tests revealed significant differences between trials 1 and 2, and 1 and 3, with average impulse decreasing over time (table 1).

Table 1.
Mean impulse (in N-sec), standard error (SE) of impulse (in N-sec), and peak force (in % of MVE) across the three trials.

Trial Mean impulse SE impulse Peak force

1 3586.58 11.14 88.36

2 3243.60 8.28 75.58

3 3109.88 7.55 51.82

Peak force

A repeated-measure ANOVA was conducted on impulse (N-sec). The analysis revealed a significant effect of trial, F(2,48) = 26.82, p<.05. Post hoc tests revealed significant differences between trials 1 and 3 (table 1).

Pupil size

Mean normalized pupil size across conditions is presented in figure 2.

Figure 2.
Average normalized pupil size in trials 1 through 3. Error bars represent standard errors.

A repeated-measure ANOVA was conducted on normalized pupil size data. A significant effect of trial was found F(1.44, 30.35) = 3.42, p<.05. Holm-Bonferroni-corrected post-hoc tests revealed significant differences between trial 1 and trial 2, p<.05, and trial 1 and trial 3, p<.05.

Blink rate

Blink rate across the three trials is presented in figure 3. The repeated-measure ANOVA conducted on blink rate did not reveal a significant effect of trial, F(2, 48) = 1.44, p>.05.

Figure 3.
Average blink rate in trials 1 through 3. Error bars represent standard errors.

Discussion

A reduction in exerted force was observed over time. In particular, impulse declined from 3586.58 N-sec in trial 1 to 3109.88 N-sec in trial 3. Consistently with this pattern, peak force recorded as % MVE also decreased from 88.36% in trial 1 to 51.82% in trial 3.

With respect to our main goal, the observed reduction in exerted force was also accompanied by a significant decline in normalized pupil size. Pupil size decreased by approximately 0.3 millimeters, a pattern that is consistent with what found in similar studies (Khairat et al., 2020). This findings is key in that it adds to the limited literature showing smaller pupil size under conditions of physical fatigue. The work by Fadda et al. (2015) and by Xiao-Yan et al. (2020) observed this pattern with workers largely maintaining a sitting posture during the completion of workplace tasks. Our volunteers, instead, were engaged in manufacturing operations requiring the use of upper and lower body, and core muscles to stabilize the body while producing the push and pull efforts. This datum is interesting in that it proves that our eye-tracking equipment was able to pick up even relatively subtle differences in the workers’ pupillary response during the execution of such physically taxing task.

Blink rate analyses showed no difference in blink frequency over time. Blink rate is commonly used to measure tiredness, especially during prolonged driving or piloting tasks (FAA, 1998; Navastara et al., 2020). It is possible that the relatively short duration of our manufacturing task (42 seconds per trial on average) might have been insufficient to elicit significant changes in blink behavior. We posit that longer task durations may be necessary for blink frequency to serve as a reliable metric of fatigue.

Our study advances the use of pupil size as a potential future alternative for fatigue assessment. While our findings shed some light on the sensitivity of oculomotor metrics in tracking physical fatigue, there are outstanding questions that are left to be tackled. Our participants performed manufacturing tasks only for short periods of time. Future research should consider adopting a similar methodology with workers completing assembling tasks in more ecological, realistic settings. Future research should also further investigate the neurophysiological link between fatigue and the selective changes in pupil dilation.

These lingering questions aside, our exploratory findings add knowledge to the field of fatigue assessment in manufacturing. While our findings are by no means definitive and necessitate further investigation, we envision a future where, similarly to state monitoring systems in transportation (Ryan et al., 2021; SmartEye, 2020), the available machine vision and camera technology support more seamless, contact-less eye-tracking-based approaches for fatigue assessment.

Trial	Mean impulse	SE impulse	Peak force
1	3586.58	11.14	88.36
2	3243.60	8.28	75.58
3	3109.88	7.55	51.82

Footnotes

Acknowledgements

We acknowledge the generous contribution from Atlas Copco Inc. and Mitacs. We also thank SSHRC, NSERC, and WE-SPARK Health Institute for their valuable support.

References

Bailey

Sato

Alexander

Chiang

C.-Y. H.

Stone

(2013). Isometric force production symmetry and jumping performance in collegiate athletes. Journal of Trainology, 2(1), 1–5. https://doi.org/10.17338/trainology.2.1_1

Binda

Pereverzeva

Murray

S. O.

(2013). Pupil constrictions to photographs of the sun. Journal of Vision, 13(6), 1–9. https://doi.org/10.1167/13.6.8

Biondi

F. N.

Balasingam

Ayare

(2021). On the Cost of Detection Response Task Performance on Cognitive Load. Human Factors, 63(5), 804–812. https://doi.org/10.1177/0018720820931628

Biondi

F. N.

Saberi

Graf

Cort

Pillai

(2023). Distracted worker : Using pupil size and blink rate to detect cognitive load during manufacturing tasks. Applied Ergonomics, 106(August 2022), 103867. https://doi.org/10.1016/j.apergo.2022.103867

Chen

Epps

(2014a). Human – Computer Interaction Using Task-Induced Pupil Diameter and Blink Rate to Infer Cognitive Load Using Task-Induced Pupil Diameter and Blink Rate to Infer Cognitive Load. 0024. https://doi.org/10.1080/07370024.2014.892428

Chen

Epps

(2014b). Using task-induced pupil diameter and blink rate to infer cognitive load. Human-Computer Interaction, 29(4), 390–413. https://doi.org/10.1080/07370024.2014.892428

Cimino

Longo

Mirabelli

(2009). A multimeasure-based methodology for the ergonomic effective design of manufacturing system workstations. International Journal of Industrial Ergonomics, 39(2), 447–455. https://doi.org/10.1016/j.ergon.2008.12.004

Enoka

R. M.

Duchateau

(2008). Muscle fatigue: What, why and how it influences muscle function. Journal of Physiology, 586(1), 11–23. https://doi.org/10.1113/jphysiol.2007.139477

FAA. (1998). A Valid Psychophysiological Measure of Alertness As Assessed by Psychomotor Vigilance. In Tech Brief.

10.

Fadda

Meloni

Fancello

Pau

Medda

Pinna

Del Rio

Lecca

L. I.

Setzu

Leban

(2015). Multidisciplinary Study of Biological Parameters and Fatigue Evolution in Quay Crane Operators. Procedia Manufacturing, 3(Ahfe), 3301–3308. https://doi.org/10.1016/j.promfg.2015.07.410

11.

Faure

Lobjois

Benguigui

(2016). The effects of driving environment complexity and dual tasking on drivers’ mental workload and eye blink behavior. Transportation Research Part F: Traffic Psychology and Behaviour, 40, 78–90. https://doi.org/10.1016/j.trf.2016.04.007

12.

Ferguson

S. A.

Allread

W. G.

Rose

Marras

W. S.

(2013). Shoulder muscle fatigue during repetitive tasks as measured by electromyography and near-infrared spectroscopy. Human Factors, 55(6), 1077–1087. https://doi.org/10.1177/0018720813482328

13.

Geacintov

Peavler

W. S.

(1974). PUPILLOGRAPHY IN INDUSTRIAL ASSESSMENT. Journal of Applied Psychology, 59(2), 213–216.

14.

Khairat

Coleman

Ottmar

Jayachander

D. I.

Bice

Carson

S. S.

(2020). Association of Electronic Health Record Use with Physician Fatigue and Efficiency. JAMA Network Open, 3(6), 1–13. https://doi.org/10.1001/jamanetworkopen.2020.7385

15.

López-Aragón

López-Liria

Callejón-Ferre ángel

Gómez-Galán

(2017). Applications of the standardized nordic questionnaire: A Review. Sustainability (Switzerland), 9(9), 1–42. https://doi.org/10.3390/su9091514

16.

Mathôt

Fabius

van Heusden

Van der Stigchel

(2018). Safe and sensible baseline correction of pupil-size data. Behavior Research Methods, 94–106. https://doi.org/10.7287/peerj.preprints.2725

17.

Matton

Paubel

P. V.

Puma

(2020). Toward the Use of Pupillary Responses for Pilot Selection. Human Factors. https://doi.org/10.1177/0018720820945163

18.

Morin

J. B.

Samozino

Edouard

Tomazin

(2011). Effect of fatigue on force production and force application technique during repeated sprints. Journal of Biomechanics, 44(15), 2719–2723. https://doi.org/10.1016/j.jbiomech.2011.07.020

19.

Navastara

D. A.

Putra

W. Y. M.

Fatichah

(2020). Drowsiness Detection Based on Facial Landmark and Uniform Local Binary Pattern. Journal of Physics: Conference Series, 1529(5), 0–9. https://doi.org/10.1088/1742-6596/1529/5/052015

20.

NSC. (2018). 69% of Employees, Many in Safety-critical Jobs, are Tired at Work. https://www.nsc.org/in-the-newsroom/69-percent-of-employees-many-in-safety-critical-jobs-are-tired-at-work-says-nsc-report

21.

Pillai

Ayare

Balasingam

Milne

Biondi

(2020). Response Time and Eye Tracking Datasets for Activities Demanding Varying Cognitive Load. Data in Brief, 106389. https://doi.org/10.1016/j.dib.2020.106389

22.

Potvin

J. R.

(2012). Predicting maximum acceptable efforts for repetitive tasks: An equation based on duty cycle. Human Factors, 54(2), 175–188. https://doi.org/10.1177/0018720811424269

23.

Racine

J. S.

(2012). RStudio: A Platform-Independent IDE FOR R And Sweave. Journal of Applied Econometric, 27(1), 167–172. https://www.jstor.org/stable/41337225?seq=1#metadata_info_tab_contents

24.

Ramakrishnan

Balasingam

Biondi

(2021). Cognitive load estimation for adaptive human–machine system automation. Learning Control, 35–58. https://doi.org/10.1016/b978-0-12-822314-7.00007-9

25.

Ryan

O’Sullivan

Elrasad

Cahill

Lemley

Kielty

Posch

Perot

(2021). Real-time face & eye tracking and blink detection using event cameras. Neural Networks, 141(2021), 87–97. https://doi.org/10.1016/j.neunet.2021.03.019

26.

SmartEye. (2020). Driver Monitoring ( DMS ) on its way to becoming mandatory in vehicles around the world. https://smarteye.se/blogs/driver-monitoring-dms-on-its-way-to-become-mandatory-in-vehicles-around-the-world/

27.

Tetteh

Sarker

Radley

Hallbeck

M. S.

Mirka

G. A.

(2020). Effect of surgical radiation personal protective equipment on EMG-based measures of back and shoulder muscle fatigue: A laboratory study of novices. Applied Ergonomics, 84(January), 103029. https://doi.org/10.1016/j.apergo.2019.103029

28.

Tomezzoli

Fréchède

Duprey

(2022). Slouched and erect sitting postures affect upper limb maximum voluntary force levels and fatiguability: a randomized experimental study. IISE Transactions on Occupational Ergonomics and Human Factors. https://doi.org/10.1080/24725838.2022.2110544

29.

Vøllestad

N. K.

(1997). Measurement of human muscle fatigue. Journal of Neuroscience Methods, 74(2), 219–227. https://doi.org/10.1016/S0165-0270(97)02251-6

30.

Xiao-Yan

Kai

Chun-Peng

Hong-Xuan

Chun-Yuan

(2020). Analysis of pupil size amplitude signal in field fatigue detection. Proceedings - 2020 7th International Conference on Information Science and Control Engineering, ICISCE 2020, 302–305. https://doi.org/10.1109/ICISCE50968.2020.00071

31.

Yung

Kolus

Wells

Neumann

W. P.

(2020). Examining the fatigue-quality relationship in manufacturing. Applied Ergonomics, 82(March 2019), 102919. https://doi.org/10.1016/j.apergo.2019.102919

32.

Zare

Black

Sagot

J. C.

Hunault

Roquelaure

(2020). Ergonomics interventions to reduce musculoskeletal risk factors in a truck manufacturing plant. International Journal of Industrial Ergonomics, 75(December 2019), 102896. https://doi.org/10.1016/j.ergon.2019.102896

Testing Pupil Size as a Possible Alternative Metric of Physical Fatigue in Automotive Manufacturing Tasks

Abstract

Keywords

Introduction

Methods

Participants

Design

Equipment and Procedure

Force and maximum voluntary exertions (MVE)

Eye-tracker and baseline recording

Experimental phase

Data processing and analysis

Pupil size

Blink rate

Statistics

Results

Force impulse

Peak force

Pupil size

Blink rate

Discussion

Footnotes

Acknowledgements

References