Abstract
The study investigated the feasibility and reliability of using generative artificial intelligence to conduct heuristic evaluations of workplace instructions. A custom GPT model, fine-tuned with examples and heuristic criteria, was tasked with evaluating aerospace-based work instructions. The AI's output included identified weaknesses and suggested improvements, heuristic scores, and rationales for its decisions and analyses. Results showed that the AI's scoring was consistent and reproducible, but its agreement with human experts was low, largely because of high variability among the human evaluators themselves. Qualitative analysis nonetheless confirmed the AI's ability to identify common weaknesses and offer relevant feedback, in some cases outperforming individual human evaluators. Finally, although the AI generally provided adequate explanations, some lacked detail. The research demonstrates the potential of AI-driven heuristic evaluation to streamline assessment processes and augment human analysis in high-risk industries, while acknowledging the need for ongoing model refinement and improved transparency.
