
Abstract

Commercial-Off-The-Shelf (COTS) games and Large Language Models (LLMs) are enabling new empirical paradigms in the study of human-machine teaming (HMT). COTS games that allow modifications lower the barriers to designing and running controlled experimental testbeds, while advances in LLMs have dramatically broadened the scope of possible interaction modes between humans and machines. In this paper, we present the iterative design and development of a Minecraft-based tower defense testbed to investigate the impacts of agent and team composition on HMT performance and team processes. Our study builds on insights from the DARPA Artificial Social Intelligence for Successful Teams (ASIST) program in designing two versions of LLM-enabled teammates that work alongside humans in a task-performer role. We developed our testbed with interactive agents for real-time, action-oriented human-agent teaming. Our focus is on the iterative design and implementation of the testbed, including game design trade-offs between difficulty and performance measurement, as well as our approach to conducting remote experimental data collection.

Keywords

artificial intelligence, machine learning, automation, design, human factors, simulation, gaming, testbed development, large language model, human-AI teaming



Building an LLM-Based Teammate in Minecraft: A Testbed for Human-AI Collaboration
