Abstract
This article explores the complexity of team interactions, emphasizing their multimodal layering of verbal, paraverbal, and nonverbal cues. It highlights the potential of AI, particularly social signal processing, for understanding and enhancing team dynamics. Future research should embrace genuine interdisciplinary collaboration that combines expertise from social and computer science to address the messiness of real team interactions.
Introduction
Whether groups collaborate in face-to-face, hybrid, or fully virtual settings, and with or without the help of AI tools—this basic tenet still holds true and will continue to keep us busy for at least another decade of groups and teams research: “for members to achieve the collaboration and interdependence that make them a group rather than co-present individuals, they must interact” (Bonito & Sanders, 2011, p. 343). To me, taking this notion seriously means that we need to study actual group interactions, not subjective individual experiences or emergent states or group “processes” that are then captured using static self-report surveys.
So, the question of what the next decade of teams research will look like made me think of AI, sure, but also about finding new ways to tackle seriously messy social interaction data in teams and other interesting interpersonal constellations. Bear with me as I invite you to picture my kids’ bedroom for an illustration (I promise this will make sense in a bit). When my 5-year-old looks for a toy or an essential piece of Lego, this tends to have an explosive effect, both physically and emotionally. He dismantles every available Lego structure, empties all the boxes, and mixes his toys in an apparently random, but rapid manner into a wild soup of shapes, colors, and materials. The toy is typically not found until an adult (usually me) helps him dissect the chaos. This situation is further complicated by my 8-year-old offering to “help”. By the time the little Lego villain/plastic part of Optimus Prime’s leg/monster truck is found, the paraverbal signals in the room have escalated significantly and I am secretly counting the days until my next escape to the INGRoup conference.
Multimodal Team Interactions
Digging through team interaction data is decidedly more fun for me than digging through toy soup. But there are a few common elements. Like my kids’ bedroom after several hours of intense play, team processes are fascinating, dynamic, complex, frequently unpredictable, and seriously messy phenomena—especially when we study real teams in the wild (see also Klonek, 2026). Like the moment of actually locating the missing piece of Lego, identifying a micro-level behavioral mechanism that explains successful team collaboration can feel like a serious triumph. And like first combing through the top layer of apparently random mess and starting to group similar toys (monster trucks on one pile, pirate gear on another pile), quantitative team interaction research also often starts with identifying broader categories of team behavior (e.g., problem-solving versus relational communication), then narrowing these down to more specific types of behaviors within each category. Indeed, this is what my colleagues and I have been doing for much of the past decade, trying to understand the interaction behaviors and patterns underlying successful teamwork (e.g., in this journal: Allen & Lehmann-Willenbrock, 2024; Kauffeld & Lehmann-Willenbrock, 2012; van der Meer et al., 2022).
Notably, all of these SGR examples focus on one modality of team interaction behavior: speech, analyzed as functions of various verbal statements. Focusing on only one modality, however, is like only looking at the vehicles in my kids’ bedroom, while ignoring all the other fun items and thus probably never finding the missing but essential piece of Lego.
Team interactions are beautifully messy multimodal puzzles. Especially when groups collaborate face-to-face, but also in virtual settings, their interaction is a multilayered composition of verbal statements (e.g., “Hey, good idea!”), paraverbal cues (e.g., the voice pitch that accompanies “Hey, good idea!”), and nonverbal cues (e.g., the accompanying facial expressions and gestures). Analyzing all of these cues simultaneously may make your head hurt; however, this reflects the true complexity of real interactions in groups and teams, and therefore, we need to account for it in our research.
Leveraging AI to Understand Multimodal Group Interactions
As one way to address this complexity and implement a “high-resolution” approach to team interaction (Klonek et al., 2019), social signal processing holds great promise. Social signal processing is a subdomain of computer science that uses sensing methodology (e.g., cameras, microphones, individual movement trackers) and machine learning to model, analyze, and synthesize so-called social signals in human as well as human-machine interactions (Vinciarelli, 2017). The core idea is to automatically extract behavioral cues from the sensor data (e.g., automatically trace individual movement) and then train machine learning models to predict meaningful behaviors from those cues. This work process essentially still requires human annotators in order to establish a “ground truth” for a machine learning model—especially when the model is tasked with predicting dynamic group phenomena, compared to the relatively simpler task of automatically detecting individual members’ behavioral conduct (e.g., individual members’ overall dominance in a group interaction; Bai et al., 2019).
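To make this work process concrete, here is a minimal, purely illustrative sketch in Python of the three steps just described: extracting behavioral cues from sensor data, collecting human-annotated ground truth, and training a machine learning model to predict the annotations from the cues. All features, labels, and data below are synthetic stand-ins and do not reflect the actual features or models used in any of the cited studies:

```python
# Illustrative sketch of the social signal processing workflow:
# (1) behavioral cues extracted from sensor streams, (2) human annotators
# supply "ground truth" labels, (3) a model learns to predict those labels.
# All data here are synthetic; the feature names are hypothetical.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)

# Step 1: pretend these cues were extracted from cameras, microphones, and
# movement trackers (e.g., speaking time, mean voice pitch, body movement).
n_segments = 200
cues = rng.normal(size=(n_segments, 3))  # columns: speaking_time, pitch, movement

# Step 2: human-annotated ground truth per interaction segment
# (e.g., 1 = "dominant behavior observed", 0 = not) -- simulated here as
# depending mostly on speaking time and movement, plus annotator noise.
labels = (cues[:, 0] + 0.5 * cues[:, 2]
          + rng.normal(scale=0.5, size=n_segments) > 0).astype(int)

# Step 3: train a model to predict the human annotations from the cues,
# holding out some segments to check how well the predictions generalize.
X_train, X_test, y_train, y_test = train_test_split(cues, labels, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X_train, y_train)
print(f"held-out accuracy: {accuracy_score(y_test, model.predict(X_test)):.2f}")
```

In an actual pipeline, step 1 would rely on dedicated feature-extraction tooling rather than random numbers, and step 2 on trained human coders; the sketch only mirrors the logical structure of the workflow, including why the human annotation step remains indispensable for establishing ground truth.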
For example, in two recent interdisciplinary collaborations, we applied social signal processing to detect dynamic cohesion in team meetings (Lehmann-Willenbrock & Hung, 2024) and to model convergence and divergence in group affect (Prabhu et al., 2025). What I hope to illustrate with these two examples is how interdisciplinary efforts that leverage social signal processing can provide much more fine-grained, complex, multimodal empirical analyses and theory testing than previously possible. There is a wealth of untapped potential for interdisciplinary collaborations leveraging social signal processing (for an overview, see Kozlowski et al., forthcoming). Once this research area becomes more populated by interdisciplinary efforts, “killer apps” (Buengeler et al., 2017) for understanding—and eventually enhancing—group processes become more likely. Of note, if we want to collaboratively build the basis for such killer apps, we need to be willing to invest serious time and energy into true interdisciplinary efforts. For example, inserting off-the-shelf AI methodology into group research projects, such as using large language models to facilitate analyses of group and team interactions (for an overview, see Kush et al., 2025), will only get us so far and cannot address the multimodal nature of group and team interactions. Interdisciplinary research projects that really push the frontier at the intersection of group process research and computer science need to be mutually beneficial and move away from producer-consumer types of collaboration (for more detailed discussions, see Allen et al., 2017; Lehmann-Willenbrock & Hung, 2024).
Around the Corner (or a Little Further): AI to Enhance Multimodal Group Interactions
The not-so-new, but continuously relevant quest to study interactions as core mechanisms of collaboration in groups and teams (e.g., Bonito & Sanders, 2011; Keyton, 2017) also has implications for the potential of AI to eventually function as a group member. Human-AI synergy is a neat but frequently not achieved idea (e.g., Vaccaro et al., 2024), and certainly no small feat when you consider the complexity of group interactions. AI should eventually be able to insert itself seamlessly into group interactions and to understand as well as synthesize complex group interaction behavior. Synthesis in this context means that an AI would be able to understand and respond to multimodal signals by group members just like a human would. Though the advance of intelligent virtual agents in virtual reality settings promises new opportunities for multimodal agentic AI, we are decidedly so not there yet (and maybe that’s a good thing). But if we want to get there, the community of groups and teams researchers needs to frequently and happily mingle with the social signal processing crowd (a partnership we called “geeks and groupies” a while ago; Lehmann-Willenbrock et al., 2017). From my own experience, I can report that this can be a ton of fun, while challenging a lot of the often implicit assumptions of our own disciplines (in my case, a tendency to obsess about constructs; debatable templates for a “research contribution”; etc.) and frequently owning up to feeling dazed and confused (e.g., when terms such as “coding” and “modeling” can mean very different things; or when discussing feature extraction and combination possibilities).
As a final thought and point of caution, AI may also distract us even more than we often already are, especially in remote group collaborations where multitasking is a serious challenge (Cao et al., 2021). Moreover, increasingly habitual AI usage binds group member attention that is then not available to focus on the group interaction. This potentially also challenges the group work skills of the generations that follow. I think we can avoid this by remembering that AI should serve humans, not the other way around; by teaching our students how to balance AI usage; and by cherishing real-life, messy group interactions, even when they come in the shape of crazy toy soups (remind me later).
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
