Abstract
Background and objectives
Communication is a complex skill and key component for professionals in healthcare. Complex skills are best learned through experiential learning like role-plays or simulated patient encounters. The aim of the present study is to explore how students and lecturers assess the conditions under which the use of an artificial intelligence (AI)-based feedback system can promote the learning process.
Methods
An interview study with health profession students and lecturers was conducted using a qualitative descriptive design. Recorded audio data was transcribed and evaluated by structuring qualitative content analysis. The research process was conducted and continually reflected by an interprofessional research team. Ethical approval was obtained.
Results
Using qualitative content analysis, 4 major themes were identified. These are “conditions,” “communication scenarios,” “AI-based learning platform” and “debriefing.” Lecturers and students welcome the idea of AI providing output on verbal and para-verbal aspects. To implement AI-based output into a teaching programme AI functionality should be adaptable to the specific situation. Lecturers and students highlighted that AI could be particularly valuable for speech qualities which are often difficult for humans to assess. AI could give freedom to focus on additional aspects of the conversation by documenting desirable speech qualities. Lecturers and students prefer for the AI-based output to be given at the end of rather than within the role-play. Furthermore, they wish for communication scenarios to be analysed repeatedly in order to track progress.
Conclusion
Practicing in a safe environment and receiving competent credible feedback, with lecturers trained in facilitation, is a pre-requisite for the entire learning progress. The integration of an AI-based feedback system should be characterised by both flexibility of the AI application and standardisation of the communication scenario.
Keywords
Introduction
Communication skills are essential for healthcare professionals’ ability to deliver patient-centred care. Empathetic conversations foster positive relationships between healthcare professionals and patients 1 and contribute to the success of therapy. 2 The best way to acquire communication skills is to train in a simulated environment as complex skills like communication are best learned through experiential learning. 3 Evidence suggests that communication skills training improves students’ communication abilities4,5 and equips them with enhanced communication skills for clinical practice. 6
As a result, simulation-based medical education has gained prominence in healthcare profession programmes, including those at the University of Lübeck. The University of Lübeck offers 9 health degree programmes, which provides an ideal environment for patient-oriented and interprofessional learning, as well as the acquisition of communication skills
The project LABORATORIUM comprises several phases. The analysis presented in this article preceded the ELSI workshops (Ethical, Legal and Socio-cultural Implications) and serves as the foundation for the development of CoSy. It is widely acknowledged that technology providers should align their developments with user intentions in order to enhance acceptance. 15 Accordingly, the primary objective was to define criteria for a system that is both highly usable and capable of delivering a satisfactory user experience, thereby facilitating the effective integration of CoSy into educational settings.
To support this objective, the present study explores the experiences and perceptions of both students and lecturers regarding the potential implementation of an AI-based feedback system. In line with the goals of a comprehensive requirement analysis, this study aims to gain insights into users’ perspectives on possible applications of the system, as well as their expectations for its use –particularly within emotionally challenging learning environments.
From the students’ and lecturers’ perspectives
What are the required pre-requisites for integrating an AI-based learning platform into communication training in healthcare programmes? How should the communication scenarios be designed? Which speech qualities are considered useful when integrating an AI-based learning platform? In what way should the output of the AI-based learning platform be didactically embedded in the communication situation?
Methods
Prior to the start of data collection, the interview guide was pretested in 2 interviews, and minor adjustments were made, especially regarding the order of the questions and the openness of the wording. Data from these 2 pilot interviews were used solely to refine the interview guide and were not included in the subsequent data analysis. Socio-demographic data of the participants was collected using a structured data collection sheet. The interviews took place between May and August 2022. To create a trusting atmosphere and avoid socially desirable answers, focus groups were conducted by members of the interprofessional research team from differing programmes (including the authors HB, AD and MMK). All interviews were audio-recorded. Following the data collection, the interviewers prepared a protocol (interview postscript).
Results
Twenty-seven health professionals participated in this study: 7 lecturers and 20 students (Table 1). A total of 8 health teaching modules (mono- and interprofessional) and 7 health professions, respectively: medicine with 3 teaching modules: general practice (M1), emergency medicine (M2) and medical psychology (M3), and nursing sciences, midwifery sciences, occupational therapy, physiotherapy, and speech and language therapy as well as clinical psychology (each represented in at least one teaching module). In addition to the medical modules, there were also the following mono-professional modules: the physiotherapy module on pain therapy (M4) and the midwifery module on communication and counselling (M5). The paediatrics (M6) and dementia module (M7) were interprofessional. Clinical psychology (M8) is a future module in a new study programme in which no teaching took place at the time of data collection. It was therefore not possible to conduct a focus group or expert interview. CoSy is to be used in this module in the future. There were no focus groups for modules M1 and M3 as there were no students willing to participate. For every other module a focus group was conducted in which students, who had already completed the module, took part and an expert interview was held with a lecturer, who taught the module.
Characteristics of Study Population.
The 7 individual interviews with the lecturers lasted an average of 71 min (ranging from 59 to 101 min), while the 5 focus group interviews lasted an average of 112 min (ranging from 95 to 130 min). Overall, a total of 17 h and 35 min of audio has been recorded.
Results in this article specifically focus on AI-related findings and revealed 4 major themes which were addressed in the interviews and focus groups. The 4 major themes are ‘conditions’, ‘communication scenarios’, ‘AI-based learning platform’ and ‘debriefing’ (Figure 1). The analysis of the interviews shows that the participants have confidence that the AI output will have the potential to deepen the learning process and to complement the debriefing. This shows that CoSy is not perceived as an alien piece of the puzzle. The optimal framework conditions and pre-requisites for the communication training became apparent. They are summarised under the topic ‘conditions’. Analysis shows that the following presented 3 individual parts ‘communication scenarios’, ‘AI-based learning platform’, and ‘debriefing’ influence each other and must be well harmonised.

Identified Major Themes.
Conditions
The participants assume that the use of CoSy has an impact on the achievement of communication skills. They have the understanding that CoSy can record and analyse what is being said. According to the interviewees, CoSy needs to have information about a conversation in order to be able to analyse what is being said in a dialogue. Examples of important contextual information was mentioned, for example, the age of the patient, the room in which the conversation takes place and the time frame.
Communication Scenarios
This theme outlines the essential pre-requisites and preferences of interviewees regarding the design of communication scenarios in general. Additionally, the limitations perceived by the interviewees concerning the use of CoSy are described.
AI-based Learning Platform – CoSy
The ideas on what special functions and characteristics CoSy must have, what CoSy should look like, and how the system should be accessed in detail vary. There is an agreement that it must be accessible to both students and lecturers.

On Which Speech Qualities Would You Like to Have Artificial Intelligence (AI) Feedback – Lecturer.

On Which Speech Qualities Would You Like to Have Artificial Intelligence (AI) Feedback – Students.
The need to repeat communication training creates requirements for the function of CoSy. Students and lecturers suggest that CoSy should document the speech qualities and the development of learning over time. CoSy output should be stored to enable students to compare their CoSy output parameters of current communication with CoSy output parameters of previous communication situations. This additionally implies the existence of a user profile, so that the students gain access to their data. This proposal applies to both the comparison of the own individual performance and the comparison with a reference group. Interviewees suggest that the reference group should consist of students from the same profession or students who have a similar level of learning (from beginners to experts).
Debriefing
Role-play scenarios are commonly framed by briefing and debriefing phases as stated by Eppich and Cheng 20 or Zigmont. 21 In this current article the ‘learning conversations’ 7 following the role-play scenarios are called ‘feedback’ or ‘debriefing’ interchangeably as the interviewees did not differentiate between these terms. Debriefing is considered important by the interviewees, also quite independently of CoSy, and takes up a large part of the narratives. Students in particular share many of their experiences.
Discussion
The interviews have given insight into the experiences and perceptions of students and lecturers in communication training in higher education. The 4 research questions were answered in varying degrees of depth. There are clear indications that the acceptance of CoSy depends on specific framework conditions that affect the entire communication skills training. The acceptance particularly depends on how the CoSy output will be handled. (It is a tool that must be applied in accordance with its intended use.) Overall, the participants are very open to the use of CoSy as a supplementary tool in communication skills training. Without stating it directly and from the perspective of quantitative criteria, the ideas and perceptions of the interviewees show that participants assume that the CoSy output will meet the quality criteria of objectivity and reliability. The issues raised by the interviewees are therefore not related to the quality of the CoSy output, but mainly to the nature of the output and the framework in which the output is embedded.
The first question, ‘What are the required prerequisites for integrating an AI-based learning platform into communication training in healthcare programmes?’ led to 3 main requirements. These requirements can be simply interpreted to mean that CoSy is an addition and not a replacement. Firstly, the social interaction with a demand for an appreciative and respectful attitude. As the most mentioned and experienced communication scenarios were lecturer-guided, the attitude of the lecturers rather than that of the students was the subject of discussion. These finding is in accordance with previous research literature since the most used and most studied method for simulation debriefing is the facilitator-guided method and not the self-guided method.
22
Students expressed a desire for emotional support from their lecturers, including encouragement when their communication skills are still developing. This aligns with empirical findings, as numerous studies have consistently demonstrated that learning processes are positively
Secondly, the analysis shows that participants wish to be guided through the debriefing process. This was already mentioned while talking about the communication scenarios but was demanded more clearly for the debriefing phase. Our results are consistent with literature which stated that it is important that debriefers or facilitators be aware of how they communicate and which questions they ask to promote optimal learning.24,25 The interviewees themselves have mentioned the idea of using a guideline during the debriefing. A debriefing tool, a scripted guide, seems to be an optimal tool to help to transform experience into learning. It would support both, the high level of facilitation by following a defined structure and a high student-centred reflection by using (preformulated) phrases. Students in our interviews provide valuable insights and clearly demand that they want to be involved in the process and participate actively. This agrees with theory which describes that reflection is an essential learning component of debriefing 26 and that students should be iteratively engaged in the reflective cognitive work. 27
Thirdly, even though an evaluation of the reported speech qualities by CoSy would be possible, and was mentioned as an idea in the interviews, the interpretation of the data, the cognitive work, should be carried out by the participants. It is much more likely that communication training becomes even more demanding when the implementation of CoSy output is added since this introduces objective data. It is not a question of whether one perceived a speech rate to be fast or slow, because the speech rate can now be quantified. This could lead to taking the CoSy output as a diagnosis, as a fact which does not need to be discussed. Against the background of the interviews, however, CoSy output should be interpreted more as an instrument that gives the participants clues. Just as an X-ray gives clues about a fracture, the CoSy output gives clues about our speech behaviour. However, in both examples, the way one deals with the information received must be interpreted individually: What do I do with the data supplied, how do I interpret it? Is my fast speech rate appropriate or do I want to take care and speak more slowly in similar situations in the future? It seems to be all about the right questions. By exchanging and engaging with other people's knowledge and thinking, ones’ views and the perception of the communication situation can be broadened, with CoSy output being a piece of knowledge. Lecturers and students emphasise that CoSy is an additional and complementary tool and cannot replace existing communication training due to the non-verbal and emotional aspects in communication. These are highly complex and individual, so that, according to the interviewees, corresponding AI-based derivations alone are viewed sceptically. We assume that CoSy output can be used to deepen and to stimulate the discussion and a stimulating discussion in turn leads to the lecturer being able to retreat. Fanning and Gaba 28 already formulated this paradox our interviewees described: A high level of facilitation implies a low level of involvement by the facilitator.
To answer the second question, many elements of the design of the communication scenarios could be identified. The participants had positive experiences with both actors and peers, but in the interest of realism, lecturers and students prefer the involvement of actors. For the communication situations themselves, the integration of the CoSy is more easily imaginable in standardised situations which are more predictable in their structure. Closely related to this, participants state the need to repeat communication training sessions to make skills development visible. Furthermore, standardised situations would have the advantage of higher comparability, when learning trajectories are to be presented idiosyncratically.
This in turn presupposes a capability on the part of CoSy, namely the data storage, which leads to many ethical questions like to whom does the data belong, how long and where should it be stored? In this context, the question of who has access to the CoSy output must be answered not only for data storage in the long run, but also be answered directly for the didactic embedding during the debriefing phase. Do all involved, peers, lecturers and actors, have access to the output, does the individual output belong to the actors or is the lecturer the one who introduces the output in a suitable place and puts it up for discussion during debriefing? These are questions that need to be discussed in the future ELSI workshops. The interviews showed that the experiences to date were more lecturer-guided than peer-guided. In this current study mainly lecturers were the ones keeping an eye on the target. This suggests that the CoSy output in the debriefing process should be provided by the lecturers. On the other hand, the students emphasised that they want to be actively involved in the process, from which it could be concluded that the actors themselves decide which speech qualities should be discussed in the debriefing and when. Although experiences were mentioned and shared by the lecturers, information on applied methods was vague. Very little mention was made on how the CoSy output should be didactically embedded in the communication training (question 4). This question could not be answered sufficiently and needs to be sensitively tried out in practice and evaluated in future training sessions.
The amount of time needed for the whole training depends on different factors. This is an important aspect, as elements that were repeatedly rated negatively by students were time constraints. An experienced lecturer may be highly efficient in the appropriate timing and use of questions in contrast to a novice. Lecturers might place different emphasis on self-guided discussions in contrast to lecturer focused discussions, with learner-centred discussions requiring more time. Overall, communication training needs time. As aforementioned CoSy is seen as an additional piece of the puzzle. For the framework of future skills training with CoSy, interpretation of interviews leads to the assumption that more time will be needed at various points for teaching and for courses themselves. For this, it cannot be assumed that the use of CoSy will save time, quite the contrary.
The thought of conversation optimisation by CoSy reflects the high level of openness towards CoSy implementation. This, in connection with the desire and demand for a safe learning environment, means that care must be taken, especially on the part of lecturers, to handle this output responsibly. The implementation of CoSy requires high quality and reliability of AI analysis. To prevent the use of false data and produce a best didactical result CoSy must quantify and explain its output reliably. It is recommended to integrate cognitive forcing functions that provoke analytical thinking. 29 The CoSy output should be used to track the learning objective and to report back relevant speech qualities. The lecturers should see themselves as co-learners. CoSy output should not be used, and this was clearly stated by students, to measure performance.
Even though the interviewees were asked about their concerns they expressed few reservations about the use of CoSy. We did not expect this, as a global study in the trust in AI has documented that many people feel ambivalent about the use of AI, reporting optimism and excitement as well as fear and worry. 30 Therefore, the future ELSI workshops should be mentioned again at this point. They will be conducted to identify and discuss the specific challenges and risks in relation to the opportunities and the potential of CoSy. Speech contains a lot of information that provides insight into the speaker's inner state, mood, goals and motives. 31 CoSy will only be able to reproduce a limited amount of this information. This is partly because CoSy will hold more information on textual aspects than on emotional aspects. The combination of both research steps, the currently described requirement analysis and the ELSI workshops, will provide a more complete picture on how students and lecturers perceive the risks of CoSy.
Factors, that influence user acceptance and ultimately determine the actual use of CoSy, were addressed to varying degrees. Based on the interviews, it can be assumed that students as well as lecturers believe that training with CoSy will improve students’ performance. The Technology Acceptance Model (TAM) and the Theory of Acceptance and Use of Technology (UTAUT) are the most prevailing models in explaining what affects the acceptance of various healthcare technologies. 15 The UTAUT describes the factor Performance Expectancy, 32 the TAM Perceived Usefulness. 33 The TAM is based on the assumption that 2 main factors influence the acceptance of a technology: The aforementioned Perceived Usefulness, that is, if users believe that a technology will help them to perform their tasks better, they are more willing to accept it. Secondly, the Perceived Ease of Use, that is if users believe that a technology is easy to learn and use, they are more likely to accept it. 33 According to the UTAUT user-friendliness leads to greater acceptance, summarised under the term Effort Expectancy. Additionally, the factors Social Influence and Facilitating Conditions influence user acceptance and the use of technologies. Resources and support that students and lecturers need to be able to use CoSy effectively, such as technical support, were mentioned in the interviews (Facilitating Conditions). Social Influence, for example, whether other colleagues or fellow students will use CoSy, remains to be seen. These factors must be considered and evaluated not only in the current development phase of CoSy, but also in its practical use.
The interviews provided valuable information to understand which expectations the interviewees have for the use and implementation of CoSy. The future use of CoSy will show which of the reported framework conditions and subcategories will be particularly important in communication training. It is possible that a hierarchical structure will emerge that did not yet become clear in the interviews. Some subcategories could fade into the background, while others become more important.
The study has several limitations. The results reflect the perspectives of lecturers and students at the University of Lübeck and are applicable to this context. They cannot be transferred to other universities without limitations. Purposive sampling and sampling maximal variation 19 allowed for the participation of faculty and students from a variety of disciplines. However, due to the different levels of development of the healthcare programmes, not all perspectives could be considered. At the time of data collection, the clinical psychology and psychotherapy programme was still in the process of being established, so no experience of the students could be drawn on. Likewise, the perspective of the speech and language therapy students is missing. Overall, the recruitment and scheduling of appointments for focus groups with the students proved to be challenging. The reason for this was probably the high workload in the healthcare programmes and the number of examinations during the period of data collection. As a result, only 3 to 5 students participated in the focus groups, which possibly limited the interactions between them. In addition, only students with positive attitudes towards communication training may have participated, so negative attitudes are underrepresented in the sample. To support critical voices in the discussions the interviewers were from another faculty. The interviewed lecturers had a broad background of experience in teaching. Again, no lecturer from the clinical psychology and psychotherapy and speech and language therapy programmes could be included.
At the time of data collection, students and lecturers had no experience with the integration of an AI-based platform into teaching. The identified perspectives and attitudes of the interviewees are based on their imaginations. Especially the students thought beyond the prelims. They imagined interaction with CoSy, assuming that it would have the characteristics of a voice chatbot or conversational agent, by expressing the wish that it could make suggestions for conversational optimisation or by providing an avatar. This goes beyond the planned capabilities of CoSy as a learning platform and speech recogniser without the capability of response in a natural way. It is possible that more detailed and concrete settings for the integration of the AI-based learning platform would be described if practical experience had already been gained.
Overall, the sample showed diversity in teaching and study experience, age and clinical expertise. The structuring qualitative content analysis 18 enabled a systematic approach, the identification of central themes and the interrelationships of the categories. Strengths of the analysis are the systematic linking of perspectives through triangulation and the analysis by an interprofessional team. Individual pre-assumptions and category development were continuously reflected and discussed in the interprofessional team. In addition, the results were presented and reflected several times in the entire team and in the scientific advisory board of the project LABORATORIUM. Students are also members of the advisory board; unfortunately, a member checking 19 with the students participating in the data collection was not possible due to organisational reasons. Data saturation was neither achieved nor intended as the objective of the study. Instead, the focus was gathering the perspectives of students and lectures from the modules where CoSy will be implemented in the next phase of the project. Further qualitative surveys are planned in the project to collect the experiences of lecturers and students in the use of CoSy in communication training.
Conclusions
By implementing CoSy output in an appropriate way it seems to be able to support the process of learning and to enrich the communication skill training. Lecturers are key to students learning in simulation training, but facilitating through debriefing is not a linear, one-way distribution of learning. As the skill of the debriefer is of great importance in ensuring the best learning experience, lecturers should receive facilitation training. A tool for structured debriefings should be used to facilitate the debriefing phase and sufficient time must be available for debriefing sessions. Although the participants, especially the lecturers, see great potential for contributions to reflective learning, concrete ideas for methodological embedding need to be developed in the future. More homogeneity in the curricula and evaluation of communication training should be sought in the future. This will allow for a better understanding of how communication skills training works.
Supplemental Material
sj-docx-1-mde-10.1177_23821205251358089 - Supplemental material for Student and Lecturer Perceptions of the Use of an AI to Improve Communication Skills in Healthcare: An Interview Study
Supplemental material, sj-docx-1-mde-10.1177_23821205251358089 for Student and Lecturer Perceptions of the Use of an AI to Improve Communication Skills in Healthcare: An Interview Study by Hanna Brodowski, Anna Dammermann, Muriel Marieke Kinyara, Martina Andrea Obst, Fabian Samek, Corinna Peifer and Katharina Roese in Journal of Medical Education and Curricular Development
Footnotes
Ethical Considerations and Consent to Participate
This study was conducted in accordance with the Declaration of Helsinki and approved by the ethics committee of the University of Lübeck (number 2022-408). Informed consent to participate was obtained from all the participants for study. Participants had the opportunity to withdraw their written consent for the interview at any time. The participating students were compensated with 20 €.
Author Contributions
HB was the major contributor in writing the manuscript, recruited participants, conducted interviews, analysed, and interpreted the data. AD contributed to the writing of the results, recruited participants, conducted interviews, analysed, and interpreted the data. MMK contributed to the writing of the results, recruited participants, conducted interviews, analysed, and interpreted the data. MAO contributed to the writing of the introduction, critically revised the manuscript, and interpreted the data. FS and CP critically revised the manuscript and interpreted the data. KR conceived the original idea and supervised this step of the project, contributed to the writing of the methods, analysed and interpreted the data, and critically revised the manuscript. The manuscript has been read and approved by all the authors. All the authors have contributed substantially to the work presented in this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is funded by the German Federal Ministry of Education and Research (BMBF) and the regional government of the state of Schleswig-Holstein, Germany under project number 16DHBKI075.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
The anonymised dataset used and/or analysed during the current study is available from the corresponding author on reasonable request.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
