Abstract
The majority of adolescents communicate via text-based messaging, particularly through WhatsApp, a widely used free communication application. Written content on WhatsApp has the methodological potential to provide rich qualitative interview data. This study compares data collected using text-based WhatsApp versus face-to-face interview techniques. Semi-structured interviews were conducted with a sample of early school leavers in South Africa, using WhatsApp (n = 9) and face-to-face (n = 27) followed by a focus group discussion with interviewers. Mann–Whitney U and chi-squared were used to assess associations. WhatsApp text-based interviews took significantly longer to complete but were comparable to face-to-face on the number of themes generated. Rapport, measured as the number of statements from the interviewer aimed at creating a sense of affinity, comfort, and distress reduction, differed between interview conditions. It may be methodologically appropriate for researchers to offer participants a choice of a preferred method of interviewing or consider pragmatic blended approaches of face-to-face and WhatsApp.
Introduction
In the last decade, internet-mediated communication platforms such as Skype, WhatsApp, and Zoom have been identified as useful qualitative data collection tools (Archibald et al., 2019; Gibson, 2022; Kaufmann & Peil, 2020; Krouwel et al., 2019; Reñosa et al., 2021). The sudden emergence of the COVID-19 pandemic and the restrictions on face-to-face engagement have increased technology use and have forced researchers globally to consider switching to online data collection methods (Chen et al., 2020; Dodds & Hess, 2020). The digital world particularly plays an important role in an adolescent’s social development (Gibbs et al., 2020) as they are increasingly using digital spaces for education, entertainment, communication, and connection, representing one in three internet users globally (Stoilova et al., 2021). Given the increased use of technology for communication and interaction among adolescents, adolescent health research may particularly benefit from online data collection methods.
While there is a wide range of digital communication methods, including video and audio communication, text-based communication makes up a large part of adolescent social interaction (Ehrenreich et al., 2020). Technology offers researchers the opportunity to engage with adolescents using creative ways to elicit self-expression and communication on platforms that they are already comfortable and familiar with. It allows access to diverse groups of adolescents, gaining the voices of historically underrepresented or hard-to-reach adolescent populations (DeJonckheere et al., 2017; Elgar et al., 2015). Online data collection also has the advantage of being less dependent on scheduling restrictions and geographic location (Jowett et al., 2011; Shapka et al., 2016). It is convenient, saves costs, and allows for automatic and accurate storing of raw data (Janghorban et al., 2014; Jowett et al., 2011; Kazmer & Xie, 2008).
One promising method for collecting data online with adolescents in low- to middle-income countries is the use of WhatsApp. WhatsApp is a free smartphone and computer-mediated instant messenger application (App), with two billion users worldwide in 2021 (Iqbal, 2021). The widespread use of smartphones and internet penetration in Sub-Saharan Africa has made WhatsApp the most downloaded App in Africa, with, for instance, 52% of the South African population using it (Steup, 2019). WhatsApp allows users to send real-time text, locations, images, voice recordings, documents, and videos. Compared to other instant messaging Apps, prior registration is not required which is more favourable if internet bandwidth is poor (Church & De Oliveira, 2013). A review of the literature has shown that WhatsApp has been used in many ways including dissemination of surveys through sharing an online survey link (Chen et al., 2020; de Gruchy et al., 2021; Kamel Boulos et al., 2016) in clinical practice to communicate with patients (Kamel Boulos et al., 2016; Mars & Scott, 2016), to support interventions (Cheung et al., 2020; Durmaz et al., 2019; Patel et al., 2018; Pereira et al., 2020) and research projects (Jailobaev et al., 2021), or as an educational tool (Coleman & O’Connor, 2019; Kamel Boulos et al., 2016). Relative to the number of WhatsApp users in low- and middle-income countries, to date, the use of WhatsApp as a data collection tool for qualitative interviews has been limited (Kaufmann & Peil, 2020; Reñosa et al., 2021; Singer et al., 2020).
Face-to-face interviewing has been the golden standard for decades in qualitative research, with published guidelines on best practices (Opdenakker, 2006). However, in a digitalized world, online interviewing methods may also be a plausible tool for gaining insights into the realities of research participants. In a qualitative interview, text-based communication affords adolescents more control of the conversation, and flexibility in responding, and the absence of verbal cues may potentially decrease fear of social judgment (Lee, 2007). The lack of face-to-face contact in online discussions allows participants to speak more freely, especially around sensitive topics (Singer et al., 2020) and to exit discussions with less explanation. The distance created online between researchers and participants potentially limits social judgment and power differentials tend to recede to the background, which can be problematic in face-to-face interactions (Cleary & Walter, 2011; Voida et al., 2004).
Nonverbal cues (head nods, smiles, eye contact, etc.) in face-to-face communication provide speakers and listeners with a richness of meaning and information and are important aspects of qualitative interviewing. These non-verbal elements are generally absent in online discourse and some researchers raised concerns about the flow of conversation and the difficulty in reading or responding to emotions expressed in text from the participant (Opdenakker, 2006; Reñosa et al., 2021). However, studies have also shown that with time and experience, the same richness of face-to-face communication could occur over a virtual text-based medium (Gajadhar & Green, 2003; Jowett et al., 2011; Kazmer & Xie, 2008; Kumari & Ganagwar, 2018). Users have found ways to increase the richness of online communication that is comparable to face-to-face encounters through the creation and use of emoticons, pictograms, and internet slang abbreviations such as “LOL” (Laughing out Loud) (Hancock et al., 2007). These evolving modes of communication have become a distinct style of non-verbal communication in the digital world that is widely understood and an accepted form of communicating emotion or non-verbal elements. Furthermore, while it is possible for an individual to assume a false identity and share false information in an online text-based interview, a longitudinal study validating self-report data with actual behaviour and external data found that the risk of adolescents assuming a false identity and sharing false information tends to be small (Stieger & Göritz, 2006). Given the lack of consensus in the literature and the relatively limited research conducted to date, this study explores the feasibility of text-based interviewing among adolescents.
The few studies that have examined this issue globally have used quantitative methods (measuring interview duration, word count, and themes) to assess the quality of online and face-to-face interview methods. These studies compared telephone (Irvine, 2011), video calling (Krouwel et al., 2019), and computer-mediated instant messenger interviews (such as Microsoft Instant Messenger) (Shapka et al., 2016) to face-to-face interviews. Despite reported variations in the duration and number of words produced in the online condition, results showed that data quality was unaffected by the mode of data collection (online vs. face-to-face) with few differences in the number, depth, and type of themes discussed (Irvine, 2011; Krouwel et al., 2019; Shapka et al., 2016). To our knowledge, no studies have systematically compared face-to-face interviewing techniques to text-based interviewing using WhatsApp among adolescents.
The main objective of this study was to compare the feasibility and acceptability of interviews delivered via WhatsApp to interviews delivered face-to-face, in a sample of adolescents. This study takes a pragmatic approach (Kaushik & Walsh, 2019), which is a strategy that quantifies interview quality based on established best practices (Flick, 2013), but also consider the lived experiences of participants and researchers on the different methods of interviewing. This study, therefore, examined the following two research questions:
What were the differences between the text-based WhatsApp and verbal face-to-face interview conditions with respect to the themes, flow, focus and depth of the interview, rapport (interviewers creating a sense of affinity, comfort, and distress reduction), and interview duration?
What were the opinions, experiences, and reflections of the interviewers and participants using WhatsApp and face-to-face interviewing methods?
Methods
Study Participants
The interviews were conducted among early school leavers in South Africa. Early school leavers are considered a hidden population, with no existing register or database of adolescents once they drop out. Participants were recruited from four urban communities in the Western Cape province in South Africa, due to the high percentage of early school leavers in this region (Desai et al., 2019). Early school leavers were defined as individuals aged between 13 and 20 years who had not completed their schooling or obtained their national senior certificate, and who were not currently attending school. Individuals who were enrolled in college or vocational training were excluded from the study.
Participants were recruited and tracked using respondent-driven sampling (RDS) (Heckathorn, 2002). Using the RDS method, the initial sample or “seeds” of early school leavers were obtained through various channels such as community youth groups, referrals, and by approaching people and then were confirmed to have met the eligibility criteria in the selected communities. Our goal was to obtain four seeds in each interview condition. Simple randomization was used to assign the seeds to either the face-to-face (white in Figure 1) or WhatsApp (grey in Figure 1) interview conditions. Each seed was required to identify up to two early school leavers to participate in the interview condition that they themselves had been allocated to. These participants formed the “first wave” of sampling and were themselves asked to identify and refer a further two more early school leavers. Up to two waves of recruitment were conducted for each interview condition (Figure 1). Participant eligibility was further confirmed through contacts that the potential participant provided. Participants who expressed their inability to participate in the WhatsApp or face-to-face interview that they were initially allocated to were excluded from the study. Each respondent who completed the interview received a monetary reimbursement in the face-to-face condition and mobile data in the WhatsApp condition. Also, for every successful recruit, the recruiter received an additional monetary incentive in the face-to-face condition and mobile data in the WhatsApp condition.

Respondent-driven sampling for out-of-school youth—a schematic representation of four seeds.
Three interviewers were selected from a pool of applicants based on research experience. Given that the Western Cape is characterized by diverse cultures and languages, the study aimed to gain a diverse representation of interviewers based on demographics and languages spoken in the communities where the study participants were recruited. Up to three interviewers were selected to recruit, conduct face-to-face and WhatsApp interviews, and participate in a focus group discussion at the end of data collection. Interviewer 1 described herself as White, had the most prior experience in qualitative interview training and conducted the interviews primarily in English. Interviewer 2 identified herself as Colored, had moderate experience in conducting interviews and conducted the interviews in Afrikaans and/or English. Interviewer 3 identified as Black African, had less interview experience compared to the other interviewers and conducted the interviews in either English or Xhosa. Race was classified according to the South African Department of Labor designated categories (1 = black African, 2 = Coloured, 3 = Indian, 4 = White, 5 = Other) (Stats SA, 2011).
Data Collection Tools
The interviewers used a semi-structured discussion guide to conduct a one-on-one in-depth interview and each interviewer conducted a mix of face-to-face and WhatsApp interviews. Participants were given the option to have the interview conducted in either English, Xhosa, or Afrikaans, which are the predominantly spoken languages in the region.
The discussion guide covered questions related to participant demographics (race, age, gender), the reasons for dropping out of school, social influences, opinions and experiences of health behaviours that placed them at risk, and their opinions and reflections of the interview. A complete discussion of these findings from the conducted interviews has previously been published (Desai et al., 2020). The semi-structured discussion guide was designed in English and translated into two official South African languages, namely Xhosa and Afrikaans. The researchers used anchor questions to introduce different topics during the interview or to clarify comments made by the participants. The content and format of the interview were identical for both interview conditions. A different semi-structured interview guide was used to guide the focus group discussion among the three interviewers which took place once all interviews had been conducted. Interviewers were asked to describe their educational qualifications, experiences, and opinions on recruiting participants via RDS and administrating the WhatsApp and face-to-face interviews.
Procedures
The face-to-face interviews were conducted verbally and at locations mutually agreed upon by the interviewer and participant, that is, in a private and confidential space at the participant’s home or at a public library, and were audio recorded. The WhatsApp interviews took place at an agreed time between the interviewer and participant. A dedicated WhatsApp account for this study was used by the researchers and participants were contacted using their personal WhatsApp accounts. The participants in the WhatsApp interview had the option of conducting the interview in a single session or over a few days. Interviewers initially introduced themselves over a voice note message. Thereafter, participants and interviewers only used text in the WhatsApp condition and were encouraged to use emoticons.
Once the interview was over, participants were requested to delete the interview conversation on WhatsApp. The WhatsApp conversation log was converted into an interview transcript. Once the chat was downloaded and saved on a password-protected computer, the chat was deleted from the App by the interviewer. The audio recordings of the face-to-face interviews were transcribed verbatim. To check for consistency and correct translation, the transcripts for all interviews were translated into English from Afrikaans and Xhosa, and then back-translated into the original languages by two independent researchers proficient in English and Afrikaans or Xhosa.
A total of 39 participants were interviewed; however, three participants were excluded due to the following reasons: in the face-to-face condition, one participant had a language barrier. In the WhatsApp condition, one participant had a prior relationship with their allocated interviewer, and one participant did not complete the interview.
After the data collection had been completed, all three interviewers participated in an in-depth focus group discussion. The focus group was facilitated by a neutral researcher that was not part of this study but had sufficient research and interviewing experience. The focus group took place face-to-face at a research office. Consent was obtained from the interviewers for the interview to be audio recorded and to maintain confidentiality. The focus group discussion was approximately 1 hour and transcribed verbatim by the researcher who facilitated the focus group.
Analysis
The authors adopt a hybrid approach of inductive and deductive coding (Fox, 2008). The interview transcripts from both interview conditions (1 = WhatsApp, 2 = face-to-face) were coded according to dimensions indicative of content, flow, focus, and depth of the interview process (Shapka et al., 2016). These dimensions include: (1) interview duration, calculated as the total number of hours, (2) the total number of words spoken by the interviewer and participant, (3) the number of interviewer probes, calculated as the total number of active invitations for comment, clarification of a statement or question, question re-phrases, follow-up questions by the interviewer specific to the interview guide questions, (4) rapport, calculated as the total number of statements from the interviewer aimed toward creating a sense of affinity, comfort, and distress reduction, (5) the number of off-topic statements made by the interviewer, (6) the number of interview questions answered by the participant, and (7) the number of themes that emerged from each interview condition relating to the reasons for leaving school (social interactions, vocational aspirations, family factors and poor academic performance). In addition, the number of emoticons used by participants and interviewers was also counted in the WhatsApp interviews. For standardization, the interview started from the beginning of the first interview question until the end of the response to the last interview question.
The first and second authors (RD and AM) constructed the variables from the interview transcripts. Using Atlas Ti version 8 (ATLAS.ti Scientific Software Development GmbH), a deductive approach was used to code the interviews according to predefined codes indicative of the flow, focus, and depth of the interview process (Burnard, 1991). The first and second authors blindly coded four transcripts and then compared and discussed the coding and any discrepancies found. The last author served as a third independent reviewer of the codes to reach a consensus on the codebook. A final codebook was developed, and the remaining transcripts were coded. The counts of each code were analyzed quantitatively using SPSS (IBM SPSS Statistics for Windows, Version 25.0. Armonk, NY: IBM Corp) by RD and LM. A description of the sample for each interview condition was analyzed. Mann–Whitney U and Fishers’ exact chi-squared tests were conducted to identify associations and differences in the interview process that emerged from each of the interview conditions.
Guided by the interview questions from the individual participant interviews and the interviewer focus group, a qualitative categorical analysis approach using Atlas Ti (Burnard, 1991) was performed. Following several readings of the transcripts, data were initially coded and collated between RD and AM. Subsequently, connections between the codes were examined to identify emerging themes. Themes were then reviewed and refined. Extracts were selected, and the results of the analyses were organized and presented in this manuscript.
Ethics and Consent
Ethics approval was obtained from the Human Sciences Research Council (Protocol number: REC 2/23/08/17). Adolescents who are out of school, may be out of school without their caregiver’s knowledge or have unstable family support structures and therefore might have been unwilling to participate if they had to divulge the nature of the research to their parents or caregivers to obtain parental permission. Given the low risk of the study on respondents, obtaining independent consent from the adolescent participants themselves was feasible and increased participation. Therefore, in line with the South African National Department of Health Ethics guidelines (2015) section 3.2.2.4., permission for independent consent for minors was obtained (Department of Health, 2015). Once eligibility for participation was established, full written informed consent was obtained from each participant. The interviews in both conditions were not anonymous as the researchers could identify the participants using their WhatsApp account names, but the interviews were confidential and all identifying information was not included in the manuscript.
Results
Demographics and Characteristics of Participants and Interviewers
In the WhatsApp condition, on average each seed recruited 0.5 participants, and the first wave of participants on average recruited 1.5 participants. In the face-to-face condition, on average, each seed successfully recruited two participants and the first wave of participants recruited an average of 1.9 participants. As shown in Table 1, approximately half the participants were female (52.8%) and the majority described themselves as Colored (61.1%). The mean participant age was 18.34 years.
Demographics and Characteristics of the Participants and Interviewers.
p Value is significant at the .05 level.
Comparison of Interview Conditions on Dimensions of Interview Quality
Mann–Whitney U and Fisher’s exact chi-squared tests were conducted to identify differences and associations between the interview conditions on key characteristics indicative of the content, flow, focus, and depth of the interview process. As seen in Table 1, the participant’s race (χ2 (1) = 7.636, p = .014) and interviewer’s experience (χ2 = 7.094, p = .029) were significantly associated with interview groups. As seen in Table 2, on average, WhatsApp interviews took significantly more time (U = 0, p < 0.001), but face-to-face interviews were longer in terms of the number of words stated by the interviewer (U = 50.5; p = .008). Interviewers also appeared to engage in different ways across the two conditions, with face-to-face conversations having significantly more probes (U = 2.5; p < .001) and WhatsApp interviews having significantly more off-topic statements (U = 64.5; p = .0015). Only one theme (social interactions) was significantly different between the WhatsApp and face-to-face interview conditions (U = 72; p = .038).
Comparison of Interview Conditions on Content, Flow, Focus, and Depth of the Interview Process.
p Value is significant at the .05 level.
Opinions and Experiences of the Participants and Interviewers
Extracts of quotations from the face-to-face and WhatsApp interviews on the opinions, experiences, and reflections of conducting the interviews of the three interviewers and the participants are presented in Table 3. For each interview condition, the quotations are categorized into the following emergent themes from the categorical content analysis: recruitment, rapport, duration, questions, conveniences, interview preferences, and recommendations.
Quotations Extracted From Participants and Interviewers From WhatsApp and Face-to-Face Interview Conditions.
Note. ID refers to the participant’s identity number. N = number of participants that stated the quote.
M refers to the Moderator in the focus group discussion with interviewers.
Recruitment
The recruitment numbers using the RDS method was lower in the WhatsApp condition (n = 9) compared to the face-to-face condition (n = 27). RDS recruitment was considered convenient in the face-to-face setting as participants found it easy to recruit in their neighbourhood and conduct the interview on the same day. Participants honoured their appointments in the face-to-face condition as it was convenient for them to attend the interview in their own communities. In contrast, the number of invitations sent through text on WhatsApp was higher compared to the Face-to-face condition but recruitment in the WhatsApp condition was lower, took longer and required several follow-ups.
Seeds recruiting were also familiar and well-known within the community that they were recruiting from which may have contributed to participant’s willingness to participate in face-to-face interviews. According to the interviewers, the monetary reimbursements offered in the face-to-face condition were more valuable than the mobile data reimbursement offered in the WhatsApp condition, which they attributed to the low recruitment numbers.
Rapport
Participants in the face-to-face condition (n = 4) felt that they could relate more easily to the interviewers from a similar sociodemographic background as well as speak in a language that was comfortable to them. Participants, particularly in the face-to-face interview, enjoyed the interview and learned something about themselves. Participants in the WhatsApp interview condition did not have any face-to-face contact with the interviewer, which they claimed initially made them feel suspicious and distrustful. Given the absence of visual cues in the WhatsApp condition, one participant requested videos of the interviewer during the interview. At the end of the interview, some participants (n = 4) in the WhatsApp condition reported that the distance and the increased anonymity between researcher and participant made power differentials less visible, eventually leading to confidence in the interviewer and the interview process itself. There was trust that the interview remained confidential and that they could not be identified due to a lack of face-to-face contact.
All interviewers in both the WhatsApp and face-to-face conditions found difficulty in building trust and rapport at the beginning of the interview. In the face-to-face condition, a participant and the interviewers commented on the importance of the interviewer’s skills in building rapport, which included effective communication skills, reading body language, the use of probes, and the ability to reserve judgment. These strategies used by all interviewers in the face-to-face condition allowed participants to feel more comfortable with the interview, have trust in the interviewer and disclose sensitive information.
Duration
When participants were asked what they disliked about the interview, most participants did not find any difficulties in the face-to-face interview but those in the WhatsApp interview condition (n = 6) found that the interview was time consuming, uninteresting, and tedious. Participants also reported that they were multitasking and sometimes struggled with slow internet bandwidth.
The three interviewers shared the same sentiment and felt that a single WhatsApp interview was time consuming as participants would chat at their convenience, which would be over several days and sometimes during odd hours of the night. The flow of questions was interrupted resulting in long breaks between interview questions. Following up on participants and motivating them to continue an interview was difficult and resulted in repeating questions, further increasing the duration of the interview.
Questions
Most participants were satisfied with the questions asked in both interview conditions; however, some considered the questions sensitive and found it difficult to disclose information (n = 4). From the interviewers’ perspective, the WhatsApp condition was advantageous in that it allowed participants time to read the questions and think of their responses, possibly resulting in more in-depth responses.
Conveniences
Participants (n = 4) stated that the WhatsApp condition was convenient as it accommodated participants who preferred not to speak face-to-face and felt more comfortable communicating online at their convenience. The main convenience of the WhatsApp condition for interviewers was the lack of transcription involved as the chat served as an interview transcript. Interviewers (n = 2) also felt that conducting the face-to-face interview in the privacy of the participant’s own home or environment may play a role in the depth of responses obtained from participants due to the comfort and familiarity of their own environment, as compared to conducting the interviews in a foreign space.
Interview preferences
When face-to-face participants were asked if they were willing to do the interview over WhatsApp, there were mixed responses. One participant in the face-to-face condition expressed a preference for face-to-face interviewing over writing or texting. One participant also felt that information shared on WhatsApp is not private, confidential, or secure. The interview experience would not be genuine and they would be uninterested. Other participants (n = 4) preferred the WhatsApp condition due to increased anonymity between researchers and participants, allowing participants to talk more freely about sensitive topics without fear of judgment. Two participants did not have a smartphone and would require permission from their parents to use their phones. Among those in the WhatsApp condition, one participant claimed that they would be comfortable with both interview conditions. Some participants (n = 3) in the WhatsApp group would have preferred to have done their interview face-to-face or telephonically to decrease the duration of the interview.
Although transcribing the face-to-face interviews was time consuming, all interviewers in this study still preferred conducting the interviews face-to-face as it was time efficient for the participant, a technique that they were familiar with, and that they could easily make use of rapport-building strategies to gain in-depth responses.
Recommendations
Recommendations by participants (n = 4) and interviewers were mainly for improvements to be made to the WhatsApp interview condition. Participants (n = 3) and interviewers recommended the use of alternative features in WhatsApp such as video or telephonic calls and voice notes to minimize the time taken to type a response. However, this would increase the time spent on transcribing by the interviewers.
Interviewers in the WhatsApp condition found that the questions were more appropriately designed for a face-to-face interview but needed to be tailored for use via WhatsApp, possibly by using language such as abbreviations and slang which are commonly used in text messaging. To increase motivation and participation in the WhatsApp interview, the interviewers suggested that the reimbursement offered could work differently to the face-to-face conditions, with participants receiving monetary reimbursements for every few questions answered in the WhatsApp interview rather than receiving their reimbursement at the end of the interview.
Discussion
The COVID-19 pandemic has brought an urgent need for remote data collection methods for research. This study examined the differences in qualitative data obtained from a group of adolescent early school leavers when using two different methods, namely, text-based WhatsApp, and traditional face-to-face interviews. Given that the interviews consisted of different communication modes (verbal and text), it was found that text-based WhatsApp and face-to-face interviews each presented with unique opportunities, as well as challenges.
Rapport was effectively developed in both interview conditions using techniques appropriate for the interview mode. The number of rapport-building statements used in the WhatsApp and face-to-face interview conditions was equivalent; however, significantly more probes were used by the interviewers in the face-to-face condition and significantly more off-topic statements initiated by the interviewer were present in the WhatsApp condition. Previous studies found that building rapport took longer over instant messaging with an increased number of rapport-building statements used by the interviewer compared to face-to-face interviews (Jowett et al., 2011; Kazmer & Xie, 2008; Shapka et al., 2016).
In this study, participants and interviewers made use of emoticons and acronyms (such as “lol”) to express non-verbal cues more explicitly. The number of off-topic statements initiated by the interviewer may also have contributed to greater rapport. In keeping with previous studies, we also found that the anonymity and distance between the interviewer and participant in the WhatsApp condition obscured power differentials, enabling rapport building (Cleary & Walter, 2011; Shapka et al., 2016). In the WhatsApp condition, rapport still benefitted from the interviewer introducing themselves over a voice recording as well as the interviewer’s skill in expressing emotion over text through the use of internet vernaculars such as punctuation, emoticons, and internet abbreviations and keeping participants engaged over text using off-topic conversations. The use of voice, video, and text offer participants and researchers many modes of communication that they feel most comfortable with. In the face-to-face condition, rapport still benefitted from conventional interviewer skills including attentiveness, flexibility, social skills, and effective communication.
The qualitative findings show that participants in the face-to-face condition felt that they could relate more easily to the interviewers from a similar sociodemographic background. Given that each interviewer had varying experience levels in conducting qualitative research and that participant race was not equally distributed between the interviewers, it is important to acknowledge possible interviewer and participant effects. A qualitative interview is more than just a measure of the frequency of a conversational interaction between two people (Gibson, 2022). Rather, it is the social interaction and relationship between the interviewer and interviewee that affects the process and outcome of the interview. It is pertinent to therefore acknowledge that the quality of an interview is shaped by variables such verbal and non-verbal messages, skills and knowledge of the interviewee, culture and demographics such as race, which also transpire during the interview process and affects the data produced (Frances et al., 2009; Potter & Hepburn, 2005). Future studies should consider the interviewer and participant characteristics and experiences of conducting the WhatsApp and face-to-face interviews, to understand how the different methods of interviewing shape the research encounter and the data produced.
The advantages of the text-based WhatsApp interview included time saved from the automatic transcript generated from the chat, participants could respond at their own convenience and reflect on their responses, and it accommodated participants who preferred not to talk about sensitive information in person. With the structure of the interview guide identical in both interview conditions, the number and types of themes were almost equivalent. This finding suggests that despite the differences in the number of words produced and the duration of the interview, the information obtained was similar across the interview conditions. This finding is consistent with previous studies that compared instant messaging to face-to-face interviews (Hinchcliffe & Gavin, 2009; Shapka et al., 2016; Singer et al., 2020).
Given the novelty of WhatsApp interviewing, this method presented challenges. While significant progress may have been made to extend internet access to under-served areas prior to the COVID-19 pandemic, socio-economic barriers to accessing the internet include the high cost of mobile data, the price of devices, and poor signal (Desai & Burton, 2022). Researchers should keep these digital inequalities in mind and select an approach that is suitable for the sociodemographic context in which the research is taking place. Having friends or family recruit into the WhatsApp condition yielded low recruitment. This could be attributed to the nature of the reimbursement being mobile data compared to the monetary incentive offered in the face-to-face interview for recruiting participants. Those recruited may have also not had access to a smartphone and this may have led to them not participating. A previous study that used Zoom to conduct qualitative interviews used phone calls and social media platforms to successfully recruit participants (Reñosa et al., 2021). Once participants were recruited in this study, they were still sceptical of the WhatsApp interview upon first contact with the interviewer. This challenge was somewhat mitigated by interviewers introducing themselves over a voice note, explaining the nature of the study and ensuring that the interview was secure and confidential. Furthermore, both participants and interviewers found the online interview long, and tiring and participants said that they began to lose interest. Participants and researchers recognized that a way to mitigate the challenges encountered in the WhatsApp condition would be to use alternative multimedia features available on WhatsApp such as phone, voice, and video. These features have been validated in other studies (Kaufmann & Peil, 2020; Reñosa et al., 2021; Singer et al., 2020) with relatively modest differences in data quality and participant satisfaction between face-to-face and online modes. However, incorporating these features would require further transcription time from the researcher.
This study is not without its limitations. The authors acknowledge that face-to-face nonverbal communication contains many complexities such as body language and tone that are unique to this form of communication. WhatsApp also contains unique non-verbal cues such as the use of emoticons, non-standard punctuation, text formatting, and letter repetition which mimics the tone of voice (Hancock et al., 2007). The authors unfortunately could not analyze the full range of non-verbal cues present in each of these interview conditions. An in-depth semantic analysis of non-verbal communication in the different interview conditions can be explored in future analysis to account for the complexities of nonverbal communication during interviews. The sample size of the study is within the recommended range for a qualitative study but does not hold adequate power to conduct complex quantitative analysis. However, the sample size was sufficient to identify numerous significant descriptive differences between the interview conditions which can further be explored in future studies with larger sample sizes. The relatively small sample size in the WhatsApp condition compared to the face-to-face condition should also warrant future studies to test this methodology on a larger scale. This study focused on adolescent early school leavers and their reason for leaving school. Education level may have played a role in the depth of participant’s responses and preference to conduct interviews verbally or over written communication. Although previous studies have found that adolescents are comfortable with communicating in online environments, future work is needed to test the compatibility of WhatsApp interviewing among other adolescent and adult populations. We acknowledge that sections of data may be included in multiple themes with some overlap between themes. However, the researchers considered how each theme fitted into the overall story within the entire data set. Furthermore, due to adolescent early school leavers in this study being a hard-to-reach population and the limited resources such as funding and time constraints, transcripts were not returned to the participants for comment and/or correction due to difficulty in contacting them again.
Conclusions
This study adds to the literature that is empirically comparing online text-based and face-to-face interviewing methods. This study contributes to a growing body of knowledge in the qualitative community on using remote data collection tools amid an ongoing pandemic. Remote data collection is not a replacement for face-to-face interviews but can be a highly beneficial complement to this approach. Using WhatsApp to collect qualitative data among adolescents bodes well for researchers because the number of themes generated and the rapport-building process is almost equivalent to that obtained face-to-face. Although the length of time taken in the WhatsApp condition was a concern, it is recommended that the WhatsApp interview protocol be adapted to include, for example, an open-ended question sent to participants once a week over approximately a month, which will shorten the interview time, increase interest, and allow an analysis of the responses to that question in real-time. Internet vernaculars and other multimedia features such as voice or video could also be incorporated into the WhatsApp interview. Participants should also be given a choice in how they wish to be interviewed, which could possibly have a positive impact on response rates and interest in the study.
Footnotes
Acknowledgements
We want to acknowledge all the research assistants and adolescents without whom this study would not have been possible.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Ethical Approval
Ethics approval was obtained from the Human Sciences Research Council (Protocol number: REC 2/23/08/17).
