Abstract
In recent years, the field of qualitative data collection, management, and analysis has undergone significant evolution, with researchers worldwide increasingly leveraging technological advancements to streamline and simplify their practices. Despite the advantages of using qualitative data coding software, several challenges persist, particularly in resource-constrained settings. Issues such as the high cost of software licenses, the time required to learn and use these tools, and the complexity of advanced features often hinder researchers, especially students and early career researchers in low-income contexts, and contribute to the generation of poor-quality research evidence. This paper aims to provide a simplified, step-by-step guide to manual qualitative data coding using Microsoft Word, a widely accessible tool in low-income contexts. It is designed to support students and early-career researchers in low-income settings, helping them to conduct effective qualitative data management and analysis without the need for expensive software. We provide a five-step detailed guide for qualitative data coding using Microsoft Word, based on a practical example from previous research on maternal and child health in rural Tanzania. We also present a hypothetical example of a student research study focusing on the drivers behind pursuing a Bachelor of Science in Nursing degree. The process begins with examining the research objectives and questions to develop preliminary themes and sub-themes, identifying relevant data sources within study interview guides, and creating guided transcription and coding templates tailored to each interview guide, followed by data coding by integrating interview data into a structured template. Practical insights are provided on transforming coded data into meaningful interpretations and analyses. This systematic approach aims to facilitate effective qualitative data management and analysis using accessible tools like Microsoft Word, thereby supporting students and early career researchers in conducting small-scale studies effectively.
Keywords
Introduction
In recent years, the field of qualitative data management and analysis has evolved significantly. Researchers worldwide are increasingly leveraging technological advancements to streamline and simplify their data management and analysis practices (Abdekhodaie et al., 2018; Brailas et al., 2023; Cypress, 2019; Dalkin et al., 2021). Coding, a critical stage in qualitative data analysis, has particularly benefited from these technological advancements, with many researchers transitioning from manual to technology-based coding methods. The introduction of various software tools has revolutionized the coding process in qualitative research (NYU, 2024). Widely used programs such as NVivo, ATLAS. ti, MAXQDA, Taguette, Dedoose, and others have simplified the process, allowing researchers to efficiently label, merge, or split codes and to export and import coding schemes (NYU, 2024). The technology-based coding and qualitative analysis are expected to further benefit from the innovation of artificial intelligence (AI)-powered coding and data analysis. This advancement can facilitate rapid coding, processing, and recognition of data patterns (Friese, 2023; Morgan, 2023). This will continue to challenge the traditional methods of coding and analysing qualitative data, particularly in resource-constrained settings where the application of these new technologies is still in its early stages.
It is important to acknowledge that, despite advancements in technology, coding will remain an essential activity within the qualitative research process. Coding involves the organization and categorization of data to uncover patterns, themes, and meanings (Brailas et al., 2023; Coates et al., 2021). Through coding, researchers apply labels (codes) to segments of data, transforming raw information into a structured format that can be analysed effectively (Coates et al., 2021). This process is crucial for making sense of qualitative data and for identifying meaningful interpretations and insights.
Coding softwares have become essential in handling large volumes of qualitative data, offering increased flexibility, and improving the validity and auditability of research findings (Basit, 2003). The use of technology-based coding software not only simplifies qualitative analysis through improved data management but also bridges geographical barriers by facilitating collaboration among researchers in different contexts, thereby enhancing the rigor of qualitative research (Basit, 2003; Chris, 2008; Church et al., 2019; LaPelle, 2004). Despite the advantages of using qualitative data coding software, several challenges persist, particularly in resource-constrained settings. Issues such as high costs of software licenses, the time required to learn and use these tools, and the complexity of advanced features often hinder researchers, especially students and early career researchers in low-income regions, and contribute to generation of poor-quality research evidence (Basit, 2003; Berthet et al., 2023; Church et al., 2019; Cypress, 2019; NYU, 2024; St John & Johnson, 2000). For example, Ose et al. (2016) affirms that computer-assisted qualitative data coding and analysis software is overly advanced and complex while the goal (common among students and early career researchers) is simply to organize and structure the data. Some authors have deliberated on the necessity of reevaluating Western methodological approaches to qualitative data management when applied in low-income countries, acknowledging the financial and technical constraints that researchers in this context face (Cypress, 2019; Halme et al., 2024). Halme et al. (2024, p. 36) affirms that “research in low-income settings, however, poses significant challenges for qualitative scholars… their contextual richness cannot easily be conveyed with methodological conventions originating from Western contexts”. As a result, there is an unmet need for alternative, simplified methods of qualitative coding that can be utilised by students and early career researchers in low-income contexts without heavily relying on expensive or complex software.
Cypress (2019) asserts that although there is a growing reliance on software-powered coding, there’s recognition that qualitative data coding and analysis can still be carried out manually employing diverse techniques for data categorization and theoretical development. Cypress (2019) concluded that early career researchers undertaking small studies would be advised to opt for a manual approach initially to grasp the intuitive facets of analysis, which may in the long run (we argue that) form the fundamental basis of any analytical method, including computerized methods. In support, Basit (2003, p. 152) acknowledges that “the use of software may not be considered feasible to code only a few interviews”. Obviously, students and early career researchers in resource-constrained settings often conduct few interviews due to inability to manage the software license costs and the time required to learn their application amidst other academic/work demands. In such scenarios, manual coding offers several pedagogical advantages among students and early career researchers including eliminating the need for specialized software, reduces costs, and provides a straightforward method for organizing and analysing qualitative data (Abdekhodaie et al., 2018; Basit, 2003; Bree & Gallagher, 2016; Chris, 2008; LaPelle, 2004; Ose, 2016). Manual coding may also offer more engagement with data as the students and early career researchers transition to technology-assisted coding. This is in line with Basit (2003, p. 152) who suggests that coding “allow [s] the researcher to communicate and connect with the data to facilitate the comprehension of the emerging phenomena”. Compared to technology-assisted coding, manual coding may therefore allow students and early career researchers to immerse themselves more deeply in their data, fostering a nuanced understanding of the material that can be critical for developing analytical skills. Manual coding encourages meticulous data examination, which can be particularly beneficial for students and early career researchers who are still learning the intricacies of qualitative analysis. By engaging directly with the data, they can develop a stronger grasp of the coding process, which serves as a foundation for more advanced analytical techniques (Ose, 2016; St John & Johnson, 2000). As these students and early career researchers advance in their studies and research careers, the transition to technology-assisted coding becomes smoother, enabling them to better appreciate the efficiencies and capabilities of technology-based coding software due to their solid grounding in manual coding.
Manual coding, using tools like Microsoft (MS) Word and MS Excel, offers a viable solution for students and early career researchers in resource-constrained settings. Unlike qualitative data coding software, MS Office is often preinstalled on computers sold in local outlets in resource-constrained settings. This practice makes MS Office accessible to students and early career researchers, despite the high cost of licenses (Ghosh, 2003), thereby providing a viable alternative for qualitative data management. It is important however, to acknowledge that the use of MS Excel presents a drawback of duplicated effort, as researchers are required to navigate through numerous cells to transfer data to MS Word for interpretive writing, a process that can be both time-consuming and cumbersome (Bree & Gallagher, 2016; Ose, 2016). By using MS Word as a manual coding tool, students and early career researchers can easily employ techniques such as ‘cut and paste’ and the use of coloured fonts to categorize data and develop theoretical explanations (Cypress, 2019; Ose, 2016). This paper aims to provide a simplified, step-by-step guide to manual qualitative data coding using MS Word. It is designed to support students and early career researchers in low-income settings, helping them to conduct effective qualitative data analysis without the need for expensive software.
Methods
This paper draws on reflections of years of experience in conducting qualitative research and teaching in higher learning institutions in low-income African countries to provide a detailed, step-by-step guide for qualitative data coding using MS Word, utilizing a modified example from previous research (Isangula et al., 2024) and a hypothetical example of a student research. The process begins with a thorough examination of the research objectives and questions, which is essential for
The third step involves Summary of manual coding steps.
A Practical Case
Study Objectives, Participants and Interview Guides.
Steps for Developing Manual Coding Framework in MS Word
Step 1: Develop Preliminary Themes and Subthemes
Themes and Subthemes From Objectives and Interview Guides.
aThese can be combined into financial barriers.
bA new theme from interview guide.
Several considerations should be noted at this stage. First, students and early career researchers may choose to generate themes and subthemes from the research questions rather than the objectives, depending on their preference. Second, there may be additional themes or subthemes emerging from the interview guides that do not fit into those generated from specific objectives or research questions. These should be accommodated as additional themes, see for example, the ‘improvement needed’ theme (above). Third, some themes may evolve into subthemes and vice versa. Fourth and finally, consensus building between students, early career researchers, and their supervisors/mentors on the final list of preliminary themes and subthemes from research objectives and interview guides is crucial for ensuring rigor through investigator triangulation (Carter et al., 2014). We recommend that students, early career researchers, and supervisors/mentors examine the objectives and interview guides separately and then come together to generate a consensual list of initial subthemes, a critical step in qualitative research (Cascio et al., 2019; Raskind et al., 2019). This collaborative approach is essential for maintaining the integrity and validity of the research findings.
Step 2: Identifying the Data Sources Within the Study Interviews Guides
Themes and Subthemes and Their Data Sources From Interview Guides.
Step 3: Developing Guided Transcription and Coding Templates
Transcription is a vital part of qualitative research because it converts raw audio or video data into written text, facilitating easier coding and interpretative analysis (McMullin, 2023; Oliver et al., 2005). This process ensures that researchers can systematically examine and interpret the data, maintaining accuracy and enhancing the overall rigor of the study. Now that we have mapped themes and subthemes to their corresponding data sources, the next step is to develop guided transcription templates for each interview guide. In the context of this paper, guided transcription refers to the process of converting audio or video responses into written text using a standardized template. The purpose of guided transcription templates is to facilitate easy coding and ensure that each coded text corresponds accurately to its original data source. This is crucial because any addition, omission, or misnumbering of questions during transcription could disrupt the linkage between the text and its relevant themes and subthemes. Using guided templates helps maintain the integrity of the data, ensuring that the responses remain aligned with their designated themes and subthemes. This alignment is essential for accurate data analysis and interpretation. By systematically structuring the transcription process, we can prevent errors and ensure that the thematic analysis remains robust and reflective of the participants’ true responses. This step is also vital for maintaining the rigor and validity of the research findings.
Guided Transcription Template for FGD With Pregnant Women.
Guided Transcription Template for KII With Healthcare Workers.
Data Coding Template.
Step 4: Guided Data Coding
The fourth step involves data coding, a crucial process that helps organize and structure data from individual transcripts into meaningful categories within the coding template. The process begins by extracting pertinent information from individual interview guides and integrating it into a single template or set of ‘themes’ and ‘subthemes’. For example, transferring responses from the four FGD and eight KII transcripts into a coding template. This consolidation aids in effective data management and lays the groundwork for comprehensive analysis.
Coded Data for Cultural Beliefs and Practice’s Theme and Subtheme.
Step 5: Coded Data Cleaning and Interpretative Writing
It is crucial to conduct data cleaning after guided coding. This process involves meticulously cross-checking whether data from individual transcripts are correctly categorized under relevant themes and subthemes. This consideration becomes even more crucial when individuals other than students and early career researchers are involved in the coding process. Additionally, it includes adding or modifying key demographic details of participants that may enhance interpretative analysis. This may involve reviewing a separate sheet of participants’ demographic characteristics to identify demography influencing responses that should be integrated into coded texts, and then incorporating them into corresponding quotes or statements to enrich interpretative analysis. Eliminating template labels, such as ‘data sources’ and ‘insert responses from all transcripts’, is also part of this process. These steps ensure that the transcribed text is prepared and optimized for interpretative analysis. By ensuring data accuracy and relevance, researchers can confidently proceed with the qualitative analysis, focusing on deriving meaningful insights and drawing valid conclusions from the coded data. This meticulous approach enhances the overall integrity and reliability of the research findings, supporting robust interpretations and contributing to the study’s overall credibility.
After coding and data cleaning, the next and final step is interpretative analysis. This phase involves thoroughly examining all the coded data to uncover underlying patterns, relationships, and even new themes within the participants’ perspectives. Rather than just organizing data into categories, interpretative analysis seeks to understand the nuanced experiences, beliefs, and behaviors expressed in the coded text (Gopaldas, 2016; Mays & Pope, 2000Mays & &ope, 2000). Researchers engage in comparing and contrasting perspectives, identifying contradictions or confirmations, and synthesizing findings to develop a comprehensive understanding of the research phenomenon.
Sample Analysis From Manual Coded Data Using MS Word.
Additional Example
Research About Drivers Choosing Bachelor of Nursing Degree.
Manual Coding Process
Guided Transcription Guide and Coding Template for a Study of Drivers of Choosing Bachelor of Nursing Degree.
Coded Data for a Study on Study of Drivers of Choosing Bachelor of Nursing Degree.
It is important to note that more advanced qualitative researchers may code these interviews differently and develop themes that differ from those presented here. We should acknowledge that these themes are designed to facilitate learning among students and early-career researchers rather than advanced qualitative experts. “The theme of career aspirations and professional goals emerges as a significant driver for individuals pursuing a Bachelor of Science in Nursing. Many students express a deep desire to make meaningful contributions to healthcare, particularly through direct patient care. Aspirants often recognize that while other fields, such as biology or public health, provide valuable insights, they lack the hands-on clinical experience integral to nursing. The Bachelor of Nursing program is viewed as a pathway that equips students with essential skills and knowledge, enabling them to specialize in areas such as critical care or community health. Additionally, the holistic approach to patient care inherent in nursing resonates with those aiming to address healthcare disparities, especially in underserved communities. This alignment between personal aspirations and the practical training offered by nursing solidifies the choice for many, making it the ideal field for achieving their professional goals.”
Conclusions and Discussion
Strategies aimed at facilitating rapid qualitative research methods are gaining significant recognition in scholarly articles. Most of these studies focus on developing rapid practices, such as employing targeted transcription methods, to ensure smooth and rapid generation of qualitative insights (Lewinski et al., 2021; Vindrola-Padros et al., 2020; Vindrola-Padros & Johnson, 2020). For example, Vindrola-Padros et al. (2020) employed rapid assessment procedures during COVID-19 by training field researchers to use daily summary sheets for swift synthesis and continuous reporting to stakeholders during the study. This signifies that qualitative methods are continuously evolving to ensure both simplicity and systematic generation of evidence across various contexts, needs, and circumstances.
Recognizing the necessity for a simplified yet systematic approach to generating qualitative evidence in resource-constrained settings, this paper systematically outlined a methodical approach to manual coding of qualitative data using MS Word, emphasizing its significance as a foundational skill for students and early career qualitative researchers. Manual coding, akin to computer-assisted methods, involves organizing and categorizing textual data to uncover patterns, themes, and relationships crucial for deriving insights and interpretations from qualitative datasets (Basit, 2003; Berthet et al., 2023; Coates et al., 2021; St John & Johnson, 2000). The key distinction lies in the simplicity, applicability and accessibility of MS Word for manual coding, requiring less specialized training, time and resources (Ose, 2016).
Manual coding using MS Word offers distinct advantages over automated software for novice researchers. Firstly, students and early career researchers in low-income settings often face significant challenges due to limited financial resources for their research projects. The high cost and time demand for learning the use computerized coding software (Basit, 2003; Berthet et al., 2023; Church et al., 2019; Cypress, 2019; NYU, 2024; St John & Johnson, 2000) necessitates alternative tools that can simplify and systematically aid in generating evidence, making them highly preferred. Therefore, manual coding using MS Word empowers students and early career researchers to adapt coding frameworks to fit specific research questions and contextual constraints, such as limited time and financial resources. Unlike rigid algorithms in automated software (Abdekhodaie et al., 2018; Brailas et al., 2023; Cypress, 2019; Dalkin et al., 2021), MS Word allows flexibility in coding to capture emergent themes and unexpected findings encountered during analysis. This adaptability is particularly valuable in qualitative research, where iterative refinement of coding frameworks is often necessary to explore diverse perspectives and experiences (Braun & Clarke, 2006). Nevertheless, while computerized techniques are faster and more efficient in managing and analysing data compared to manual methods, both manual and computerized software often yield similar results in analysis (Owan & Bassey, 2018). Therefore, researchers should feel confident using either technique without hesitation guided by available resources, time and expertise. Second, manual coding may foster methodological rigor and reflexivity by involving students and early career researchers in the coding process from start to finish. The development and application of transcription and coding templates empower students and early career researchers to actively engage in data management and analysis under mentorship of their supervisors, thereby gaining insights that are valuable as they transition to technology-assisted research processes. This active engagement under mentorship enables critical reflection on assumptions and biases, thereby enhancing the transparency and validity of research findings before exposure to automated software.
Third, students and early career researchers directly engage with data through user-friendly MS Word templates. Unlike automated tools, manual coding encourages students and early career researchers to develop templates, perform guided coding, meticulously review and analyse each piece of data, gaining a deeper understanding of the dataset’s complexities and nuances without a need to own an expensive license for a coding software (Church et al., 2019; Hacking et al., 2023; Ose, 2016; St John & Johnson, 2000). This inexpensive hands-on approach allows for the discovery of insights within MS Word that automated tools may overlook. Moreover, manual coding with MS Word may cultivates essential analytical skills among students and early career researchers, including systematic data analysis, pattern recognition, integration of diverse viewpoints, and synthesis of findings into coherent narratives (Basit, 2003; Ose, 2016). These skills are transferable across disciplines, enabling researchers to conduct rigorous qualitative analyses and contribute substantively to academic discourse.
In conclusion, proficiency in manual coding using MS Word is essential for students and early career researchers engaged in qualitative research. It may enhance engagement with data, strengthen methodological rigor, encourages flexibility in coding frameworks, and develops critical analytical skills. Mastering manual coding techniques with MS Word may equips students and early career researchers in resource-constrained settings to navigate qualitative data complexities effectively, generate robust research outcomes, and make meaningful contributions to their fields of study. Therefore, investing in the development of manual coding skills using MS Word is pivotal for achieving high-quality qualitative research outcomes and advancing knowledge across diverse academic disciplines, particularly in settings with limited resources.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Ethical Statement
Data Availability Statement
All data supporting the descriptions of this paper are included within the article.
