Abstract
Moral distress (MD) has become a seriously negative problem experienced by healthcare professionals, especially clinical nurses. Early and accurate detection of MD by the validated and reliable instrument is critically important to further develop an effective intervention strategy. We performed the current systematic review to comprehensively summarize the evidence of instruments for the detection of MD in clinical nurses. The research design was a systematic literature review. We assigned 2 investigators to independently search potential studies in PubMed, EMBASE, and China National Knowledge Infrastructure (CNKI) from their inception to June 2020. We used data extraction table extracting essential information, and the modified critical appraisal tool evaluating the reliability and validity of eligible instruments. Finally, we qualitatively summarized results of all included instruments. No ethical approval was required because this systematic review was performed based on published studies. We included 16 eligible studies covering 10 original and 6 revised or modified instruments for the final analysis. The overall quality of all instruments was moderate because test-retest reliability was inadequately examined in most instruments. Of 16 instruments, MDS-revised (MDS-R) was broadly validated and employed in different working or cultural settings. Meanwhile, it also extensively expands to specifically detect MD. Moreover, other instruments including moral distress risk scale (MDSR) and moral distress thermometer (MDT) should be further validated and utilized because it covered the gap missed by most instruments. Although several instruments have been made available for clinical nurses, some of them have inadequate psychometric properties test, especially test-retest reliability evaluation. Meanwhile, most of them have not be validated and employed in other working or cultural settings. We therefore suggested further studies to validate the psychometric properties of existing instruments and then employed instruments with high reliability and validity to detect MD in clinical nurses.
What do we already know about this topic?
Clinical nurse extensively experiences moral distress in route clinical practice, and several instruments have been developed to measure the incidence and severity of moral distress among clinical nurses.
How does your research contribute to the field?
Although numerous developed instruments have been reported, however currently no systematic review to determine the optimal recommendation for appropriately selecting the instrument for the measurement of moral distress in clinical nurses.
What are your research’s implications toward theory, practice, or policy?
We made some recommendations for the selection of appropriate scale and propose suggestions for further development of instruments. Researchers, practitioners, policymakers, and other healthcare professionals in clinical nursing can obtain valuable and practical evidences from our current review for detecting the level of moral distress, identifying influential factors of moral distress, making management strategy for moral distress, and designing further studies focusing on moral distress.
Introduction
Moral distress (MD) has become a critically ethical problem experienced by health care professionals, especially clinical nurses in route clinical practice. 1 After it was first defined by Jameton, much work has been done to refine MD as a concept for improved understanding. A core tenet of MD is that clinical practitioners cannot take action in accordance with their professional ethical standards. 2
A clinical nurse who is experiencing MD may suffer from a series of serious consequences,3-5 which have negatively impact on psychological and physical wellbeing. And then a series of distressing symptoms will be occurred, such as frustration, powerlessness, anger, sleep disorders, fatigue, insecurity, and guilt.6,7 Studies also suggested that MD is also associated with decreased job dissatisfaction, increased the risk of turnover, early retirement, and long-term absences from work.2,8-12 As a result, the safety and quality of patient care provided by these nurses who were staying in serious MD may be significantly impaired because they will feel having not ability to provide quality care for patients because of job dissatisfaction13-15 or abandonment of the profession. 16
So when a clinical nurse is forced to endure serious MD for a long time and also experience burnout, he/she will avoid aspects of patient care with reduced patient advocacy. It is not surprising that MD will also have a negative impact on organizations and system because this serious status occurred in clinical nurses has been demonstrated to be associated with increased nursing turnover, poor quality of care, and decreased patient satisfaction. 17
Developing effective intervention strategies to mitigate the negative impact of MD on on clinical nurses, organizations, and system is the presently critical issue.18-20 However, development of valid and reliable instruments is the basis of accurately and early detecting MD, and then providing references for development of intervention strategies. To date, numerous instruments in healthcare have been reported, however it is a problematic issue how to select an effective and specified one because no systematic review to comprehensively analyze and compare available instruments for MD in clinical nurses. Precise validated psychometric instruments play an important role in accurately measuring the level and degree of MD among clinical nursing practitioners. More importantly, however, the psychometric characteristics of all available instruments for measuring MD have not yet been systematically and objectively evaluated, which also deeply obstacle the precise selection of instruments.
Objective
Although several instruments have been reported in literature and can be used to measure the frequency and intensity of MD in health care professionals, no systematic literature review has been performed to comprehensively summarize the psychometric characteristics of instruments of clinical nurses’ MD. Therefore, we designed the current systematic review to document and analyze all potential instruments for detecting clinical nurses’ MD in order to contribute to appropriately select tool to measure the MD in clinical nurses.
Research Design
We designed the framework of the present systematic review in accordance with the recommendations proposed by the Cochrane Collaboration (CC). 21 And then, we reported our findings in line with the structure of the preferred items for systematic review and meta-analysis (PRISMA) statement. 22 We did not need the ethic approval and informed consent because all results in our systematic review were obtained from published studies.
Identification of Citations
A comprehensive search, which was performed by 2 independent investigators, was performed in PubMed, Embase, and China National Knowledge Infrastructure (CNKI) database from their inception to June 2020. We updated the search results per week in order to identify any newly potential records. We developed the search strategy with a combination of medical subject heading (MeSH) and text words. All search strategies were amended according to the unique criteria of each database. The search strategies were summarized in Table 1. We also hand checked bibliographies of topic-related reviews and eligible studies for the purpose of capturing any potential studies. Finally, we used EndNote software to manage all records. The principle of consulting a third senior investigator was introduced to solve any divergency about identification of studies.
Search Strategies of All Targeted Databases.
CNKI = China National Knowledge Infrastructure.
Selection Criteria
In this systematic review, we only considered methodology article of developing and validating original and expanded instruments of detecting MD among clinical nurses or healthcare professionals involving clinical nursing practitioners. For instruments with several versions, we only included the latest version for the final analysis. Moreover, we also considered studies which enrolled clinical nurses as the part of the sample for psychometric evaluation to be eligible. We excluded studies for validation of the modified version of original instruments, and also excluded studies reporting the development and validation of instruments for other health care professionals rather than clinical nurses. In our systematic review, only studies published in English- or Chinese language were taken into account for inclusion.
Data Extraction
In this systematic review, all essential data including leading author, publication year, country, number of items, sample for psychometric properties test, and factor structure were obtained as a result of 2 independent investigators with a standard data extraction sheet. The corresponding author was contacted when we cannot obtain sufficient information from an included version. Any divergency about data extraction was solved through consulting a third senior investigator.
Assessment of Quality
As the goal of the current systematic review was to qualitatively summarize the evidence of all available instruments for detecting MD in clinical nurses, we did not aim at assessing the overall quality of all eligible studies. Instead, we critically assessed the reliability and validity of included instruments with some items from critical appraisal tool (CAT) which was developed by Brink and colleagues in 2012 as part of the scope of this systematic review, and not as a criterion to exclude studies or to assess the validity of the results of single studies. 23 According to the aims, we utilized item 1 (if human subjects were used, did the authors give a detailed description of the sample of subjects used to perform the test?), 8 (was the stability of the variable being measured taken into account when determining the suitability of the time interval between repeated measures?), 10 (was the execution of the test described in sufficient detail to permit replication of the test?), 12 (were withdrawals from the study explained?), and 13 (were the statistical methods appropriate for the purpose of the study?) to assess the psychometric properties of all included instruments. 23
Method of Analysis
In our current study, no quantitative information will be accumulated. Considering our specific objective, we qualitatively summarized the essential information of eligible instruments, and then evaluated the advantages and disadvantages as following 3 aspects: psychometric evaluation, factor structure, and application objects.
Findings
Result of Identification
We delineated the process of retrieval and selection of studies in Figure 1. We captured 863 records at initial search stage in PubMed (n = 438), Embase (n = 328), and CNKI (n = 97) and added additional 3 records from application studies, and 846 records remained after removing duplicates using the literature management software EndNote X7. We further excluded 786 records after carefully checking the title and abstract, and thus, 60 items were included to be checked in full-text. Finally, 16 eligible studies12,24-38 including 10 original and 6 revised instruments were included for the final analysis after excluding 44 studies due to several reasons including unrelated to topic (n = 29), validation of instruments in different populations (n = 3), cross-cultural adaptation and validation of instruments (n = 7), validation of modified instrument (n = 1), and development and validation of other instruments such as moral distress map (n = 4).

Flow chart of identification and selection of studies.
Characteristics of Included Studies
We documented the basic characteristics of all eligible studies in Table 2. Of these 16 eligible studies,12,24-38 5 were performed in USA,27,29,30,34,38 3 were in Italy,26,31,32 2 were in Sweden,24,37 and remaining studies were in other countries such as Brazil, Japan, and Iran. All studies were published between 2000 and 2019. Ten original instruments were reported, and other 6 instruments were all developed after revised the MDS or MDS-R. Out of these 16 instruments, 8 were designed as generic purpose and remaining 8 were as exclusive to certain population such as pediatric nurses and intensive care nurses.
Characteristics of All Instruments Included in the Final Qualitative Analysis.
We extratced the essential information of MDS-PV because original version of this instrument was not accessable.
MDS-CN was initially created by Muccio et al in 2017 and then validated by Lazzari and colleagues in 2019.
Assessment of Psychometric Properties
The overall result of assessment of psychometric properties of each seligible instrument is shown in Figure 2. According to the assessment result, the development of most of the instruments adequately considered reliability and validity, however test-retest reliability was not adequately evaluated.

Assessment of reliability and validity of all included instruments in the present study. These assessments were performed as following: (1) if human subjects were used, did the authors give a detailed description of the sample of subjects used to conduct the test?, (2) was the stability of the variable being measured taken into consideration in assessing the suitability of the time interval between repeated measures?, (3) was the execution of the test described in sufficient detail to allow replication of the test?, (4) were withdrawals from the study explained?, and (5) were the statistical methods appropriate for the purpose of the study? The overall quality was moderate because most of instrument have inadequate test-retest reliability evaluation.
Qualitative Summaries of All Instruments
Moral distress scale and revisions
Moral distress scale (MDS) is the first instrument which was initially developed to measure the frequency and intensity of MD in intensive care nurses in 2001 by Corley and colleagues. 17 Original MDS was designed to have 32 items with 3 factors structure on a 7 points Likert scale. The psychometric properties of MDS were examined among 214 intensive care nurses from American hospitals, and test result indicated a Cronbach’s alpha of .98 for individual responsibility, .82 for not in patient’s best interest, and .84 for deception, respectively. After validated, MDS has been extensively used to detect the frequency and intensity of MD in intensive care nurses.3,39 For example, Rice and colleagues performed a study to determine the prevalence and contributing factors of MD in 260 medical and surgical nurses, and found that MD is common and can be elicited from different types of situations encountered in the work environment.
Considering the limitations of the original version, Corley and colleagues reported a modified version in 2005. 27 Additional 6 items were added in the modified version, and a content validity index of 1.0 was obtained for 38 items. After employed this modified scale in 106 registered nurses, a Cronbach’s alpha of .98 was produced. To date, this modified scale has also been extensively implemented in empirical studies. For example, Elpern et al performed a study to assess the level of nurses’ MD working in the medical intensive care unit, and found a high level of MD. 40
Fruet et al adapted MDS and then assess the applicability of this version in the context of nursing in hemato-oncology services in Brazil in 2017. 41 Finally, 26 questions were included in this modified version with 3 factors structure including lack of competence in the team, denial of the nursing role as the patient’s advocate, and disrespect for the patient’s autonomy. Psychometric test suggested a Cronbach’s alpha of .98. To date, however, this revised version of MDS has not been validated and employed in further studies.
In 2017, Badolamenti et al modified MDS and developed a brief instrument (MDS-11) to determine MD. 26 Factor analysis of 347 nurses generated a 2-factor structure with 11 items. Psychometric test suggested a high reliability with Cronbach’s alpha of .823 for the futility dimension and .756 for the potential damage dimension, respectively. However, this version was not presently validated and utilized in further studies.
In 2010, for the purpose of proposed a reliable and valid MD assessment tool which can be used in multiple health care settings and with multiple disciplines, Hamric and colleagues adapted MDS to cover more distressing situations and shortened to include 21 items, which were evaluated using a 5-point Likert scale. 30 Authors utilized this scale in 169 registered nurses and 37 physicians to demonstrate its reliability and construct validity. As the first generic instrument of MD for healthcare professionals, moral distress scale - revised (MDS-R) has 3 parallel versions for nurses, physicians, and other health care providers, and has also been extensively utilized by other subsequent survey or comparative studies,42-44 as well as been extensively validated in other countries including Turkey, 45 Italy,32,46 Australia, 47 China, 48 Korea, 49 Brazil, 50 Sweden, 51 and Iran.52,53 To date, the MDS-R has 6 versions, designed for different healthcare provider populations: adult-nurse, adult-physician, adult-other, pediatric-nurse, pediatric-physician, and pediatric-other. 29
In 2015, Shoorideh expanded the MDS-R to specifically use in intensive care units. 12 This scale was developed with the content analysis approach based on the published theoretical frameworks including Jameton’s conceptualization of MD, House and Rizzo’s role conflict theory, and Rokeach’s value theory. The initial scale was structured with 3-factor including 30 items, which were answered at 0 to 4 Likert scale format, with 0 indicating none and 4 indicating great extent. The content validity index was .98, .95, and .96 for relevance, clarity, and simplicity, respectively. Cronbach alpha of .96 also indicated a highl reliability. Unfortunately, this instrument has not previously been extensively employed in subsequent empirical studies.
Although the aim of the MDS-R and corresponding expanded version are to detect MD in healthcare professionals or specific populations, some root causes were not covered by them.54-56 As a result, Epstein et al constructed and validated an updated instrument, measure of moral distress for healthcare professionals (MMD-PH), to detect the MD in healthcare professionals. 29 Application in 653 professionals demonstrate its reliability and validity with a Cronbach’s alpha of .93. These authors therefore recommend to replace the MDS-R with the MMD-HP as a generic instrument of MD in healthcare professionals. However, further studies should be designed to extensively validate MMD-PH as a preferred tool of measuring MD in healthcare professionals because several limitations existed in the methodology study. 29 Certainly, the validation in different cultural settings should also be considered.
Moral distress measures for specific populations
The parallel version MDS primarily focused on pediatric setting has been used in clinical practice, and the psychometric properties of revised version of the original MDS-PV has also been validated in Iran 57 and Sweden. 58 Moreover, Professor Corley also developed moral distress scale neonatal-pediatric version (MDS-NPV), which which has also been employed in neonatal intensive care unit nurses 59 although the Corley’s original version was not published. In 2014, Lazzarin et al revised and employed MDS for nurses in pediatric context (MDS-PV), 32 which was initially developed by Corley et al based on MDS. In their empirical study, Lazzarin et al shorten items from initial 38 to 33 which was answered with 0 to 6 Likert scale, and produced a highly reliability with a Cronbach’s alpha of .959. Meanwhile, Professor Corley also confirmed the equivalence between adapted version and the original version. The psychometric properties of MDS-NPV in nurses working other pediatric settings rather than intensive care should be further investigated.
In 2017, Muccio et al initially created a MDS for correctional nurses (MDS-CN) based on MDS-R, which was subsequently employed and validated by Lazzari and colleagues in 2019. 31 This scale was structured with unidimensiona with 20 items, which should be indicated using the numerical number from 0 to 5 format. Content validity index, Cronbach’s alpha, and test-retest reliability was 99.0, .91, and .99 in original version respectively, and the subsequent study of 238 correctional nurses confirmed its reliability with a Cronbach’s alpha of .89.
Eizenberg et al found that systematic examination for measurement equivalence of all applicable instruments has not been reported, and thus they developed a culture-sensitive MD questionnaire for nurses working in across working settings in 2008. 28 Authors firstly elicit the culture specific themes based on focus group interview, and then tested the psychometric properties of questionnaire in 179 nurses from a variety of work settings. The psychometric analysis suggested a Cronbach’s alpha of more than .79, with a Cronbach’s alpha of .851, .791, and .804 for the first, second, and the third factor, respectively. Although as the first generic instrument of measuring MD in clinical nurses, no published study utilized it to measure MD. Therefore, further studies are needed to evaluate the measure in different cultural settings.
Because MDS was not administered in psychiatric setting, Ohnishi et al developed the MDS for psychiatric nurses (MDS-P) through combining the 24 items derived from the MDS with and then employed it in Japanese psychiatric nurses in 2010. 33 MDS-P consists of 3 factors structure with 15 items in a 7-Likert scale. In this scale, the frequency and intensity were all marked from 0 to 6, and the score was positively associated with the severe levels of MD. A total of 369 psychiatric nurses from 6 Japanese hospitals were enrolled to respond the scale, and subsequent psychometric analysis suggested a Cronbach’s alpha of .90 for the whole scale, with .85, .82, and .79 for factor 1, 2, and 3 structure, respectively. As an instrument specific to psychiatric nurses, further studies should be designed to test its psychometric properties across cultural settings.
Moral Distress in Dementia Care Survey (MDDCS) tool was created by Awosoga and colleagues in 2018, with the purpose of specifically examine MD among nursing staff caring for the unique population with dementia. 25 The team conducted an exploratory sequential mixed method to generate item pool, and then enrolled 389 sample of consisting of registered nurses, licensed practical nurses, and healthcare aides to test the psychometric properties. After completing factor analysis, authors got a 3-factor structure covering 55 items on a 5-Likert scale. The construct validity was demonstrated by the CFA method. Psychometric evaluation indicated a Cronbach’s alpha of .94, .92, .93, and .83 for the frequency of MD, severity, effects, and mitigating factors, respectively.
Other moral distress measures for nurses
Sporrong et al found that, in 2006, several instruments can be used to closely measure MD, however the scope of application of these instruments is limited. 37 Thus, they developed a 9-item MD questionnaire in 2006 to cover this gap, 37 which can be called as moral distress questionnaire for clinical nurse (MDQ-CN). And then, authors tested the validity and reliability of this questionnaire in 259 staff members working in 200 departments and 59 pharmacies, and obtained a Cronbach’s alpha of .78 for factor 1 and a Cronbach’s alpha of .62 for factor 2, respectively. Although this study enrolled numerous professionals with diverse working settings to validate the reliability and validity of the questionnaire, inadequate root causes limited the application value. Moreover, no further empirical study to determine the psychometric properties.
Brazilian scale of moral distress in nurses was initially developed by Ramos and colleagues in 2017 in order to specifically detect the frequency and intensity of MD. 35 The initial version included 57 questions but did not performed factor analysis. Since then, original authors performed a methodology study to test the psychometric properties of this scale. 60 Factor analysis shorten items from 57 to 49 and thus generated a 6-factor structure including (a) acknowledgment, power, and professional identity; (b) safe and qualified care; (c) defense of values and rights; (d) work conditions; (e) ethical infractions; and (f) work teams. Psychometric test indicated a high reliability with a Cronbach’s alpha of .980 for the whole scale. Moreover, these authors also investigated the application of this scale in the primary health care setting and also obtained a high reliability. 61 Moral distress thermometer (MDT) is a tool of rapidly measuring MD and tracking changes in MD over time, as well as quantitatively the time-specific level of MD, which was developed by Wocial and colleagues in 2012. 38 Authors analyzed the data from 529 nurses to demonstrate the convergent validity and concurrent validity of MDT, and then indicated that MDT has great potential as a screening tool for use in research, evaluating the effectiveness of interventions designed to decrease a nurse’s level of MD. To date, some empirical studies have employed this instrument to examine the level of MD in school nurses62,63 and intensive care unit nurses.64,65 However, the application in other clinical nurses or cultural settings has not yet been reported.
Other measures of moral distress and related topics
We also identified another 4 instruments including Moral distress risk scale (MDRS), 36 MD assessment questionnaire (MDAQ), 66 ethics stress scale (ESS), 34 and stress of conscience questionnaire (SCQ), 24 which were all also designed to measure the level of MD in nurses.
Moral distress risk scale (MDRS) was created by Schaefer et al in 2019 in order to offer a tool that could be useful beyond hospital care settings. 36 After identified 53 items, the team enrolled 268 nurses from hospitals and primary healthcare setting to perform factor analysis and psychometric evaluation. Finally, this scale was developed with 7 factors covering 30 items on a 4-Likert scale (1 indicates never and 4 represents always). Psychometric test indicated a good internal consistency of .913. Compared to previous instruments which were all built based on causes, MDRS was developed at the basis of risk factors which may reduce the negative impact of subjectivity of MD on the measurement result. Thus, considering this advantage, further studies should be performed to validate the psychometric properties of MDRS in other working and cultural settings.
The MDAQ, which was established in 2002 by Hanna, can measure MD from type, intensity, frequency, and duration of the experience. According to published literature, 37 this questionaire can be utilized across disciplines; however psychometric evaluation for it has not yet been performed.
In 2000, Raines and Tymchuk developed a novel and self-administered instrument, which was named as ESS, to measures the individual’s perception for MD. 34 In this scale, 56 questions were designed and the first 52 questions rated on a Likert-type scale from 1 (agree strongly) to 5 (disagree strongly). Questions 53 through 56 are designed to provide additional information to the researcher and are intended to be answered in various ways. After developed this scale, Raines employed it in emprical study and obtained a content validity of .89 (P < .05), a test-retest reliability of .82 (P < .005), and a Cronbach α of .87. Unfortunately, this scale can not be extensively validated or utilized in subsequent studies.67,68
SCQ was developed by Glasberg in 2007 with the purpose of measuring stress arising from a disturbed conscience. 24 The questionnaire was structured with 2 parts covering 9 items. In part 1, the respondent was asked to indicate the stressful situation with a 6-Likert scale from 0 (indicating never) to 5 (indicating every day). In part 2, the respondent was required to indicate the level of bad conscience based on the degree of guilt for every situation in the part 1 on a 10 cm visual analogue scale (VAS) from 0 (indicating no, not at all) to 10 (indicating yes, gives me very bad conscience). 69 In this questionnaire, higher total scores represents higher the perceived stress level. Psychometric evaluation indicated a Cronbach’s alpha of .83, which also demonstrated in empirical study. 69 Moreover, SCQ was also revalidated by Ahlin and colleagues in 2012 to be valid for Swedish settings. 70
Discussion
Moral distress has become the critically negative phenomenon faced by clinical nurses in various workplaces. 71 Moral distress has demonstrated to be associated with several negative consequences including increased risk of turnover, dissatisfaction with work, intention to leave from work.26,29 Appropriate detection and subsequent interventions play a critical role in mitigating the negative impact of moral distress on physical and psychological aspects of nurses. 29 Validated and reliable instrument is the effective approach to early detect moral distress and then provide references for developing intervention strategy.
After development of the original MDS in 2001, several other instruments which were designed to measure the MD in healthcare professionals or clinical nurses working in different settings have been created. 26 In the current systematic review, we captured 10 original and 6 revised instruments which were all created to measure the frequency and intensity of MD in healthcare professionals or clinical nurses.12,24-38 Of these instruments, 5 were all reported to be specific to psychiatric nurses, 33 pediatric nurses, 32 dementia care nurses, 25 and correctional nurses, 31 and intensive care nurses. 12 It is noted that 6 instruments were developed based on previous 2 instruments including MDS and MDS-R.25,26,29,31-33 Out of 16 instruments, 2 including original MDS and MDS-R have been extensively translated to other language and have also been validated.5,50-52 As stated above, generic instrument for measurement of MD in nurses and healthcare professionals has been extensively investigated, however the investigation on the specific instrument is still inadequate. Further studies should be performed to develop and validate instruments specific to certain populations, and also to demonstrate the application of existing specific instruments in other working or cultural settings.
Theoretical framework play a critical role in designing and developing instrument in order to evaluate and correlate indicators and constructs, strengths, and deficiencies of theories. 72 For the purpose of guarantying the scientific and logic characteristics of an instrument, it is imperative to report the theoretical framework as followed in method section. Meanwhile, considering this issue, theoretical framework must be followed when revised or adapted from existing instruments for application in other cultural or working settings. Of these 16 instruments captured in our study, 7 instruments clearly stated the followed theoretical frameworks.25,27-30,32,35,38 Therefore, the remaining 9 instruments which did not reported the followed theoretical framework should be selected cautiously in empirical study, and should also be further validated in different settings.
MD may be changed as the time change, and thus longitudinal study of exploring the dynamic chang model of MD in clinical nurses is decisive for further developing effective intervention strategy. In order to address this issue, analysis of test-retest reliability must be performed. 73 Of these 16 analyzed instruments in the present study, only 4 clearly reported to have satisfactory test-retest reliability, and thus can be utilized to investigate the application value in other settings.27,28,30,31 This issue was also the leading cause of reducing the overall quality of all instruments in the present study. Therefore, we strongly suggested that remaining 12 instruments without test-retest reliability test must be still to be tested for assessment of psychometric properties.
Regardless of the fact that several instruments have been reported to date, and some of these instruments have also been translated or adapted into other cultural or working settings, no study has been reported to test the measurement equivalence of the original and revised or modified versions. Meanwhile, measurement equivalence of 1 instrument in the application of different populations such as adult and pediatric patients was not reported. In order to detect reliable frequency and intensity of MD in diverse cultural or working settings, further studies should be designed to test the measurement equivalence of these instruments.
Our study systematically summarized the evidence of instruments of measuring MD in clinical nurses after searching and checking potential studies, however some limitations must be further interpreted. Firstly, despite the fact that we searched relevant studies from 3 electronical databases including PubMed, EMBASE, and CNKI, others such as SCOPUS and Web of Science did not search. So some potentially relevant instruments may be missed. However, 3 targeted databases searched in our study were recommended mainly by Cochrane handbook for performing systematic review because it covers most of the published literature. 21 So, we convinced this issue may not significantly impaired our findings. Secondly, we did not search gray literature, and thus those developed instruments but not published were also missed. However, some gray instruments have been employed and validated in some published studies, we therefore extracted indispensable information from these instruments for further analyzed.31,32 So, we convinced this issue may also not negatively the reliability of our findings. Thirdly, our systematic review only considered studies published in English and Chinese language, and thus some instruments developed and evaluated in other languages or cultures may be also missed.
Conclusion
As one of the most common negative phenomenons experienced by clinical nurses daily, MD has greatly attracted the attention from health care professionals. Regardless of the fact that several instruments have been developed to measure the frequency and intensity of MD, most of them have not been extensively validated in other working or cultural settings, and some were not assessed for test-retest reliability. Meanwhile, some instruments were not developed based on established theoretical frameworks, and most instruments specific to certain populations were created through modifying or revising existing instruments. Therefore, further studies were suggested in order to validate the existing instruments and develop new instrument under consideration for variability of diverse populations and cultures.
Footnotes
Acknowledgements
The authors would like to thank the authors who performed all studies included in the systematic review.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed the following financial support for the research, authorship, and/or publication of this article: This study was supported by Grant from the Technological Innovation and Demonstrational Application Project of Chongqing Science and Technology Bureau (project no. cstc2018jscx-msybX0030).
