Abstract
Objective
Nonhuman primates (NHPs) are suitable for being model animals in the study of consciousness and loss of consciousness (LoC) with a similar brain structure and function to humans. However, there is no effective consciousness assessment scale for them. This study aimed to develop a behavioral assessment scale of consciousness for NHPs.
Methods
We constructed an initial indicator framework based on the clinical consciousness disorder assessment scales and the physiological characteristics, consciousness, and arousal behavior of NHPs. A two-round online Delphi method was conducted by a multidisciplinary expert panel to construct a behavioral assessment scale of consciousness for NHPs. The indicators and descriptions were revised according to the experts’ feedback and then sent out for repeated consultations along with a summary of the results of the previous round of consultations. The accepted competencies of indicators were established with mean scores in two scoring criteria (importance and feasibility) ≥4.0, agreement rate with a rating of importance or essential ≥70.0%, and a coefficient of variation ≤0.25, as well as discussions of the research group.
Results
Consensus was achieved after the second round of consultations, which was completed by 28 experts who specialized in rehabilitation, neuroscience, psychology, neurosurgery, and neurology. A new behavioral assessment scale of consciousness for NHPs, including 37 items organized hierarchically within seven dimensions including visual function, auditory function, motor function, orofacial movements, arousal, brainstem reflexes, and respiration, was developed in this study.
Conclusions
This study has successfully developed a behavioral assessment scale for measuring the conscious state of NHPs or NHP models with LoC. This tool is expected to facilitate future research into the underlying mechanisms of consciousness by providing a detailed and comprehensive means of measurement.
Introduction
Consciousness is thought not to be directly observed, but to be identified primarily through reasoning from the perspective of a third person. 1 This mainly relies on subjective reports of the participants to evaluate their consciousness, which poses considerable challenges to the reasoning when their subjective reports are not available. Currently, the understanding of consciousness and loss of consciousness (LoC) is largely dependent on individual case studies or clinical trials in different patients or healthy controls, 2 which have limited ability to export data and implement controlled experimental designs. For example, long-term clinical trials are subject to selection bias or cohort effects which reduce the reproducibility of results, and many outcomes cannot be directly measured in humans for ethical or moral reasons. This may lead to the mechanisms responsible for consciousness and LoC being incompletely understood. Additionally, the lack of such information may limit physicians in the accurate determination of levels of consciousness ultimately, 3 which has a negative influence on making treatment plans and prognosis assessments for patients with LoC.
The use of preclinical animal models is already widespread for elucidating unknown mechanisms underlying consciousness and LoC because they offer precise control with reduced variability, rich data output, and limited ethical complexity. 4 Among these, rodents have been studied more and play an irreplaceable role, however, their neural network connectivity and circuit function differ greatly from that of humans, and some types of LoC model cannot be realized, such as diffuse axonal injury. 5 Nonhuman primates (NHPs) have evolutionary similarities to humans in terms of anatomy, physiology, genetics, and neurological function as well as their cognitive, emotional, and social behavior 6 enabling them to play a vital role in medicine and biomedical research. Especially, neuroscience research using NHPs has led to a deeper understanding of the mechanisms of advanced brain function and diseases. 7 Furthermore, numerous experiments have shown that NHPs have consciousness.8,9 Therefore, we believe that it is feasible and valuable to use NHPs to study the mechanism of consciousness and LoC.
Although electroencephalography (EEG), 10 positron emission tomography-computed tomography (PET-CT), 11 and functional magnetic resonance imagery (fMRI) 12 can be used as tools to assess consciousness, behavior evaluation is still the gold standard currently because of their expensive, time-consuming and inconvenient operation. Scales such as the Glasgow Coma Scale (GCS), 13 Full Outline of Un-responsiveness (FOUR), 14 and Coma Recovery Scale-revised (CRS-R) 15 are widely used in clinical diagnosis and treatment. There is still a lack of systematic research on the assessment tools for the state of consciousness of NHPs. Gennarelli et al. 16 found it difficult to describe the consciousness state of the NHP model with diffuse axonal injury. Then, they selected several behavioral indicators based on clinical scales used in patients with LoC, such as corneal reflexes, eye-opening, and visual following. 16 Similarly, Tasserie et al. 17 and Redinbaugh et al. 18 have developed evaluation indexes for the anesthetic awakening behavior of NHPs based on clinical assessment scales, including limb face movements, oral signs, and body movements. Although there were fewer items in the above scales, the previous studies indicated that we can extract assessment indexes suitable for NHPs from human clinical behavioral consciousness assessment scales. However, it is necessary to develop a scientific, effective, and operable tool for NHPs.
The Delphi method is a research technique used for collecting and transforming experts’ opinions and information into a group consensus. 19 The investigators draw up a draft scale and consult with the members of the expert group, respectively, in the form of letters according to a certain procedure, while the experts submit feedback anonymously. After several rounds of survey and feedback, the opinions of the members of the expert group gradually became consensus, and finally, obtain a collective judgment result with high accuracy. 20 Implementing an online Delphi study may overcome some limitations found with decision-making processes in face-to-face committee meetings, with the advantage of less cost and feedback being provided in a controlled form. 21 It is commonly chosen as a way to achieve a consensus for clinical standards and the construction of scales. 22
This study utilized a two-round online Delphi method to formulate a behavioral assessment scale of consciousness for NHPs, incorporating the consciousness and behavior characteristics of NHPs and the clinical conscious behavior assessment scales used for patients with disorders of consciousness (DOC). This work provided a potential tool for further research on the neural mechanisms of consciousness and LoC.
Materials and methods
Delphi procedure
In the present study, we first formulated an original draft of the behavior assessment scale for NHPs’ consciousness state through a literature review, theoretical analysis, and discussion within the research group. Then, the experts’ opinions were collected using an e-Delphi method with two-round online questionnaire surveys. If a consensus was reached in round two, the decision could be made to stop the process, conversely, another round might be set up. After the first round of consultation, the suggestions and opinions of the experts were sorted and analyzed. The researchers appropriately modified the questionnaire for the second round of consultation, based on the outcomes of the preliminary consultation and group discussion. After two rounds of online consultation, expert opinion on the evaluation of each item tended to be consistent and the consultation was stopped, and the final version of the behavioral assessment scale of consciousness in NHPs was formed finally.
Formulating an original draft of the evaluation scale
We organized a research group responsible for literature review, theoretical analysis, selection of the expert panel, compilation and distribution of questionnaires, data process, and analysis, including three professors, two associate professors, and five postgraduates.
The literature on consciousness assessment tools for NHPs and humans in PubMed, Elsevier Science Direct, OVID, Web of Science, Google Scholar, China National Knowledge Infrastructure (CNKI, in Chinese), China Science and Technology Journal Database (VIP, in Chinese), Wanfang Data (in Chinese) and the Chinese Biomedical Database (CBM, in Chinese) were searched and reviewed comprehensively before the Delphi survey to gather evaluation indicators, evaluation manual, and descriptions. Researchers also identified additional relevant articles using experiential knowledge of the LoC scale. The relative studies on rodents, dogs, horses, and other animals were also included. As a result, 27 behavioral scales on the assessment of consciousness and LoC were screened out, which are detailed in Supplemental Table S1. 23 Further, the grading standards and operating procedures of each evaluation indicator in those scales were classified and summarized. From this, the evaluation indicators suitable for NHPs were selected according to their physiological characteristics, and conscious and arousal behavior. Finally, the original draft of the consciousness behavioral evaluation scale on NHPs was generated, including 38 items organized hierarchically within seven dimensions, as well as detailed assessment methods and response scores, which are detailed in Supplemental Table S2.
Construction of correspondence questionnaire
Formation of the expert panel
In the present Delphi study, experts were selected based on their academic credentials, clinical and/or research experience in consciousness, LoC, NHPs, and constructing a scale. The inclusion criteria were as follows: (a) at least 10 years of work and academic experience, (b) an assistant professor title or above, (c) a bachelor's degree or above, and (d) being willing to answer the expert consultation questionnaire in every round. There were no rigorous standards for the sample size of the Delphi survey. 24 Thirty potential experts were invited by purposive sampling, and they were anonymous to each other and did not receive any service fee for participation.
Data collection and management
We conducted the online Delphi method with two iterations of questionnaires from 1 July 2022 to 30 October 2022. The self-designed questionnaires were distributed via an online system https://www.wjx.cn/ (Questionnaire Star, Changsha Ranxing Information Technology Co., Ltd, Changsha, China). Next, we shared the URL with experts via the social media WeChat and requested them to complete the questionnaires within two weeks. Experts remained anonymous to one another throughout the study, and they could contact the researcher at any time if the questionnaire was not clear or items were missing. Every response to the questionnaire was automatically collected into an Excel spreadsheet by the Questionnaire Star for further data analyses.
Data process and statistical analysis
The scientific soundness and rationality of the Delphi method are reflected by four indicators: positive coefficient, concentration degree, authority coefficient, and coordination coefficient of experts.
The experts’ positive coefficient
The positive coefficient reflects the importance that experts attach to this research and determines the credibility and scientific basis of the results, which is usually calculated by the effective response rate of the consultation questionnaire. Generally, the more active the experts are, the higher the experts attach importance to this questionnaire, and then the more they pay attention to this research project. Response rate = [(Number of returned responses)/(Number of invitations)×100%]. Relevant authoritative studies show that an effective response rate of 50% is the minimum acceptable value and over 70% is a very good standard for the Delphi method. 25
The experts’ concentration degree
The concentration of expert opinion is expressed by the mean value of importance, standard deviation, the CV, and the agreement rate of participants with a rating of 4 or above. In this study, the Likert 5-level scoring method was used to score the importance of each item. The values from 1 to 5 were "not important" to "very important," and the mean value was from 1 to 5. The higher the mean value, the more important the item accordingly. The agreement rate of participants refers to the ratio of the number of experts who think that an item is very important (5) or relatively important (4). The range of agreement rate value is 0–1. An agreement value ≤0.5 was regarded as an indication of small importance, while, an agreement value >0.6 means this item can be included in the next round of the Delphi study. The CV reflects the fluctuation degree of experts’ opinions on each item, also an important basis for item deletion. The lower value of CV means a lower divergence of views among experts and better consistency of views on the importance or feasibility of this item. 26
The experts’ authority coefficient
There is a close relationship between the authority of experts and the consultation results. The authority coefficient (Cr) is determined jointly by two dimensions: the familiarity coefficient (Cs), which represents the expert's familiarity with the investigation, and the judgment coefficient (Ca), which reflects the evidence for the expert to make a judgment. As shown in Table 1, we used a scoring matrix that has been widely used and validated for Delphi studies in China. 27 Cr can be measured by the formula Cr = (Cs + Ca)/2. Generally, a Cr value >0.7 is regarded as an acceptable indicator of reliability. 28
Judgment basis and familiarity for consultation from experts.
The experts’ coordination coefficient
Kendall's coefficient of concordance (Kendall's W) is used to assess the degree of coordination between expert opinions on the evaluation item. Its value ranges from 0 to 1 and mostly fluctuates around 0.50. A higher Kendall's W value indicates a better consistency of coordination among experts. Generally, the statistical significance (
The inclusion and exclusion criteria
The accepted item inclusion criterion in all two scoring criteria (importance and feasibility): mean score ≥4.0, agreement rate ≥70.0%, and CV ≤0.25. If a CV >0.25, the research team further discussed and decided whether to retain or delete the corresponding item by combining the agreement ratio and mean value comprehensively.
Statistical analysis
Two researchers entered and checked the consultation results from each round independently. All quantitative data analysis was undertaken using the Statistical Package for the Social Sciences (SPSS software; Version 26, IBM Inc., Chicago, IL, USA).
29
The level of statistical significance was predetermined as the
Results
Experts’ socio-demographic characteristics and enthusiasm
In the round one consultation of the Delphi study, 30 experts were invited to participate, and 28 completed responses were received, resulting in a response rate of 93.33%. In the round two consultations, all 28 experts who participated in the first round responded, resulting in a 100.00% response rate. These experts were selected from diverse regions and disciplinary backgrounds and were highly authoritative and representative, contributing significantly to the validity and reliability of the research. Specifically, the expert panel consisted of 20 males (71.43%) and 8 females (28.57%). Twenty-three experts (82.14%) held a PhD degree, and all experts held a mid-level or higher professional title. Twenty-seven experts (96.43%) held either a deputy senior or senior professional title. Of the panel, two experts (7.14%) were employed in universities and 26 experts (92.86%) in hospitals. The panel's professional expertise spanned several disciplines, including neuroscience, neurosurgery, rehabilitation, and related fields. The experts were from 13 different provinces, including Beijing (5), Shanghai (5), Guangdong Province (5), Zhejiang Province (2), Jiangsu Province (2), Tianjin (1), Hebei Province (1), Hubei Province (1), Hunan Province (1), Anhui Province (1), Shaanxi Province (1), Guangxi Province (1), and Jiangxi Province (1). See Figure 1 for details.

Characteristics of experts in the Delphi study.
Experts’ authority coefficient
The authority coefficients of Ca, Cs, and Cr were 0.882, 0.736, and 0.809 in round one consultation (Table 2). After the round two consultations, the coefficients increased to 0.896, 0.743, and 0.820, respectively. These results show that the authority coefficient was satisfactory, as it exceeded 0.70. The high authority coefficients indicate that the results of this study are more reliable.
Authority coefficient of the Delphi survey experts.
Experts’ opinion coordination coefficient
In round one, Kendall's W on the importance of the first, and second level and all items were 0.493, 0.561, and 0.493 respectively, meanwhile, Kendall's W on feasibility was 0.591 (all
Opinions’ coordination coefficient of the Delphi survey experts.
The experts’ concentration degree
The grading on the original scale in round one
The original questionnaire consisted of 38 items organized hierarchically within seven dimensions, as well as detailed assessment methods and response scores. The concentration of expert opinion was assessed by the mean value of importance and feasibility, standard deviation, CV, and agreement rate of participants with a score of 4 or 5.
All mean values on the importance of the items were >3.5, and only 1 item was <4.0 (Ⅳ-5, 3.964 ± 0.999). The IQR of all these items was ≤1.00. All agreement rates of the items were >60.00% with seven items achieving 100.00% agreement, and only one item had <80.00% (Ⅳ, 78.57%). CV of seven primary indicators and 34 secondary items were <0.25, and the items Ⅳ-1, Ⅳ-4, Ⅳ-5, and Ⅳ-5 were valued at 0.260–0.265.
All mean values on the feasibility of the secondary items were >4.0. The IQR of all these items was ≤1.00. All agreement rates of the items were >60.00% with 10 items achieving 100.00% agreement, and only 1 item had <80.00% (Ⅶ-5, 78.57%). CV of 36 secondary items was <0.25, and the items Ⅳ-4 and Ⅳ-6 valued at 0.260 and 0.267. These data are reported in Table 4.
The grading on the original scale in round one.
SD: standard deviation; IQR: interquartile range; CV: coefficient of variance.
* represents the coefficient of variance value greater than 0.25.
Experts’ opinion and revision
It was evident that during the first round Delphi study, further discussion was required for potential items that had statistical indicators at the critical level. Specifically, attention was focused on the secondary items of IV (orofacial movements), as shown in Table 5. After considering the mean score, agreement rate, and CV, as well as holding discussions within the research group, we deleted item V-6 and made appropriate modifications and retentions to dimension IV and its items. They were subsequently modified and adjusted before being proposed for a new rating in the second round of the study.
The potential items need further discussion after round one.
The grading on the original scale in round two
All mean values on the importance of the items were >4.0. The IQR of all these items was ≤ 1.00. All agreement rates of the items were >80.00% with 15 items achieving 100% agreement (33.33%), and only 3 items had <85.00% (Ⅳ, Ⅳ-6, and Ⅶ-1). All CVs of the items were <0.25.
All mean values on the feasibility of the secondary items were >4.0. The IQR of all these items was ≤ 1.00. All agreement rates of the items were >80.00% with 12 items achieving 100.00% agreement (31.58%), and only 3 items were also <85.00% (Ⅳ-6, Ⅶ-1, and Ⅶ-5). All CVs of the items were <0.25. These data are reported in Table 6.
The grading on the original scale in round two.
SD: standard deviation; IQR: interquartile range; CV: coefficient of variance.
The comparison of the two-round Delphi survey is shown in Figure 2. Figure 2A and D indicates that the mean scores of importance values for dimension in the second round (4.433 ± 0.183) were significantly higher than those in the first round (4.372 ± 0.199) with a statistically significant difference (

Comparison of importance and feasibility values in the two-round consultations: (A) comparison of dimension importance scores between the two rounds; (B) comparison of item importance scores between the two rounds; (C) comparison of feasibility scores between the two rounds; (D) comparison of total mean scores between the two rounds. The green vertical line in (A)–(D) represents the score of
Final items after the Delphi study
Finally, a behavioral assessment scale of consciousness in NHPs was established, including 37 items organized hierarchically within seven dimensions (Table 7), as well as detailed assessment methods and response scores, based on the sufficient synthesis of the mean score, and agreement rate and CV, as well as discussions of the research group in this Delphi study.
Framework of behavioral assessment scale of consciousness for nonhuman primates (NHPs).
Discussion
It is always a complex problem to study the mechanism of consciousness and LoC. 30 Preclinical animal models provide precise controls, reduced variability, rich data output, and reduced ethical complexity. 31 The differences in neural network connectivity and loop function between humans and rodents are enormous. While NHPs have evolutionary similarities and highly conserved gene sequences with humans, 32 the animal models developed with NHPs have the most significant similarity to humans. Findings based on these models best explain the mechanisms of human brain function and diseases, making them ideal animal models for neuroscience. However, no proper assessment tool is available for the state of consciousness in NHPs.
Consciousness is mainly evaluated through subjective evaluation in clinical diagnosis and treatment. The behavior evaluation on noticeable and operable behavioral signs of patients quantitatively by clinical scales is still the gold standard. 33 Consciousness is the awareness of the external environment and/or the self. To evaluate the conscious behavioral response, we can assess the behavioral response caused by external stimuli and the spontaneous conscious behavioral response under non-external environmental stimuli. In other words, the assessment scales mainly reflect the possible state of consciousness by evaluating the behavior response to a stimulus and identifying what actions cause the related consciousness behavior. 34
In recent years, the use of anesthesia-induced loss and recovery of consciousness paradigm in NHPs to study the neurobiological mechanisms of consciousness, DOC, and rehabilitation techniques of DOC has attracted increasing attention. 35 Schroeder's team used five indicators to assess the state of consciousness in rhesus monkeys: limb/facial movements, mouth movements, body movements, eye movements, and vital signs. 36 Redinbaugh et al. 18 used no response to external stimuli as a sign that rhesus monkeys were in a state of LoC and selected five indicators such as spontaneous movement, pedal reflex, blink reflex, limb manipulation, and vertical nystagmus to evaluate the level of consciousness of rhesus monkeys. Bastos et al. 37 defined LoC as the moment when a rhesus monkey's eyes closed for the last time. Recovery of consciousness was defined as the moment when the eyes were open and kept continuously open. Yanagawa et al. 38 believe that when rhesus monkeys are unable to control their hands and do not respond to a cotton swab touching their nostrils during the experiment, it is considered to be a state of loss of consciousness caused by anesthesia induction, while rhesus monkeys were judged to be in a state of consciousness when their hands responded to a cotton swab to touch their nostrils and were able to sit calmly for 20 min. It can be seen that behavioral assessment is also the primary way to evaluate the level of consciousness and behavior of NHP animal models, and the eye, mouth, limb movements, and spontaneous movements are the focus. By manipulating the conscious state of animals, the relationship between behaviors, neural activities, and circuits in different conscious states can be established to infer and explain the neural mechanism of consciousness. 39 There is still a lack of systematic research on the assessment tools for the state of consciousness of NHP animals.
In a previous study, we proposed that behavioral indicators suitable for assessing consciousness in NHPs could be extracted from clinical DOC assessment scales used for human patients with DOC based on the behavioral characteristics and physiological features of NHPs. 23 The Delphi method is commonly chosen to achieve a consensus for clinical standards and the construction of scales. 40 This present study was the first attempt to develop a comprehensive and systematic behavioral assessment scale of consciousness for NHPs based on this hypothesis using a Delphi method. Specifically, 46 dimensions were extracted from 27 human clinical consciousness and behavior assessment scales that may suit the NHPs. Then, we formed an original entry pool for the Delphi study according to their physiological characteristics, conscious and arousal behavior, etc. The experts’ opinions were collected using an e-Delphi method. A consensus was reached after two-round online questionnaire surveys. Patients with consciousness impairment may have dysfunction of one or more sensory channels. 41 Multiple sensory channels are often examined to ensure that more reliable consciousness behaviors are triggered. 42 The scales widely used in clinical practice, such as GCS, FOUR, and CRS-R, include several subscales. Each item assesses the presence or absence of a specific physical sign that represents the integrity of brain function. 43 In addition, the compilation of clinical consciousness assessment scales is mainly based on the clinical experience and theoretical knowledge of experts and a reference to other similar scales. The three aspects of behavioral response, motor response, verbal response, and eye-opening, which GCS chose to examine, were selected from many previous reports on coma. 44 The seven subscales of CRS-R were selected from an original pool of 41 indicators designed by a team of multidisciplinary professionals specializing in acute traumatic brain injury rehabilitation. 45 The Nociception Coma Scale (NCS) was developed based on pain scales for non-communicative patients with dementia and newborns to evaluate behaviors linked to nociceptive pain in the vegetative and minimally conscious state. 46 Similarly, this present study finally developed a behavioral assessment scale for NHPs comprising 37 items organized hierarchically within seven dimensions, including visual function, auditory function, motor function, orofacial movements, arousal, brainstem reflexes, and respiration after a two-round Delphi study. Each dimension comprises items arranged in a hierarchical order, signifying a progression in neurological functioning. The lowest items indicate a lack of function, while the highest represent cortical-based capabilities. The visual function dimension (I) items were extracted from clinical evaluation scales such as the CRS-R, western neurosensory stimulation profile, and the coma near coma scale, among others. Visual function is crucial for maintaining and regulating consciousness and is responsible for processing most sensory information reaching the brain. 47 Therefore, visual function evaluation is important for assessing patients with LoC. The auditory function dimension (II) items were extracted from clinical evaluation scales such as the Loewenstein Communication Scale, DOC Scale, and CRS-R. The auditory cortex is a significant brain area involved in processing acoustic stimuli and is closely connected to other brain areas affected by consciousness. Studies have demonstrated that activation of the auditory cortex correlates with the level of consciousness, 48 making auditory function assessments a valuable part of clinical consciousness assessment scales. The motor function dimension (III) items were extracted from clinical evaluation scales such as the GCS, FOUR, and CRS-R. Motor function plays a vital role in maintaining consciousness and is often evaluated in the clinical assessment of patients with altered mental status. 49 Assessment of motor function can aid in the differential diagnosis of underlying etiologies of altered mental status and provide valuable information about a patient's neurological status and potential for recovery. The arousal dimension (V) items were extracted from clinical evaluation scales such as the CRS-R, comprehensive level of consciousness scale, and the Steward awakening score. The relationship between the arousal dimension and consciousness is complex and multifaceted. It is closely linked to attention, perception, and cognition and plays a crucial role in regulating the brain's information processing flow. Research has shown that changes in arousal can significantly impact an individual's level of consciousness, ranging from drowsiness to deep coma. 50 The brainstem reflex dimension (VI) items were extracted from multiple clinical assessment scales, including the Glasgow-Liege scale, Innsbruck coma scale, FOUR, etc. The brainstem reflexes are automatic responses mediated by the brainstem that are critical for maintaining vital physiological functions such as respiration, blood pressure, and heart rate. 51 These reflexes can also provide important clinical information about the integrity of the brainstem and its connections to higher brain centers. Assessing brainstem reflexes is particularly important for evaluating the level of consciousness in patients with neurological injury or illness. Research has shown a correlation between the presence and strength of brainstem reflexes and the level of consciousness, where their absence or significant reduction is often associated with deeper levels of coma and worse prognoses. 52 Therefore, including brainstem reflex testing in the neurological examination of patients with altered levels of consciousness is essential. The respiratory function dimension (VII) items were extracted from clinical evaluation scales such as the Loewenstein communication scale, FOUR, modified Adrete scale, etc. The stability and orderliness of the respiratory function are crucial for maintaining a normal consciousness state. 53 Abnormal respiratory function can affect the brain's oxygen supply and carbon dioxide elimination, resulting in impaired brain function and altered levels of consciousness. Conversely, changes in consciousness can also affect respiratory function. Hence, monitoring and evaluating the respiratory function status is essential for assessing and managing the level of consciousness in clinical practice. Notably, the statistical indicators of orofacial movements dimension (IV) were found to be at a critical level and prompted serious discussion after the first round of survey. The orofacial movement dimension and its secondary items were primarily extracted and incorporated from clinical scales such as the facial expression subscale of the NCS, 54 the coma exit chart, 55 the oromotor/verbal function subscale of CRS-R, 56 and the eating and emotional response subscales of the Chinese vegetative state scale, 57 among others. These indicators were also derived from similar studies on NHPs, such as the limb/face movements and oral signs indicators developed by Redinbaugh et al. 18 based on clinical arousal scales. Since NHP animals cannot express their mental states through language, orofacial movements may be a valuable indicator for assessing consciousness. Therefore, after discussion among the research group, it was modified and retained in the evaluation scale.
The identification and selection of experts are crucial in a Delphi study.
58
The experts involved in this study had high levels of professional representativeness and authority. Their opinions were well-coordinated and focused. Additionally, they displayed remarkable enthusiasm and engagement in participating in this study. Twenty-eight experts with senior professional titles or doctoral degrees from multiple relative academic backgrounds, including rehabilitation medicine, neurosurgery, neurology, neuroscience, and psychology, participated in this study. They were clinical scientists, professors, or researchers from 13 provinces in China. The response rate to this Delphi survey was high (93.33% and 100.00% in the two rounds, respectively), and several constructive suggestions and comments were received. The sufficient positive coefficient also indicated that experts cared about this research and were enthusiastic about participation.
59
Usually, the expert authority coefficient ≥0.70 is considered acceptable.
60
Meanwhile, it is 0.809 and 0.820 in the two rounds of this work, respectively, indicating that the experts consulted had high authority. All items’ scores on the importance and feasibility met the pre-established criteria, meaning that expert opinions were concentrated and the items were of high importance and feasibility. The coefficients of variation of all items were <0.25, indicating a high degree of coordination among them. Kendall's coordination coefficient W, used to evaluate the agreement among 28 experts, was 0.493, 0.561, and 0.493 in the first round and 0.448, 0.588, and 0.502 in the second round, with all
This study presents several limitations. While the Delphi method was utilized to compile the behavioral assessment scale of consciousness for NHPs, which benefits from the intelligence and expertise of the participating experts, it still has subjective limitations. Therefore, further evaluation of the self-designed scale is required to establish its reliability, validity, and other psychometric indicators, making it a more reliable and effective tool for subsequent studies. For instance, an NHP model with anesthesia-induced LoC can be developed to combine the behavior analysis obtained from the scale with objective techniques such as EEG, PET-CT, fMRI, and others to validate the scale. Moreover, limiting the expert sample for the Delphi study to only those from China may introduce bias and potential limitations. By excluding experts from other regions, the results may not fully reflect the global perspectives and experiences relevant to the topic of interest. Future studies could aim to include experts from diverse regions and backgrounds. This could be achieved through collaborations with international research institutions or organizations to recruit participants from different countries and cultures.
Conclusions
In this study, we developed a behavioral assessment scale of consciousness for NHPs using a two-round online Delphi method. The consensus achieved by a multidisciplinary expert panel provides a comprehensive and detailed tool for measuring conscious states in NHPs or NHP models with LoC. This tool will facilitate the study of the underlying mechanisms of consciousness and LoC in NHPs and have significant implications for translational research into human consciousness disorders. Additionally, this study highlights the importance of developing assessment tools tailored to NHPs to bridge the gap between animal and human studies, and further advance our understanding of consciousness.
Supplemental Material
sj-doc-1-sci-10.1177_00368504231200995 - Supplemental material for Behavioral assessment scale of consciousness for nonhuman primates: A Delphi study
Supplemental material, sj-doc-1-sci-10.1177_00368504231200995 for Behavioral assessment scale of consciousness for nonhuman primates: A Delphi study by Weiming Sun, Xiangli Dong, Guohua Yu, Yang Yang, Binjun He, Yingming Wei, Shijin Li, Zhen Feng and Chaolin Ma in Science Progress
Footnotes
Acknowledgments
We wish to express our sincere gratitude to all the experts who participated in the Delphi process, which provided invaluable insights and expertise that significantly contributed to the research presented in this publication. Furthermore, we would like to extend our heartfelt thanks to Professor Haibo Di and his research team at the International Unresponsive Wakefulness Syndrome and Consciousness Science Institute, School of Basic Medical Sciences, Hangzhou Normal University, for their guidance and support.
Author contributions
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethics approval
This study was approved by the Medical Research Ethics Committee of the First Affiliated Hospital of Nanchang University on 7 March 2022, with the code number 2022–60–2.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The work was supported in the Natural Science Foundation of Jiangxi Province (20232BAB216090), and the Special Fund for Postgraduate Innovation of Jiangxi Province (YC2021-B021).
Informed consent statement
Informed consent was obtained from all subjects involved in the study.
Supplemental material
Supplemental material for this article is available online.
Author biographies
Weiming Sun obtained his master’s degree in the Department of Psychology, Nanchang University in 2015 and worked in the Department of Rehabilitation Medicine, the First Affiliated Hospital of Nanchang University since 2016. He obtained his PhD in 2023 under the supervision of Prof. Chaolin Ma and Prof. Zhen Feng in the Institute of Life Science, Nanchang University. His research focuses on neurobiology and neurological rehabilitation.
Xiangli Dong obtained her master’s degree in the Department of Psychology, Nanchang University in 2017 and worked in the Department of Psychosomatic Medicine, the Second Affiliated Hospital of Nanchang University since 2019. Her research focuses on psychological assessment and psychological therapy.
Guohua Yu is a distinguished associate professor with Department of Rehabilitation Medicine, Hospital of Nanchang University. Her research focuses on rehabilitation of neurological diseases.
Yang Yang is a master student in Neurobiology from the School of Life Sciences, Nanchang University since 2021 under the supervision of Prof. Chaolin Ma.
Binjun He obtained his master’s degree in Neurobiology from the School of Life Sciences, Nanchang University in 2023 under the supervision of Prof. Chaolin Ma.
Yingming Wei obtained her master’s degree in Neurobiology from the School of Life Sciences, Nanchang University in 2023 under the supervision of Prof. Chaolin Ma.
Shijin Li is a master student in Neurobiology from the School of Life Sciences, Nanchang University since 2021 under the supervision of Prof. Chaolin Ma.
Zhen Feng is a distinguished professor with Department of Rehabilitation Medicine, Hospital of Nanchang University. She obtained her master’s degree from Wuhan University, China. Her research focuses on rehabilitation of neurological diseases, especially the clinical research and basic research of consciousness disorders.
Chaolin Ma is a distinguished professor with Institute of Life Science, Nanchang University and the principal investigator of Cognitive Function and Disease Lab. He obtained his PhD in Neurobiology from Fudan University, China. His research focuses on the functions of the prefrontal cortex in working memory, decision, making and attention-deficit hyperactivity disorder, etc.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
