Abstract
Background
Intracerebral hemorrhage (ICH) is the most devastating stroke subtype, characterized by high mortality and disability rates. With the rapid growth of short-video platforms, TikTok and BiliBili have become important channels for the public to obtain health information. However, the quality and reliability of ICH-related videos on these platforms have not been systematically evaluated.
Methods
Up to October 17–18, 2025, the research collected the top 100 comprehensively ranked videos from TikTok and BiliBili separately, using the Chinese term “脑出血 (ICH)” as the search keyword. After screening, 146 videos were included. Two independent reviewers assessed video quality and reliability using three standardized tools: the Global Quality Scale (GQS), the modified DISCERN (mDISCERN) instrument, and the JAMA benchmark. Correlation analysis was used to evaluate the relationship between video quality and engagement metrics.
Results
Videos on BiliBili scored significantly higher than those on TikTok on both the GQS and mDISCERN (P < 0.001 and P = 0.039, respectively). Furthermore, BiliBili outperformed TikTok in terms of content completeness. However, TikTok demonstrated significantly higher engagement metrics (likes, favorites, comments, and shares) than BiliBili (P < 0.01 for all). Videos uploaded by healthcare professionals achieved the highest quality, with a median GQS score of 3 (IQR 2-4). Correlation analysis revealed a positive correlation between video length and quality scores, while the number of comments was negatively correlated with both video quality and length.
Conclusion
The quality and reliability of ICH-related videos were superior on BiliBili compared to TikTok, whereas TikTok exhibited overwhelming advantages in user engagement. Longer video duration was associated with better quality and reliability. Although videos from healthcare professionals scored higher in quality and reliability than those from individual users, they still did not meet the standard for high-quality information. This indicates the ongoing challenge of effectively translating specialized medical knowledge into reliable, practical, and easily understandable information for the public.
Introduction
According to the latest Global Burden of Disease (GBD) study, stroke remains the second leading cause of death worldwide. 1 Intracerebral hemorrhage (ICH), although accounting for only 20% of stroke cases, is the most devastating stroke subtype globally, characterized by the highest rates of disability and mortality. Its sudden onset and critical nature impose a heavy burden on patients’ families and the healthcare system. 2 Despite continuous advancements in treatment, the prognosis for ICH remains poor, with a significant proportion of survivors left with severe neurological deficits. This not only severely compromises the patient’s quality of life but also places long-term and substantial economic and non-economic pressure on their families and the entire healthcare system. 3 In this context, effective prevention, timely recognition, and scientific long-term management are crucial for improving ICH outcomes. This largely depends on the public’s accurate knowledge and understanding of the disease. Therefore, enhancing public awareness of ICH warning signs, risk factors, and emergency response is of paramount importance for reducing treatment delays and improving clinical outcomes.
In recent years, the widespread adoption of the internet and social media has fundamentally transformed how the public accesses health information. 4 Among these, video platforms have emerged as prominent forces due to their intuitive, engaging, and easily disseminable nature, becoming a key information source for the public, particularly younger generations. 5 As a convenient and efficient means of health communication, online science popularization education helps improve patients’ knowledge about their diseases and promotes their active participation in disease management. 6 This indicates that online video-based science popularization, characterized by the integration of audio and visual elements, contextual presentation, and repetitive learning, can convey disease-related knowledge more intuitively, thereby exerting greater value in health education within clinical practice. According to data, China’s internet user base has reached 1.123 billion, with short-video users accounting for over 95% (1.068 billion). TikTok and BiliBili, as the two largest platforms in China by user volume, cover all age groups and are highly representative. Among them, TikTok’s total user base has exceeded 1.62 billion, with monthly active users approaching 1 billion and daily active users exceeding 600 million. 7 Bilibili has 107 million daily active users and 348 million monthly active users. 8 With distinct strengths including easy access, straightforward visual display, and swift information spread, these two platforms serve as vital instruments for delivering public health education. Previous international studies have shown that visual health education methods such as short videos also hold significant health communication value worldwide. For example, a relevant study by the University of California in health science popularization found that, compared with patients who only read text information, those who watched video-based education reported significantly higher perceived effectiveness of opioid dose reduction (4.06 vs. 3.67, P < 0.001). 9 Therefore, video-based health communication can have practical value across different countries and cultural contexts; this provides the background for our further evaluation of the quality of health information on video platforms. While these platforms offer unparalleled advantages in the breadth and speed of information dissemination, this very advantage carries significant hidden risks: the low barrier to content creation results in highly variable quality, with no guarantee of accuracy, thereby exposing the public to the dual risks of misinformation and disinformation. 10 For critical emergencies like ICH, judgments based on incomplete or erroneous information can lead to delayed medical care, inappropriate self-management, or unrealistic expectations about prognosis, with potentially dire consequences.11,12
Currently, numerous studies have focused on assessing the quality of various health topics on social media, ranging from chronic degenerative and autoimmune diseases to cardiovascular diseases and even cancer, confirming that short-video platforms have become a major channel for public health information.13–16 However, there is a lack of specialized, systematic evaluation of the information quality specifically for ICH, a critical neurological emergency, on these mainstream video platforms. Furthermore, most existing studies are limited to single-platform analyses, failing to reveal the differential impact of various platforms on information quality through cross-platform comparison.
This study is the first to evaluate ICH-related content on two major Chinese video platforms, TikTok and BiliBili. We conducted a comprehensive assessment using three recognized instruments—the Global Quality Scale (GQS), the modified DISCERN (mDISCERN), and the JAMA benchmark criteria—to examine ICH videos from multiple dimensions, including informational completeness, scientific reliability, and content standardization.17–19 By analyzing platform characteristics, content creator backgrounds, and dissemination effects, this study aims to provide an evidence-based foundation for optimizing ICH-related health information dissemination.
Methods
Video retrieval and screening strategy
This study was conducted between October 17 and 18, 2025, on two leading short-video platforms: TikTok and BiliBili. To simulate the search behavior of an average user and minimize bias from personalized algorithmic recommendations, we used newly registered platform accounts. All searches were performed after clearing browser cache and history. To ensure the reproducibility of the study, we employed the default comprehensive ranking filters on both platforms. These filters rank search results based on relevance, engagement, and platform-specific weighted parameter combinations. The Chinese keyword “脑出血 (ICH)” was used for searching on both platforms, using the default “comprehensive ranking” filter. The top 100 videos from each platform’s search results were recorded. The exclusion criteria for videos were: (1) duplicates; (2) irrelevant content: Those videos that contain actual content unrelated to ICH but were still retrieved; (3) advertisements; (4) videos without sound or containing only background music but no substantive content; (5) videos published within the last 7 days; (6) not informative: The theme of the video is ICH, but the content does not contain any substantive health education information about ICH. The final screening process is illustrated in Figure 1. Flowchart of the search and screening strategy for ICH-related videos.
Data extraction and classification
Basic characteristics and engagement metrics (video duration, number of likes, favorites, comments, and shares) were extracted from each eligible video. Furthermore, uploaders were categorized into three groups based on their self-declared and platform-verified information: Specialized Healthcare Professionals (SHPs, e.g., certified neurologists/neurosurgeons), Non-Specialized Healthcare Professionals (NSHPs, e.g., nurses, other healthcare workers not specialized in neurology), and Individual Users (persons without any healthcare background certification). Additionally, we assessed whether the video content covered key clinical elements, including: epidemiology, etiology, clinical manifestations, diagnosis, treatment, prevention, and prognosis. The mention of each element was recorded as “yes” or “no”.
Quality and reliability assessment
The Global Quality Score (GQS) quality criteria.
The Modified DISCERN (mDISCERN) quality criteria.
The Journal of the American Medical Association (JAMA) benchmark criteria.
Comparison of health information quality evaluation tools.
Statistical analysis
All statistical analyses were performed using R software (version 4.3.2). As continuous variables (e.g., video duration, engagement metrics) were non-normally distributed, data are described using medians and interquartile ranges (IQRs). The Mann-Whitney U test was used for comparisons between two groups, and the Kruskal-Wallis H test was used for comparisons among three or more groups. When a statistically significant difference was found in the Kruskal-Wallis H test, Dunn’s test was used for post-hoc pairwise comparisons. Categorical variables are presented as frequencies and percentages. Spearman’s rank correlation analysis was used to assess the relationships between video variables (e.g., duration, engagement metrics, and quality scores). A P-value of < 0.05 was considered statistically significant.
Results
General characteristics of videos
Using the keyword “脑出血” (ICH), we retrieved the top 100 videos from each platform, TikTok and BiliBili. After screening, 80 videos from TikTok and 66 from BiliBili were included in the final analysis (Figures 1 and 2). Key metrics collected included video duration, likes, favorites, comments, and shares (Table 5). Among the 146 videos analyzed, the median video length was 143.50 seconds. The median number of likes was 3,779, favorites was 1,237, comments was 254.5, and shares was 394. Distribution of included ICH-related videos across platforms. General characteristics, quality, and reliability of the videos.
Content completeness was assessed by evaluating the coverage of key elements: epidemiology, etiology, clinical manifestations, diagnosis, treatment, prevention, and prognosis. Out of the 146 videos, 20 (13.6%) covered epidemiology, 66 (45.2%) covered etiology, 66 (45.2%) covered clinical manifestations, 47 (32.1%) covered diagnosis, 78 (53.4%) covered treatment, 54 (36.9%) covered prevention, and 49 (33.5%) covered prognosis.
Video quality and reliability were assessed using the GQS, mDISCERN, and JAMA scores (Table 5). The median scores for the overall pool of videos were GQS: 3 (IQR 1-3), mDISCERN: 2 (IQR 2-2), and JAMA: 2 (IQR 2-2), indicating generally moderate to low quality and reliability across both platforms.
Analysis of video characteristics by platform
General information, quality, and reliability scores of ICH videos on TikTok and BiliBili.
Content completeness also varied by platform (Table 6, Figure 3). A similarly low proportion of videos covered epidemiology on both BiliBili (13.6%) and TikTok (13.8%). The coverage of etiology (48.5% vs. 42.5%) and clinical manifestations (48.5% vs. 42.5%) was comparable between BiliBili and TikTok, respectively. However, BiliBili had a significantly higher proportion of videos covering diagnosis (43.9% vs. 22.5%), treatment (72.7% vs. 37.5%), prevention (72.7% vs. 45%), and prognosis (54.6% vs. 23.8%) than TikTok, indicating more comprehensive content on BiliBili. Content coverage of ICH-related videos on TikTok and BiliBili.
Quality and reliability assessments by platform are detailed in Table 6 and Figure 4. BiliBili videos achieved significantly higher median GQS scores (3, IQR 2-3) compared to TikTok videos (2, IQR 1-3) (P < 0.001) (Figure 4(a)). Similarly, the mDISCERN score was significantly higher for BiliBili than for TikTok (P = 0.039) (Figure 4(b)). In contrast, no significant difference was found in the JAMA scores between the two platforms (P = 0.177) (Figure 4(c)). These results suggest that the quality and reliability of ICH-related videos are superior on BiliBili compared to TikTok. Quality and reliability assessments of ICH-related videos on TikTok and BiliBili using GQS, mDISCERN, and JAMA scores.(A) Violin plot of GQS scores. (B) Violin plot of mDISCERN scores. (C) Violin plot of JAMA scores. (*P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001).
Analysis of video characteristics by uploader
Significant differences were also found among videos from different uploader categories: SHPs, NSHPs, and Individual Users. The distribution of these uploader categories was similar across BiliBili and TikTok (Figure 5(a) and (b)). Of the 146 videos, 52 were uploaded by SHPs, 52 by NSHPs, and 42 by Individual Users (Table 7). Proportion of video uploaders categorized as SHPs, NSHPs, and Individual Users on TikTok and BiliBili. (A) Pie chart showing the overall proportion of uploader categories (SHPs, NSHPs, and Individual Users). (B) Percentage stacked bar chart comparing the distribution of uploader categories between TikTok and Bilibili. Characteristics, quality, and reliability of ICH videos by different uploaders on TikTok and BiliBili.
A significant difference in video duration was observed among the three groups (Table 7). Videos from SHPs had the longest median duration (197.5 seconds), followed by Individual Users (111 seconds) and NSHPs (108 seconds). Regarding engagement, videos from SHPs received the highest median numbers of likes (7,543.5), favorites (2,701.5), and shares (1,041).
The quality and reliability scores by uploader category are presented in Table 7 and Figure 6. For the GQS score (Table 7, Figure 6(a) and (b)), videos from SHPs had the highest median score (3, IQR 2-4), followed by NSHPs (3, IQR 3-3) and Individual Users (1, IQR 1-2). The GQS scores for both SHPs and NSHPs were significantly higher than for Individual Users, but no significant difference was found between SHPs and NSHPs. For the mDISCERN score (Table 7, Figure 6(c) and (d)), the median score was 2 (IQR 2-2) for all three groups; however, the scores for both SHPs and NSHPs were statistically significantly higher than for Individual Users, with no significant difference between the two professional groups. For the JAMA score (Table 7, Figure 6(e) and (f)), the median score was 2 (IQR 2-2) for all three groups, with no statistically significant differences among them. Quality and reliability assessments of ICH-related videos by uploader category using GQS, mDISCERN, and JAMA scores. (A) Ridge plot of GQS scores across uploader categories (Overall, SHPs, NSHPs, and Individual Users). (B) Ridge plot of mDISCERN scores across uploader categories. (C) Ridge plot of JAMA scores across uploader categories. (*P < 0.05,**P < 0.01, ***P < 0.001,****P < 0.0001).
Correlation analysis
We performed a correlation analysis between basic video characteristics/engagement metrics (duration, likes, favorites, comments, shares) and quality/reliability indicators (GQS, mDISCERN, JAMA scores) across the 146 videos (Figure 7). Strong positive correlations were observed among likes, favorites, comments, and shares. Video duration showed low-to-moderate positive correlations with both the GQS and mDISCERN scores. Conversely, the number of comments exhibited negative correlations with both video duration and the GQS score. Heatmap of correlation analysis between video characteristics, engagement metrics, and quality/reliability scores. The meaning of ×: The P value is not less than 0.05, which indicates that there is no statistical significance.
Discussion
In this cross-sectional assessment of ICH-related videos on TikTok and BiliBili, our study revealed that: (a) the overall quality and reliability scores (GQS, mDISCERN, JAMA) were low; (b) compared to TikTok, videos on BiliBili were significantly longer and provided more comprehensive coverage of key clinical elements such as diagnosis, treatment, prevention, and prognosis; however, TikTok demonstrated superior performance in user engagement metrics including likes, favorites, comments, and shares; (c) videos created by SHPs received higher quality scores and better audience feedback, indicating that the professionalism and credibility of the content are key factors in enhancing the effectiveness of public health education; (d) strong correlations were observed among various engagement metrics; video length showed low-to-moderate positive correlations with GQS and mDISCERN scores, while the number of comments was negatively correlated with both video length and the GQS score.
Data from the 146 videos indicate that the significantly longer duration of videos on BiliBili reflects its user base’s higher acceptance of in-depth, systematic content. The platform’s ecosystem is more conducive to explaining complex health knowledge. 21 In terms of user engagement, however, TikTok’s metrics were orders of magnitude higher, with likes, favorites, comments, and shares far exceeding those on BiliBili. This discrepancy is largely attributable to the distinct positioning and algorithms of the two platforms. BiliBili was originally designed for knowledge-oriented content, and its user base includes more students and medical students. Its recommendation system prioritizes educational and in-depth videos. In contrast, TikTok’s algorithm promotes fast-paced short videos, and its distribution model, heavily based on engagement metrics like likes and watch time, typically favors eye-catching content over purely informational material. 22 This suggests a significant trade-off between informational depth/completeness and dissemination breadth/engagement. Well-produced, comprehensive long-form videos (as found on BiliBili), while capable of providing broader coverage of clinical elements and helping to establish professional authority, face limitations in their dissemination potential and user interaction levels, making it difficult to achieve widespread, immediate engagement. Conversely, concise short-form content (as found on TikTok), although often compromising on depth and comprehensiveness when dealing with complex medical topics, holds a distinct advantage in stimulating user interaction and achieving viral spread. 23 In addition, BiliBili’s search and recommendation algorithm might perform less effectively than TikTok’s, as our retrieval process revealed that BiliBili’s comprehensive sorting yielded a higher proportion of videos unrelated to ICH (e.g., on ischemic stroke, myocardial infarction) compared with TikTok, and such irrelevant videos were excluded.
Our systematic analysis of content coverage related to ICH on TikTok and BiliBili revealed significant disparities in the coverage of different clinical elements, highlighting structural shortcomings in current health education content. The most prominent issue identified was the severe neglect of epidemiological information regarding ICH. Among the 146 videos, only approximately 13.6% mentioned relevant epidemiological data, a pattern consistently observed across both TikTok and BiliBili platforms.24,25 Epidemiological information, such as disease incidence and high-risk groups (e.g., specific age ranges, individuals with hypertension or diabetes), is fundamental for fostering public risk awareness and enabling preliminary self-assessment. The widespread absence of this content may prevent the public, particularly high-risk individuals, from accurately evaluating their personal risk. This could lead to the neglect of necessary preventive measures and early warning signs, potentially delaying timely medical intervention. 26 Furthermore, coverage of other critical aspects—diagnosis, prevention, and prognosis—was also insufficient overall, with notable disparities between platforms. Across all 146 videos, the proportions covering diagnosis (32.1%), prognosis (33.5%), and prevention (36.9%) each remained below 40%. This lack of information can have serious consequences: insufficient knowledge about diagnosis may prevent the public from understanding the importance of essential diagnostic tools (such as CT scans); a scarcity of prognostic information can distort patient and family expectations regarding recovery and long-term management, potentially leading to unnecessary anxiety or unrealistic treatment outcome expectations; and weak coverage of prevention—a cornerstone of stroke management—directly impacts the effectiveness of primary prevention efforts. 27 It is noteworthy that BiliBili demonstrated significantly higher coverage of in-depth information on diagnosis, treatment, prevention, and prognosis compared to TikTok. This aligns with its longer video format, which is more conducive to systematic explanation. However, TikTok’s comparative weakness in these critical clinical areas, despite its superior reach and engagement, implies that the information flow reaching the broadest audience may lack sufficient depth and practical utility. This disconnect between “breadth” and “depth” risks rendering widely disseminated content ineffective in translating into actionable knowledge and improved health literacy for public decision-making. 28
Our study also systematically evaluated ICH-related videos from both platforms using the GQS, mDISCERN, and JAMA instruments. Overall, the quality and reliability scores for all videos were moderate to low. Low scores on GQS, mDISCERN, and JAMA indicate issues such as “insufficient source citation, incomplete logical structure, and weak evidence chains,” which are particularly dangerous for an acute and severe condition like ICH. This suggests that the overall quality of ICH-related health information on social media still requires significant improvement, with urgent need to enhance its accuracy, comprehensiveness, and reference value. 29 However, significant differences in quality and reliability scores were observed between the platforms. Videos on BiliBili demonstrated significantly higher quality and reliability than those on TikTok: the median GQS and mDISCERN scores were both higher on BiliBili, indicating superior performance in information flow completeness, patient usefulness, and content reliability. This finding is consistent with similar studies on other diseases.30,31 This disparity likely stems primarily from the distinct content ecosystems and user consumption habits of the two platforms. First, as mentioned previously, BiliBili allows and encourages longer video content. This provides creators, particularly SHPs, with sufficient space to explain the complex medical knowledge of ICH systematically and in-depth, covering aspects such as diagnostic basis, treatment options, and prognosis management. This is corroborated by the significantly higher coverage of key clinical elements like diagnosis, treatment, prevention, and prognosis in BiliBili videos. Second, BiliBili’s community culture has a stronger “knowledge-oriented” characteristic, where users have higher acceptance and demand for deep, professional content, thereby incentivizing creators to produce higher-quality videos. Finally, BiliBili’s user base likely includes a higher proportion of students, including medical students, making them a more receptive audience for sufficiently long and comprehensive ICH videos. In contrast, TikTok’s core strengths lie in high interactivity and powerful dissemination capabilities. However, its short-video format and algorithm that prioritizes immediate engagement may be structurally disadvantageous for the complete presentation of complex medical information. To capture viewer attention within a short timeframe, content often tends to focus on more visually impactful or dramatic aspects, such as symptom descriptions, at the expense of in-depth information requiring rigorous explanation, like diagnosis and prognosis. 32
This study categorized video sources into three groups based on uploader identity: SHPs, NSHPs, and individual users. Our analysis revealed significant differences in the quality and reliability of ICH-related videos produced by different uploader groups. First, videos uploaded by SHPs demonstrated significantly superior quality compared to the other two categories. Specifically, these videos received the highest GQS scores, reflecting better performance in information flow, clarity, and patient usefulness. This finding aligns with expectations, as SHPs possess systematic medical knowledge and clinical experience that enables them to ensure the accuracy and scientific rigor of health education content.33,34 A notably positive finding is that SHP-created videos also achieved the highest interaction metrics (likes, favorites, and shares). This suggests that in the serious medical domain of ICH, the public can recognize and shows a preference for trusting and disseminating content from professional sources, creating a virtuous cycle between professionalism and dissemination reach. However, our results also reveal a crucial issue: despite the relative superiority of SHP content, their GQS scores remained only at a moderate level, and all uploader groups demonstrated generally low mDISCERN and JAMA scores. This indicates that even content created by professionals has substantial room for improvement in terms of information reliability, citation of references, avoidance of bias, and content comprehensiveness. Previous research has similarly identified this problem.35,36 The most plausible explanation is that professionals may compromise informational rigor and depth to adapt to the characteristics of short-video platforms. Given the current popularity of short-form videos and the high interactivity of concise content, medical and public health professionals need to achieve broader reach without compromising the depth of information. SHPs might be able to use short and engaging videos as entry points to effectively capture viewers’ attention, while directing those seeking more detailed information via embedded links to longer, more comprehensive content published on other platforms. In addition, to improve the quality of videos produced by healthcare professionals, we suggest that SHPs adopt specific, actionable strategies such as: (1) clearly disclosing their professional qualifications and the educational purpose of the content at the beginning of the video; (2) citing authoritative guidelines or peer-reviewed evidence when presenting treatment recommendations or disease information; (3) following standardized formats for health information delivery to reduce ambiguity and enhance consistency; and (4) collaborating with professional associations to develop checklists or templates that help ensure completeness and accuracy of content. Such practices would not only raise the quality of professional videos but also make them more trustworthy and accessible to the public.
Correlation analysis of metrics including video length, likes, favorites, comments, and shares across the 146 videos revealed complex yet instructive relationships among the indicators for ICH-related videos, providing important insights into the dissemination patterns and quality representation of health information on social media. First, our correlation results showed strong positive correlations among engagement metrics such as likes, favorites, comments, and shares. However, no significant correlation was observed between these engagement metrics and quality/reliability scores. This suggests that these metrics collectively reflect a video’s “popularity” or “dissemination reach,” but they do not necessarily serve as direct proxies for its intrinsic informational quality.37,38 A video that triggers strong emotional responses or interest in users is likely to see simultaneous increases across all engagement indicators. Second, a key finding was the low-to-moderate positive correlation between video length and both GQS and mDISCERN scores, a result consistent with previous findings reported by Lei et al.39,40 This strongly indicates that longer duration provides the necessary space for systematically and thoroughly explaining complex medical knowledge, thereby contributing to enhanced content comprehensiveness and reliability. This finding corroborates the superior quality observed with longer videos on BiliBili, underscoring the value of “depth” in health education. However, a noteworthy and cautionary phenomenon was the negative correlation between the number of comments and both video length and GQS score. This may imply that lengthy or overly specialized videos might fail to engage broader viewership and could potentially suppress users’ immediate inclination to comment, possibly because viewers require time to digest the information or feel less capable of participating in in-depth discussions. Furthermore, a more critical interpretation is that a high number of comments may not signal high quality but could instead indicate contentious content. Videos that generate numerous comments might contain controversial viewpoints, unsubstantiated claims, or oversimplified conclusions, thereby provoking emotional debates or panic-driven inquiries—content characteristics typically contrary to high-quality, rigorous, and objective health education.
We suggest TikTok refine its algorithm to give prominent placement in search results to health education videos produced by verified medical practitioners; concurrently, the platform ought to support and incentivize creators to make longer videos that offer more structured and thorough health education content. Meanwhile, BiliBili should further consolidate its strength in in-depth explanations and build a systematic knowledge repository. Content creators, particularly SHPs, need to focus on addressing critical content gaps such as epidemiological background, diagnostic basis, and prognosis management. For viewers/patients, we recommend selecting longer, more comprehensive videos released by SHPs to acquire disease-related knowledge.
While this study provides valuable insights into the quality and dissemination patterns of ICH-related information on short-video platforms, several limitations must be acknowledged. First, the data were sourced by sampling the top 100 videos from each platform at a specific point in time (October 17-18, 2025). This approach may not fully capture all relevant content on the platforms, and the results are susceptible to the dynamic nature of platform-specific recommendation algorithms and content updates. Second, in Chinese, “脑出血” is the more normative and commonly used medical term, while “脑溢血”(ICH) was formerly a colloquial label employed by a limited group and is seldom referenced today. “脑出血” is now broadly utilized as the Chinese shorthand for ICH across lay and professional medical discourse. By employing the Chinese search term “脑出血”, our study may have missed videos indexed under alternative expressions such as “脑溢血,” potentially restricting the overall completeness of the results. Third, although we employed validated assessment tools such as GQS and mDISCERN, the evaluation of video quality inevitably contains a subjective component. While we implemented measures to ensure inter-rater consistency, including using two independent reviewers and an arbitration process, potential inter-rater variability may still exist. Furthermore, as a cross-sectional study, it reveals associations and differences between platforms but cannot establish causality. In addition, due to the difference in user scale between TikTok and BiliBili, without obtaining the information on the scale of viewers for each video (such as the number of plays), directly comparing the engagement metrics like the number of likes and comments across platforms cannot rule out the bias caused by the platform differences. Finally, Although algorithms, user profiling, and content moderation mechanisms vary across platforms, simplifying content to fit platform constraints and thereby reducing rigor is a common trend. These platforms are gradually emerging as important channels for health education, and existing studies have shown that English short videos can be used to disseminate general medical information (e.g., alcohol risk education, 41 early childhood language learning 42 ), but no research has specifically examined stroke-related content. This research gap suggests that our findings may have international relevance, implying that similar adaptations of professional content to platform constraints may exist in other language regions. However, due to differences in language, culture, and healthcare systems, as well as limitations in data availability, these conclusions need to be further validated in non-Chinese contexts by incorporating local platform characteristics. Future research should conduct multinational, multilingual comparative studies to gain deeper insight into how health information is disseminated and evaluated in different socio-technical environments.
Conclusion
This study presents the first assessment and comparison of the content quality, reliability, and dissemination characteristics of ICH-related videos on two major Chinese short-video platforms, TikTok and BiliBili. Our findings clearly demonstrate that BiliBili, leveraging its longer video format and greater tolerance for in-depth content, significantly outperforms TikTok in terms of informational comprehensiveness, overall quality, and reliability. However, TikTok exhibits overwhelming superiority in user engagement metrics, suggesting its short-form, highly immersive model is more conducive to widespread dissemination among the general population. Furthermore, the positive correlation identified between video duration and quality/reliability scores underscores the critical role of sufficient length in delivering substantive content. Conversely, the negative correlation between the number of comments and the GQS score serves as a cautionary note to viewers that high interaction volume should not be simplistically equated with high information quality. Notably, content created by SHPs was found to be of the highest quality, affirming the core value of professional authority in health education. However, a crucial finding is that even videos produced by SHPs failed to meet the standard for high-quality information. This highlights the persistent challenge of effectively translating specialized medical knowledge into accessible, reliable, and practical information for the public.
Footnotes
Acknowledgments
The authors extend their gratitude to all video uploaders included in this study. Their contributions to the dissemination of public health information form the foundation of the value of this research.
Ethical considerations
This study did not involve human participants, clinical data, experimental animals, or histological research. All analyzed data were sourced from publicly available TikTok and BiliBili videos, and data collection strictly complied with the terms of service of TikTok and BiliBili. No private or personally identifiable information was collected or processed, nor was any interaction conducted with users. Therefore, the study qualifies for an ethics exemption and does not require ethics approval.
Author contributions
QWH was responsible for the conceptualization, methodology, and overall supervision of the study. BSH and AQC conducted the investigation, including data extraction and quality assessment. ZT, XY, and YYS performed data editing of the final manuscript. Curation, collection, and formal analysis. The original draft of the manuscript was written by BSH and AQC, and all authors contributed to the subsequent review and editing of the final manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by grants from the National Natural Science Foundation of China (82071335, 82301508) and the Natural Science Foundation of Hubei Province (2023AFB280).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
The datasets generated during this study are available from the corresponding author upon reasonable request.
Generative AI statement
The authors declare that no generative AI or AI-assisted tools were used for writing, data analysis, image generation, or code development in this study.
