Abstract

Hepatitis B is a significant global health concern and poses a substantial burden on public health systems. Short video platforms such as TikTok and Bilibili have become important channels for health information dissemination. However, the quality and reliability of Hepatitis B-related content on these platforms remain unclear. The objective of our research is to evaluate the quality of information regarding Hepatitis B disseminated on the TikTok and Bilibili short video platforms. On April 1, 2025, we systematically collected the top 100 Hepatitis B-related short videos from TikTok and Bilibili, totaling 200 videos. Basic video information was extracted, and video quality and reliability were assessed using the Global Quality Scale (GQS), modified DISCERN (mDISCERN), and JAMA benchmarks. Spearman correlation analysis was performed to examine the relationship between engagement metrics and quality scores. TikTok videos demonstrated greater user engagement, as evidenced by higher metrics for likes, comments, and shares, and also achieved superior reliability scores compared to Bilibili. Specifically, the median reliability scores for TikTok videos were mDISCERN: 4 (3-4) and JAMA: 3 (3-3), whereas for Bilibili videos, these scores were mDISCERN: 3 (3-4) and JAMA: 2 (2-3). In terms of content quality, as assessed by the GQS, both platforms exhibited similar levels (TikTok: 4 [3-4], Bilibili: 4 [3-4]). Additionally, videos uploaded by hepatologists consistently showed higher quality and reliability. Spearman correlation analysis indicated significant but weak positive correlations between engagement metrics (likes, comments, shares, saves) and both GQS and JAMA scores; however, no significant correlation was observed with mDISCERN scores. The overall quality and reliability of Hepatitis B-related short videos were moderate, with TikTok videos outperforming Bilibili videos in reliability. Videos created by hepatologists demonstrated higher quality and reliability. 
We recommend that the public exercise caution when consuming health information from short videos to avoid potential misinformation.

Keywords

Bilibili, health information, Hepatitis B, short videos, TikTok

Introduction

Hepatitis B is a significant global health concern, with an estimated 316 million people living with chronic Hepatitis B virus (HBV) infection worldwide.¹ The disease poses a substantial burden on public health systems, contributing to liver cirrhosis, hepatocellular carcinoma, and other severe liver diseases.² The prevalence of Hepatitis B varies widely by region, influenced by factors such as vaccination programs, transmission routes, and socioeconomic conditions.³ In China, despite significant progress in reducing the prevalence of Hepatitis B through vaccination, the disease remains a major public health challenge.^4,5 The clinical importance of Hepatitis B is underscored by its potential for chronicity and the associated long-term health implications.⁶ These considerations underscore the paramount importance of elevating public awareness and understanding of Hepatitis B, as this is essential for achieving better health outcomes and alleviating the overall disease burden.

With the emergence of the concept of health literacy, the global demand for disease and health education has grown increasingly significant. Leading health organizations, including the National Institutes of Health, the US Department of Health and Human Services, and the American Medical Association, consistently recommend developing patient education materials at a sixth-grade reading level to ensure accessibility.⁷ The digital revolution has fundamentally transformed health information acquisition patterns.⁸ It is known that approximately half of the adult population consults the internet for health-related information.⁹ Short-video platforms like TikTok and Bilibili have emerged as predominant health information sources due to their accessibility, engaging formats, and interactive features.^10,11 These platforms leverage visually compelling content, particularly videos, to facilitate the dissemination of complex medical knowledge, thereby significantly enhancing information accessibility and public engagement while making health information easier to absorb and remember.¹² The quality of such information is paramount, as evidence confirms that patients who acquire accurate knowledge about disease causes, pathophysiology, and treatment protocols demonstrate better participation in and compliance with therapeutic regimens.¹³ However, the absence of peer-review mechanisms and stringent regulatory oversight has resulted in highly variable content quality, with numerous videos containing misleading or inaccurate information. Several studies have indicated that the majority of health-related content on TikTok lacks scientific accuracy, while Bilibili also faces similar issues concerning the completeness and reliability of its content.^14,15

Although existing studies have evaluated video quality for various conditions (eg, laryngeal carcinoma,¹⁶ colorectal polyps,¹⁷ rectal cancer,¹⁸ metabolic dysfunction-associated steatotic liver disease¹⁹ ) on these platforms, the reliability of Hepatitis B-specific content remains largely unexamined. This study employs a multidimensional assessment framework to systematically evaluate the quality and reliability of Hepatitis B-related videos on TikTok and Bilibili. Our findings aim to provide evidence-based recommendations for public health information selection while informing platform content regulation policies to optimize health communication in the digital era.

Methods

Search Strategy and Data Collection

This study is a cross-sectional content analysis aimed at evaluating the quality and reliability of publicly available hepatitis B-related health information on mainstream short-video platforms in China. On April 1, 2025, we systematically collected the top 100 search results for “Hepatitis B” (Chinese: “乙型病毒性肝炎”) from both Bilibili and TikTok (China’s version) using newly registered accounts to control for algorithmic bias (Figure 1). After removing duplicate videos (defined as identical content from different uploaders) and irrelevant entries based on title screening, we obtained a final sample of 100 videos per platform. We confined our analysis to the top 100 videos, as multiple studies have demonstrated that videos beyond the top 100 exert no significant influence on the analytical outcomes.^14,20,21 Furthermore, our final sample of 200 videos exceeds the sample sizes of numerous comparable studies in the field and enabled the detection of statistically significant differences in our primary outcomes between platforms. For each video, we recorded the title, uploader characteristics, duration, engagement metrics (likes, comments, shares, saves), and days since published.

Figure 1.

Flow chart of study selection and inclusion of videos related to Hepatitis B (乙型病毒性肝炎).

Classification of Videos

The videos were classified by upload source (hepatologists, non-hepatologists, patients, and science communicators), content type (detection, disease knowledge, treatment, and reports and news), and format (animation and live videos), providing a comprehensive categorization framework for analysis. For more detailed information, please refer to the online Supplemental material.

Methodology for Assessing Video Quality and Reliability

Video quality and reliability were systematically evaluated using 3 validated instruments: the Global Quality Scale (GQS) assessed overall content quality, while the modified DISCERN scale (mDISCERN) and JAMA benchmark criteria were employed to evaluate reliability. The mDISCERN instrument assessed reliability through 5 dichotomous items (1 = present, 0 = absent), with total scores categorized into 5 reliability tiers: unreliable (0-1), marginally reliable (2), moderately reliable (3), largely reliable (4), and highly reliable (5).²² The JAMA score evaluated 4 fundamental quality attributes (authorship, attribution, currency, and disclosure), yielding a maximum possible score of 4 points.²³ The GQS rated content quality on a 5-point Likert scale (1 = exceptionally poor to 5 = excellent quality).²⁴ For more detailed information, please refer to the online Supplemental material.
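The scoring rules described above can be sketched in a few lines. This is a minimal illustration only: the function names (`mdiscern_total`, `mdiscern_tier`, `jama_total`) are hypothetical and are not part of the study's materials, and the item wording of each instrument is left to the Supplemental material.

```python
from typing import Sequence

# Reliability tiers for the mDISCERN total, as listed in the text.
MDISCERN_TIERS = {
    0: "unreliable",
    1: "unreliable",
    2: "marginally reliable",
    3: "moderately reliable",
    4: "largely reliable",
    5: "highly reliable",
}


def mdiscern_total(items: Sequence[int]) -> int:
    """Sum the 5 dichotomous mDISCERN items (1 = present, 0 = absent)."""
    if len(items) != 5 or any(item not in (0, 1) for item in items):
        raise ValueError("mDISCERN expects exactly 5 items coded 0 or 1")
    return sum(items)


def mdiscern_tier(total: int) -> str:
    """Map an mDISCERN total (0-5) to its reliability tier."""
    if total not in MDISCERN_TIERS:
        raise ValueError("mDISCERN total must be an integer from 0 to 5")
    return MDISCERN_TIERS[total]


def jama_total(authorship: bool, attribution: bool,
               currency: bool, disclosure: bool) -> int:
    """Count how many of the 4 JAMA benchmark attributes are met (0-4)."""
    return sum(int(flag) for flag in (authorship, attribution, currency, disclosure))


# Hypothetical video: 3 of 5 mDISCERN items present, 2 of 4 JAMA attributes met.
total = mdiscern_total([1, 1, 0, 1, 0])
tier = mdiscern_tier(total)
jama = jama_total(authorship=True, attribution=True, currency=False, disclosure=False)
```

Under these rules, the hypothetical video above would be classed as moderately reliable on mDISCERN and would score 2 of 4 on the JAMA benchmarks.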

Evaluation Process

Two trained medical evaluators (BW C and WJ S) independently scored all videos following standardized training to ensure rating consistency. In cases where the ratings from the 2 evaluators were inconsistent, an arbitrator (XY C) was consulted to assign the final score. Thereafter, all authors unanimously agreed on the final ratings.

Ethics Consideration and Consent to Participate

This study did not require approval by the local Research Ethics Board as it involved publicly available data only. All information was accessed and obtained publicly, and no interaction with any individual was involved. No personal information, clinical data, or human specimens were used. All data were anonymized and presented in aggregate form, ensuring that no individual content creator could be identified from the reported results.

Statistical Analyses

Given the non-normal distribution of the data, results are presented as medians with interquartile ranges (IQR). Nonparametric tests were applied for group comparisons: the Mann–Whitney U test for 2-group analyses and the Kruskal–Wallis H test for multi-group comparisons. Inter-rater reliability, calculated using Cohen’s κ coefficient, was interpreted per Landis and Koch criteria: κ > 0.80 (almost perfect agreement), 0.61 to 0.80 (substantial agreement), 0.41 to 0.60 (moderate agreement), and ≤0.40 (poor agreement). Spearman’s correlation assessed (a) inter-variable relationships among video characteristics and (b) associations between video ratings and other parameters. All tests were 2-tailed, with P < .05 considered statistically significant. Data analysis was conducted using GraphPad Prism version 9.0.0 for Windows.
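As an illustration of the inter-rater statistic named above, the following is a minimal sketch of unweighted Cohen's κ together with the interpretation bands quoted in the text. It assumes the unweighted form; the text does not state whether a weighted κ was used for these ordinal scores. The function names and the example ratings are hypothetical, and the study itself reports using GraphPad Prism rather than this code.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohen_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: share of items given the same score by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the raters' marginal proportions per category.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use one identical category")
    return (observed - expected) / (1.0 - expected)


def interpret_kappa(kappa: float) -> str:
    """Interpretation bands as quoted in the text (cut points at 0.80, 0.60, 0.40)."""
    if kappa > 0.80:
        return "almost perfect agreement"
    if kappa > 0.60:
        return "substantial agreement"
    if kappa > 0.40:
        return "moderate agreement"
    return "poor agreement"


# Hypothetical GQS ratings of 6 videos by two evaluators (not study data).
scores_a = [4, 3, 4, 2, 5, 3]
scores_b = [4, 3, 3, 2, 5, 3]
kappa = cohen_kappa(scores_a, scores_b)
label = interpret_kappa(kappa)
```

In this hypothetical example the raters agree on 5 of 6 videos, and chance agreement is 10/36, so κ = 10/13 ≈ 0.77.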

Results

Video Characteristics

Based on our keyword search, we obtained 200 videos for data extraction and analysis: 100 from TikTok and 100 from Bilibili. The general characteristics of the videos are presented in Table 1, which shows that compared to TikTok videos, Bilibili videos had longer durations and more days since publication, but fewer likes, comments, shares, and saves (P < .001).

Table 1.

Characteristics of the Videos in TikTok and Bilibili.

Variable	Bilibili (n = 100), median (IQR)	TikTok (n = 100), median (IQR)	Wilcoxon rank-sum test, z score	Wilcoxon rank-sum test, P value
Likes	124.5 (36, 628.5)	5320.5 (1749, 13 837)	10.06	<.001
Comments	25.5 (2, 88)	362.5 (112, 1057)	8.26	<.001
Shares	61 (22, 203)	1253.5 (376, 4619)	8.85	<.001
Saves	132 (41.5, 399)	2488.5 (635, 8976)	9.07	<.001
Days since published	844 (449.5, 1353)	103 (35.5, 342.5)	7.94	<.001
Followers	3720 (325.5, 45 741)	149 000 (22 000, 443 000)	7.62	<.001
Total likes	8951 (1487, 133 710.5)	646 000 (128 500, 2 494 000)	7.86	<.001
Duration	257 (120.5, 575.5)	88 (49.5, 149)	7.78	<.001
GQS score	4 (3, 4)	4 (3, 4)	0.49	.587
Modified DISCERN score	3 (3, 4)	4 (3, 4)	3.09	<.001
JAMA score	2 (2, 3)	3 (3, 3)	6.24	<.001
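For readers who wish to see how a z score of the kind reported in Table 1 can be obtained, the sketch below computes the Wilcoxon rank-sum statistic with a normal approximation and a tie correction. It is an assumption-laden illustration: it omits a continuity correction, the function names are hypothetical, and the exact settings used by the statistical software in the study are not stated in the text. The sign of z depends on which group is passed first; Table 1 reports magnitudes.

```python
import math
from collections import Counter
from typing import List, Sequence, Tuple


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def rank_sum_z(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Return (z, two-sided P) for the Wilcoxon rank-sum test, normal approximation."""
    n1, n2 = len(x), len(y)
    if n1 == 0 or n2 == 0:
        raise ValueError("both groups must be non-empty")
    n = n1 + n2
    pooled = list(x) + list(y)
    ranks = average_ranks(pooled)
    w = sum(ranks[:n1])                  # rank sum of the first group
    mean_w = n1 * (n + 1) / 2            # expected rank sum under the null
    tie_term = sum(t ** 3 - t for t in Counter(pooled).values())
    var_w = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_w <= 0:
        raise ValueError("all observations are tied; the statistic is undefined")
    z = (w - mean_w) / math.sqrt(var_w)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided P from the standard normal
    return z, p


# Hypothetical like counts for two small groups of videos (not study data).
z, p = rank_sum_z([1, 2, 3], [4, 5, 6])
```

With only 3 observations per group the normal approximation is crude; it is shown here only to make the arithmetic easy to follow.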

Figure 2 and Tables 2 and 3 provide a detailed breakdown of the video characteristics on Bilibili and TikTok. Regarding the sources of the videos, science communicators uploaded the most videos on Bilibili (66/100, 66%), followed by hepatologists (22/100, 22%). On TikTok, hepatologists (57/100, 57%) and non-hepatologists (30/100, 30%) were the primary video uploaders. Compared to videos uploaded by science communicators, videos posted by hepatologists and non-hepatologists on TikTok received fewer likes, comments, saves, and shares. The most prevalent content type was disease knowledge on Bilibili (65/100, 65%) and treatment on TikTok (55/100, 55%). Conversely, videos related to detection and to reports and news were the least common on both platforms. However, detection-related videos generally received a higher number of likes, comments, saves, and shares on both platforms. Additionally, 67% (67/100) of the videos on Bilibili were live videos, while 33% (33/100) were animations. On TikTok, 98% (98/100) were live videos, and 2% (2/100) were animations. Although live videos predominated on both platforms, the 2 animations on TikTok had higher median likes, comments, and shares than live videos.

Figure 2.

Percentage of videos according to video uploaders, video contents, and video formats on TikTok and Bilibili. (A) Overall types of video uploaders. (B) Types of video uploaders on TikTok and Bilibili respectively. (C) Overall categorization of video content. (D) Categorization of video content on TikTok and Bilibili. (E) Overall distribution of video formats. (F) Distribution of video formats on TikTok and Bilibili.

Table 2.

Characteristics of the Videos Across Sources and Content in Bilibili.

Variable	Likes	Comments	Shares	Saves	Days since published	Duration
Video uploaders (n = 100), median (IQR)
Hepatologists (n = 22)	90 (28, 186)	40.5 (2, 79)	41.5 (9, 153)	81.5 (29, 174)	804.5 (431, 1111)	139 (103, 272)
Non-hepatologists (n = 10)	306 (198, 920)	25 (18, 55)	131 (73, 190)	353 (189, 920)	971 (648, 1578)	227 (94, 563)
Patients (n = 2)	68 (64, 72)	52 (32, 72)	10 (7, 13)	45.5 (31, 60)	352.5 (133, 572)	888.5 (208, 1569)
Science communicators (n = 66)	128 (27, 733)	19 (2, 103)	61 (23, 280)	141.5 (42, 452)	878 (405, 1408)	275 (153, 761)
Video contents (n = 100), median (IQR)
Detection (n = 8)	774 (120.5, 1293)	41.5 (13, 97)	180.5 (94, 331.5)	720.5 (146, 1794.5)	839 (587, 1251.5)	231.5 (163, 705)
Disease knowledge (n = 65)	123 (27, 542)	23 (2, 83)	62 (22, 216)	136 (41, 353)	915 (464, 1475)	275 (142, 739)
Reports and news (n = 9)	121 (64, 804)	72 (32, 383)	56 (13, 249)	60 (42, 310)	1098 (926, 1193)	208 (112, 253)
Treatment (n = 18)	61.5 (35, 404)	23 (2, 45)	41.5 (14, 96)	99.5 (38, 273)	507.5 (112, 824)	138.5 (88, 305)
Video formats (n = 100), median (IQR)
Animation (n = 33)	143 (39, 629)	23 (2, 99)	62 (21, 370)	134 (37, 407)	907 (355, 1486)	261 (125, 561)
Live video (n = 67)	130 (42, 341)	26 (2, 83)	59 (22, 180)	130 (42, 341)	824 (499, 1272)	253 (114, 661)

Table 3.

Characteristics of the Videos Across Sources and Content in TikTok.

Variable	Likes	Comments	Shares	Saves	Days since published	Duration
Video uploaders (n = 100), median (IQR)
Hepatologists (n = 57)	4173 (1138, 12 134)	349 (95, 799)	1196 (339, 3733)	2472 (621, 8856)	87 (35, 356)	101 (49, 134)
Non-hepatologists (n = 30)	5471.5 (2686, 17 942)	286.5 (167, 1103)	1254.5 (567, 6111)	2369.5 (451, 13 236)	123.5 (39, 245)	67 (46, 157)
Patients (n = 5)	3863 (2552, 9004)	1053 (1015, 1209)	983 (708, 4340)	2647 (926, 3968)	87 (85, 100)	153 (105, 179)
Science communicators (n = 8)	11 569 (6059, 25 012.5)	894.5 (457.5, 5405)	3423 (1828, 8176.5)	4633.5 (869.5, 8910)	613 (134.5, 1226.5)	76.5 (63, 149)
Video contents (n = 100), median (IQR)
Detection (n = 2)	24 002 (418, 47 586)	1303.5 (68, 2539)	8972.5 (90, 17 855)	20 839.5 (211, 41 468)	391 (32, 750)	77.5 (32, 123)
Disease knowledge (n = 36)	5183 (1749, 15 701)	413 (216.5, 1131)	1224 (376, 5125)	2010 (626.5, 6498)	105.5 (44, 300.5)	103 (57.5, 177)
Reports and news (n = 7)	3380 (2214, 7216)	299 (220, 509)	614 (247, 1819)	1527 (514, 5045)	175 (25, 509)	54 (33, 89)
Treatment (n = 55)	5706 (1146, 13 397)	294 (95, 1103)	1577 (425, 4151)	3191 (662, 11 627)	96 (33, 356)	76 (49, 132)
Video formats (n = 100), median (IQR)
Animation (n = 2)	8953 (3629, 14 277)	654 (286, 1022)	2598.5 (1028, 4151)	869.5 (730, 1009)	1226.5 (744, 1709)	149 (104, 194)
Live video (n = 98)	5320.5 (1519, 13 397)	362.5 (104, 1061)	1253.5 (367, 4898)	2576 (621, 9042)	98 (35, 300)	85.5 (49, 146)

Video Quality and Reliability Assessments

Video quality was assessed using the GQS, while reliability was evaluated using the mDISCERN and JAMA scores (Table 4). Agreement between the 2 evaluators was high, with a κ value of 0.81.

Table 4.

GQS, Modified DISCERN, and JAMA Scores by Video Source, Content, and Format for Hepatitis B-Related Videos.

Variable	GQS (median (IQR))	Modified DISCERN (median (IQR))	JAMA (median (IQR))
Video uploaders
Hepatologists	4 (3, 4)	4 (4, 4)	3 (3, 3)
Non-hepatologists	3 (3, 4)	3 (3, 4)	3 (3, 3)
Patients	3 (2, 3)	2 (2, 3)	2 (1, 2)
Science communicators	4 (3, 4)	3 (3, 4)	2 (2, 3)
Video contents
Detection	4 (3, 4)	3.5 (3, 4)	3 (2, 3)
Disease knowledge	4 (3, 4)	3 (3, 4)	3 (2, 3)
Reports and news	3.5 (3, 4)	3 (3, 4)	3 (2.5, 3)
Treatment	4 (3, 4)	4 (3, 4)	3 (3, 3)
Video formats
Animation	4 (3, 4)	3 (3, 4)	2 (2, 3)
Live video	4 (3, 4)	4 (3, 4)	3 (3, 3)

Comparison of Platforms

The assessment of video quality and reliability revealed evident disparities between the 2 platforms. TikTok videos achieved median (IQR) scores of 4 (3-4) for GQS, 4 (3-4) for mDISCERN, and 3 (3-3) for JAMA. Bilibili content showed comparable yet slightly lower median scores: 4 (3-4) for GQS, 3 (3-4) for mDISCERN, and 2 (2-3) for JAMA. Comparative analysis indicated that TikTok outperformed Bilibili in terms of reliability, with statistically significant superiority in the JAMA and mDISCERN metrics (P < .001). However, no significant difference was observed in the GQS metric (P = .587), suggesting comparable content quality between the 2 platforms (Figure 3).

Figure 3.

The mDISCERN, GQS, and JAMA scores of videos related to Hepatitis B on TikTok and Bilibili. (A) Comparison of mDISCERN score between TikTok and Bilibili videos. (B) Comparison of GQS score between TikTok and Bilibili videos. (C) Comparison of JAMA score between TikTok and Bilibili videos. (D) Ridge plot showing the overall distribution of mDISCERN score. (E) Ridge plot showing the overall distribution of GQS score. (F) Ridge plot showing the overall distribution of JAMA score. NS indicates not significant (P ≥ .05). **P < .01, ***P < .001.

Comparison of Uploaders

Videos uploaded by hepatologists consistently scored highest on all 3 assessment instruments. Non-hepatologists and science communicators typically produced higher-quality videos than patients, yet their performance still lagged behind that of hepatologists. Notably, patient-generated videos consistently ranked lowest across all scoring systems (Figure 4).

Figure 4.

The mDISCERN, GQS, and JAMA score of videos related to Hepatitis B from different video uploaders. (A) mDISCERN score. (B) GQS score. (C) JAMA score. **P < .01, ***P < .001.

Comparison of Content

In terms of content quality, the 4 types of video content exhibited no significant differences in GQS scores, all demonstrating moderate to good quality. However, there were some differences in reliability: treatment-related videos showed higher reliability on both the JAMA and mDISCERN scoring systems (Figure 5).

Figure 5.

The mDISCERN, GQS, and JAMA score of videos related to Hepatitis B from different video content. (A) mDISCERN score. (B) GQS score. (C) JAMA score. *P < .05, **P < .01, ***P < .001.

Comparison of Format

On the GQS, animated and live videos received similar content quality scores, with no statistically significant difference. In contrast, live videos scored significantly higher than animated videos on both reliability instruments, JAMA and mDISCERN (Figure 6).

Figure 6.

The mDISCERN, GQS, and JAMA score of videos related to Hepatitis B from different video format. (A) mDISCERN score. (B) GQS score. (C) JAMA score. NS indicates not significant (P ≥ .05). **P < .01, ***P < .001.

Spearman Correlation Analysis

Given the non-normal distribution of the data, Spearman’s rank correlation analysis was employed to examine the relationships between video engagement metrics (likes, comments, shares, and saves), video duration, time since publication, and 3 quality assessment scales (see Figure 7 and Tables 5 and 6). The analysis revealed significant positive correlations among all engagement metrics (P < .05), with correlation coefficients (r) ranging from .85 to .95, indicating a strong association among these variables. Video duration demonstrated significant negative correlations with likes (r = −.46, P < .001), comments (r = −.36, P < .001), shares (r = −.47, P < .001), and saves (r = −.40, P < .001), suggesting that longer videos tend to have lower engagement metrics. In contrast, a significant positive correlation was observed between video duration and days since publication (r = .26, P < .001), implying that longer videos are often published earlier.
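A minimal sketch of Spearman's rank correlation, the statistic used in this analysis, is given below. It computes the Pearson correlation of average ranks, which handles tied values. The function names and example values are hypothetical; the study reports using GraphPad Prism, and the method behind the confidence intervals in Tables 5 and 6 is not stated, so none is computed here.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    mean_x = sum(rx) / len(rx)
    mean_y = sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in rx))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in ry))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / (sd_x * sd_y)


# Hypothetical durations (seconds) and like counts for 5 videos (not study data).
durations = [30, 60, 120, 240, 480]
likes = [900, 700, 650, 200, 100]
r_duration_likes = spearman_r(durations, likes)
```

In this hypothetical example likes fall strictly as duration rises, so the coefficient is −1; the study's observed value for duration versus likes was much weaker (r = −.46).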

Figure 7.

Spearman correlation analysis among different video variables, mDISCERN, GQS, and JAMA score concerning Hepatitis B videos. *P < .05, **P < .01, ***P < .001.

Table 5.

Spearman Correlation Analysis Between the Video Variables.

Variable	Likes	Comments	Shares	Saves	Days since published	Duration
Likes
r	1	-	-	-	-	-
P value	-	-	-	-	-	-
Comments
r	0.91 (0.88, 0.93)	1	-	-	-	-
P value	<.001^a	-	-	-	-	-
Shares
r	0.95 (0.93, 0.96)	0.88 (0.84, 0.91)	1	-	-	-
P value	<.001^a	<.001^a	-	-	-	-
Saves
r	0.95 (0.92, 0.96)	0.85 (0.80, 0.88)	0.95 (0.92, 0.96)	1	-	-
P value	<.001^a	<.001^a	<.001^a	-	-	-
Days since published
r	−0.18 (−0.31, −0.05)	−0.04 (−0.16, 0.10)	−0.06 (−0.19, 0.08)	−0.17 (−0.30, −0.05)	1	-
P value	.01^a	.59	.43	.01^a	-	-
Duration
r	−0.46 (−0.56, −0.33)	−0.36 (−0.48, −0.22)	−0.47 (−0.57, −0.35)	−0.40 (−0.51, −0.27)	0.26 (0.14, 0.37)	1
P value	<.001^a	<.001^a	<.001^a	<.001^a	<.001^a	-

^a Significant at P < .05.

Table 6.

Spearman Correlation Analysis Between Video Variables and the GQS, Modified DISCERN, and JAMA Scores.

Variable	GQS	Modified DISCERN	JAMA
Likes
r	0.17 (0.03, 0.30)	0.13 (−0.02, 0.26)	0.31 (0.18, 0.43)
P value	.02^a	.07	<.001^a
Comments
r	0.16 (0.03, 0.30)	0.12 (−0.02, 0.26)	0.24 (0.11, 0.37)
P value	.02^a	.08	<.001^a
Shares
r	0.19 (0.05, 0.32)	0.09 (−0.05, 0.24)	0.25 (0.12, 0.38)
P value	<.001^a	.18	<.001^a
Saves
r	0.16 (0.01, 0.28)	0.15 (0.00, 0.28)	0.25 (0.12, 0.36)
P value	.02^a	.04	<.001^a
Days since published
r	0.15 (0.01, 0.29)	−0.13 (−0.27, 0.01)	−0.34 (−0.46, −0.21)
P value	.03	.06	<.001^a
Duration
r	−0.10 (−0.24, 0.05)	−0.08 (−0.22, 0.06)	−0.46 (−0.57, −0.35)
P value	.16	.28	<.001^a

^a Significant at P < .05.

In terms of quality assessment, GQS scores and JAMA scores showed weak but significant positive correlations with engagement metrics such as likes, comments, shares, and saves, whereas mDISCERN scores did not exhibit correlations with these metrics. Notably, JAMA scores exhibited significant negative correlations with both video duration (r = −.46, P < .001) and days since publication (r = −.34, P < .001), indicating that high-quality videos, as assessed by the JAMA criteria, tend to be shorter in duration and have a shorter time since publication.

Discussion

This cross-sectional study yielded 3 principal findings regarding Hepatitis B-related information on short-video platforms. Firstly, while the overall content quality was similar between TikTok and Bilibili, TikTok videos demonstrated significantly higher reliability as measured by both the JAMA and mDISCERN instruments. Secondly, the source of the video was a critical determinant of quality; content created by hepatologists was consistently superior in both quality and reliability compared to that from non-specialists, patients, or science communicators. Thirdly, a notable dissociation was observed between quality and popularity, as videos with higher reliability scores did not consistently achieve greater user engagement. These core findings highlight both the potential and the pitfalls of using short-video platforms for public health communication regarding Hepatitis B.

Hepatitis B is a significant global health concern, affecting millions of individuals worldwide and posing a substantial burden on public health systems.²⁵ Effective health communication is critical for disease prevention, early detection, and treatment adherence. In recent years, social media platforms have emerged as powerful tools for health communication, reaching vast audiences with diverse information.^26,27 For instance, Bilibili has evolved into a comprehensive knowledge-sharing platform with strong educational communities, while TikTok dominates mobile health content consumption in China. These 2 platforms represent the dominant short-video ecosystems in mainland China, where other international platforms like YouTube, Facebook and Instagram have limited accessibility. Both platforms specialize in the short-form video format that characterizes contemporary health communication trends, allowing for methodologically consistent comparisons. This focused selection ensures our findings are directly applicable to the primary health information sources for Hepatitis B patients in Chinese-speaking populations. Nevertheless, the absence of rigorous medical content vetting processes on these platforms contributes to substantial variability in information quality.²⁸ Recognizing both the persistent health education needs of Hepatitis B patients and the expanding influence of short-form video content in medical communication, this study conducted a systematic assessment of Hepatitis B-related video quality and reliability across these platforms, aiming to help patients identify trustworthy health information.

To our knowledge, the quality of online Hepatitis B information, particularly on short-video platforms, remains underexplored. As the pioneering study addressing this research gap, our findings reveal that while the videos on both platforms were similar in terms of content quality, there were significant differences in reliability. Videos on TikTok scored significantly higher in reliability according to both the JAMA and mDISCERN scoring systems compared to those on Bilibili. This discrepancy primarily stems from systematic differences between the 2 platforms in content format, dissemination mechanisms, and most critically, moderation policies and incentive structures. TikTok’s short-video format facilitates rapid production and distribution, while Bilibili’s longer videos typically involve extended production cycles, resulting in slower updates and weaker interactivity. TikTok creators prioritize conciseness and accessibility to meet users’ need for quick information acquisition. In contrast, although Bilibili hosts substantial high-quality content, its overall quality varies significantly, with some videos being overly specialized or lengthy, making it difficult for users to extract key information efficiently. Notably, TikTok employs a stricter, proactive moderation system for health information, explicitly prohibiting “misleading medical claims” through automated keyword filtering and dedicated human review. While Bilibili also conducts moderation, it remains less centralized regarding health misinformation, relying more on post-publication user reports and community feedback, which may lead to delayed or inconsistent handling of unreliable content. Furthermore, platform incentive structures differ substantially. TikTok’s algorithm prioritizes engagement and completion rates, rewarding creators who present clear, credible, and easily digestible information—often by citing sources and displaying credentials. 
Conversely, Bilibili’s ecosystem encourages depth, discussion, and community interaction, which may accommodate or even reward speculative or opinion-based content within its longer-form, discussion-oriented culture. Collectively, these factors result in significant differences in content style and quality between the platforms, ultimately influencing the reliability of health information.

Notably, videos created by healthcare professionals, particularly hepatologists, showed significantly higher educational value, quality, and reliability than those from other sources. This advantage likely stems from their specialized expertise, comprehensive understanding of clinical guidelines, and updated research knowledge. In contrast, non-professional creators (eg, patients and science communicators) predominantly relied on personal experiences and subjective opinions, potentially introducing informational biases. These findings underscore the substantial impact of professional background on content quality. However, the data reveal a key paradox: high-quality content did not translate into high user engagement. On the Bilibili platform, the median number of likes for videos by hepatologists was only 90 (IQR: 28-186), significantly lower than that for non-hepatologists, which stood at 306 (IQR: 198-920); on TikTok, although the median number of likes for videos by hepatologists reached 4173 (IQR: 1138-12 134), it was still lower than that for science communicators, which was 11 569 (IQR: 6059-25 012.5). These findings are consistent with those of Chen et al., who reported that videos on thyroid nodules posted by thyroid experts, despite their superior content quality, received fewer likes, comments, saves, and shares.²⁹ In addition, similar phenomena have been observed in several other comparable studies, indicating that higher-quality videos do not necessarily attract more attention.^30,31

This engagement-reliability paradox appears to stem from multiple factors. The rigorous terminology and complex professional language in expert-created videos likely present comprehension barriers for lay audiences, potentially limiting their appeal. Additionally, platform algorithms that prioritize highly interactive content over quality may further exacerbate this gap.³² Furthermore, short videos entail a tension between educational rigor and entertainment appeal. Professional content often prioritizes factual accuracy over engaging elements such as humor, trending audio, and dynamic editing, which are algorithmically promoted and drive user engagement. Beyond the dimensions of quality and reliability assessed in this study, the health literacy demands of video content warrant careful consideration. While hepatologist-created videos scored higher on reliability metrics, their actual comprehensibility for the general public remains uncertain. The multimodal nature of short videos—integrating visual demonstrations, verbal explanations, and on-screen text—creates both opportunities and challenges for meeting diverse health literacy needs. When reliable health information is not presented in an understandable format, it fails to achieve its educational purpose regardless of scientific quality. This underscores the urgent need for healthcare professionals to develop skills in creating content that balances scientific accuracy with accessibility, while platforms should implement algorithms that value both reliability and comprehensibility in content distribution. Future research should incorporate systematic evaluation of both linguistic complexity and visual clarity using validated health literacy assessment tools to ensure that reliable information is truly understandable to its intended audience.

Content type also influenced video quality and reliability. Videos focusing on treatment exhibited higher reliability scores, possibly because treatment-related information is more likely to be based on established clinical guidelines and research findings. Conversely, videos related to detection, while less common, received higher engagement metrics, suggesting that the public is highly interested in early detection methods. This highlights the need for platforms to encourage the production of high-quality detection-related content to meet public demand.

This study employed Spearman correlation analysis to investigate the relationship between video quality and user engagement metrics. The results demonstrated significant positive correlations (P < .05) among likes, comments, saves, and shares, indicating strong interrelationships between these engagement indicators, consistent with previous research findings.^29,33 This strong correlation indicates that positive user feedback is likely to be expressed through various forms of interaction. Video duration was found to have significant negative correlations with engagement metrics, implying that shorter videos are more effective in capturing user attention and fostering interaction. This finding highlights the importance of brevity in digital content to maintain user engagement. In terms of quality assessment, GQS and JAMA scores showed significant positive correlations with engagement metrics, indicating that high-quality, reliable content is more likely to be engaged with by users. However, mDISCERN scores did not exhibit significant correlations with engagement metrics, suggesting that while reliability is important, other factors such as content presentation and user experience also play crucial roles in driving engagement. Notably, JAMA scores exhibited significant negative correlations with video duration and days since publication, indicating that high-quality videos, as assessed by the JAMA criteria, tend to be shorter and more recent. This highlights the importance of timely, concise content in maintaining user interest and trust. Furthermore, our analysis revealed moderate intercorrelations among JAMA, mDISCERN, and GQS scores. This indicates that while these assessment tools emphasize different aspects, they share common ground in evaluating overall video quality and reliability. In conclusion, although video quality demonstrates some association with user engagement, this relationship is not strong and is influenced by multiple factors.
Therefore, relying solely on engagement metrics to assess video quality and reliability would be insufficient. Platform operators and content creators should place greater emphasis on scientific rigor and professional standards to enhance the quality and reliability of health information dissemination.
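For readers less familiar with the statistic used above, the following is a minimal sketch of how a Spearman rank correlation coefficient can be computed: both variables are converted to ranks (tied values share the mean of their positions), and the Pearson correlation of those ranks is taken. The function names and the sample values are illustrative assumptions, not the study's analysis code or data, and the sketch computes only the coefficient, not the P value.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the window over a run of tied values.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; raises ZeroDivisionError if either input is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman's rho is the Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values for illustration only; NOT data from the study.
likes = [120, 4500, 980, 15000, 310, 2200]
gqs = [3, 4, 3, 4, 2, 5]
rho = spearman_rho(likes, gqs)
print(f"Spearman rho = {rho:.2f}")
```

Because the coefficient depends only on ranks, it suits heavily skewed engagement counts and ordinal quality scores such as GQS, where a few viral videos would otherwise dominate a linear correlation.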

Our findings offer valuable insights for health policy development, platform content regulation, and public health communication strategies. These insights carry specific implications for Hepatitis B patients seeking information, healthcare providers who recommend resources, and public health campaigns. For patients, the variability in video reliability underscores the need to prioritize content from verified hepatologists while exercising caution with patient-generated and non-professional sources. Healthcare providers should recognize the heterogeneous quality of short video content and consider actively recommending or creating reliable materials to supplement patient education. Public health campaigns could leverage the high engagement potential of these platforms by collaborating with authoritative creators and developing concise, evidence-based messaging tailored to short video formats.

To translate these implications into concrete actions, we propose a multi-level framework for systemic improvement. Platform operators should optimize their content recommendation algorithms by prioritizing source credibility (such as verified professional credentials) alongside engagement metrics, thereby enhancing the dissemination of reliable health information. Regulatory and professional bodies should establish and promote a clear, visible social media verification system for medical practitioners to increase the transparency and credibility of authoritative sources. Simultaneously, user education initiatives should be strengthened through in-platform literacy prompts, micro-learning content developed in collaboration with health organizations, and practical checklists to help viewers critically evaluate medical videos. These coordinated measures would improve the overall ecosystem of health information dissemination on short video platforms.
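As a purely illustrative sketch of the first recommendation, a ranking score could blend a source-credibility signal with a normalized engagement signal. Everything below is a hypothetical assumption made for illustration: the `Video` fields, the log scaling of likes, and the `credibility_weight` default of 0.6. None of it describes an existing platform algorithm or anything evaluated in this study.

```python
from dataclasses import dataclass
from math import log1p


@dataclass
class Video:
    title: str
    likes: int
    verified_professional: bool  # hypothetical platform verification flag


def ranking_score(video, max_likes, credibility_weight=0.6):
    """Blend credibility (0 or 1) with log-scaled engagement in [0, 1]."""
    # Log scaling dampens the influence of a few viral outliers.
    engagement = log1p(video.likes) / log1p(max_likes) if max_likes > 0 else 0.0
    credibility = 1.0 if video.verified_professional else 0.0
    return credibility_weight * credibility + (1 - credibility_weight) * engagement


def rank(videos, credibility_weight=0.6):
    """Return videos sorted from highest to lowest blended score."""
    max_likes = max((v.likes for v in videos), default=0)
    return sorted(
        videos,
        key=lambda v: ranking_score(v, max_likes, credibility_weight),
        reverse=True,
    )


# Hypothetical example: a popular unverified video and a modest verified one.
videos = [
    Video("Popular unverified clip", likes=10000, verified_professional=False),
    Video("Hepatologist explainer", likes=100, verified_professional=True),
]
for v in rank(videos):
    print(v.title)
```

With `credibility_weight=0` the ordering falls back to engagement alone, which makes the trade-off between popularity and source credibility explicit and tunable.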

This study has several strengths. (1) It presents a systematic evaluation of Hepatitis B-related video quality and reliability across two major short-video platforms; analyzing TikTok and Bilibili concurrently reduces the influence of platform-specific biases and improves the representativeness of our findings. (2) We implemented a comprehensive assessment framework integrating the GQS, mDISCERN, and JAMA scoring systems to provide a multidimensional evaluation of content quality and reliability. (3) Our comparative analysis of different uploader categories revealed the substantial influence of professional background on video quality. Based on these findings, we specifically recommend that platforms enhance their content review mechanisms and prioritize the dissemination of professionally produced, high-quality health information to improve public health literacy. Additionally, our results highlight the importance of specialized training for healthcare professionals and their active engagement in digital health communication to elevate the overall quality of medical information on short-video platforms.

However, this study has several limitations that should be acknowledged. Firstly, as a single-time-point analysis, our findings reflect a snapshot of content shaped by platform algorithms and do not capture longitudinal trends or seasonal variations in health information dissemination. Secondly, the classification of video uploaders relies primarily on verification by the short-video platforms. Although platforms may check uploaders’ credentials, such as medical practitioner licenses, during verification, misclassification may still occur because of missing or incorrect verification information. Thirdly, the focus on TikTok and Bilibili, while methodologically deliberate for studying short-form video ecosystems, means our findings may not fully represent the health information landscape on other channels, such as YouTube, AI-powered tools, and other web-based sources. YouTube remains a key source for longer medical content,³⁴ AI platforms provide synthesized information,³⁵ and official health websites offer higher editorial standards.⁷ Fourthly, the study did not assess the accuracy or medical correctness of the video content, viewer perceptions of content quality, or behavioral and knowledge outcomes following video exposure, and selection bias in the sampled videos cannot be excluded. Additionally, although we employed standardized assessment tools (GQS, mDISCERN, and JAMA scales) and maintained high inter-rater reliability, potential biases inherent in subjective scoring should be considered. Notwithstanding these limitations, this research establishes an important baseline understanding of Hepatitis B information quality on short-video platforms and demonstrates the value of systematic content evaluation in this domain.
Future research should expand the sample size and incorporate multi-language, multi-regional short-video platforms for comparative analysis to enhance the external validity of the findings. To advance this field, studies could also examine content evolution through longitudinal designs, assess user comprehension and retention, and extend the evaluation framework to other health conditions.

Conclusion

In our study, we collected and evaluated 200 Hepatitis B-related videos from TikTok and Bilibili. The overall content quality was similar between the 2 platforms, but TikTok videos showed significantly higher reliability based on JAMA and mDISCERN scores. Videos uploaded by hepatologists and those focusing on treatment were found to be of higher quality and reliability. This underscores the importance of professional expertise in health content creation. We suggest that short video platforms should enhance their review mechanisms to improve the reliability of health information. Additionally, users should be cautious and selective, prioritizing content from verified medical professionals to ensure they access accurate and reliable health information.

Supplemental Material

Supplemental material, sj-docx-1-inq-10.1177_00469580261441434 for Evaluating the Quality and Reliability of Hepatitis B-Related Short Videos on BiliBili and TikTok: A Cross-Sectional Study by Bowen Cheng, Wenjie Shi, Shu Wang, Huanbing Liu, Jing Xiong and Xiaoyan Chen in INQUIRY: The Journal of Health Care Organization, Provision, and Financing

Supplemental material, sj-docx-2-inq-10.1177_00469580261441434 for Evaluating the Quality and Reliability of Hepatitis B-Related Short Videos on BiliBili and TikTok: A Cross-Sectional Study by Bowen Cheng, Wenjie Shi, Shu Wang, Huanbing Liu, Jing Xiong and Xiaoyan Chen in INQUIRY: The Journal of Health Care Organization, Provision, and Financing

Supplemental material, sj-pdf-1-inq-10.1177_00469580261441434 for Evaluating the Quality and Reliability of Hepatitis B-Related Short Videos on BiliBili and TikTok: A Cross-Sectional Study by Bowen Cheng, Wenjie Shi, Shu Wang, Huanbing Liu, Jing Xiong and Xiaoyan Chen in INQUIRY: The Journal of Health Care Organization, Provision, and Financing

Footnotes

Acknowledgements

The authors would like to express their gratitude to all those who participated in the study.

Abbreviations

HBV: hepatitis B virus; GQS: Global Quality Scale.

ORCID iD

Xiaoyan Chen

Ethical Considerations

Consent to Participate

Due to the use of publicly available data with no personal identifiers, informed consent was not applicable.

Author Contributions

Chen XY contributed to the study conception and design; Cheng BW, Chen XY, and Xiong J were responsible for the review and scoring of the videos; Wang S and Liu HB conducted the data analysis; Cheng BW and Shi WJ prepared the manuscript; Chen XY and Xiong J interpreted the data and revised the manuscript; all authors contributed to the article and approved the final manuscript.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by The First Affiliated Hospital of Nanchang University Young Talents Research and Cultivation Fund (No. YFYPY202433), The Jiangxi Provincial Health Commission Science and Technology Plan Project (No. 202510223), and The Central Government-guided Local Science and Technology Development Fund (No. 20221ZDG020070).

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data Availability Statement

The data utilized and/or examined in this study can be accessed from the corresponding author upon request, provided that the request is justified and reasonable.

Supplemental Material

Supplemental material for this article is available online.

Patient and Public Involvement

Patients or the public were not involved in any aspect of our research, including its design, conduct, reporting, or dissemination plans.

References

1. GBD 2019 Hepatitis B Collaborators. Global, regional, and national burden of hepatitis B, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Gastroenterol Hepatol. 2022;7:796-829. doi:10.1016/S2468-1253(22)00124-8

2. Veracruz, Gish, Cheung, Chitnis, Wong RJ. Global incidence and mortality of hepatitis B and hepatitis C acute infections, cirrhosis and hepatocellular carcinoma from 2010 to 2019. J Viral Hepat. 2022;29:352-365. doi:10.1111/jvh.13663

3. Wang, Yan, Wang, Jin, Zheng ZJ. Global burden of hepatitis B attributable to modifiable risk factors from 1990 to 2019: a growing contribution and its association with socioeconomic status. Global Health. 2023;19:23. doi:10.1186/s12992-023-00922-z

4. Liang, et al. Epidemic trends and spatial distribution characteristics of hepatitis B in China: surveillance study. JMIR Public Health Surveill. 2025;11:e70888. doi:10.2196/70888

5. Yan, Chen, et al. Economic burden of hepatitis B patients and its influencing factors in China: a systematic review. Health Econ Rev. 2024;14:99. doi:10.1186/s13561-024-00584-6

6. Hsu, Huang, Nguyen MH. Global burden of hepatitis B virus: current status, missed opportunities and a call for action. Nat Rev Gastroenterol Hepatol. 2023;20:524-537. doi:10.1038/s41575-023-00760-9

7. Gunduz, Matis, Ozduran, Hanci. Evaluating the readability, quality, and reliability of online patient education materials on spinal cord stimulation. Turk Neurosurg. 2024;34:588-599. doi:10.5137/1019-5149.JTN.42973-22.3

8. van Deen, Simpson, Dupuy, Khalil, Bonthala, Spiegel BMR. Social media for the dissemination of educational videos about inflammatory bowel diseases. Am J Gastroenterol. 2022;117(8):1320-1323. doi:10.14309/ajg.0000000000001825

9. Özduran, Hanci. YouTube as a source of information about stroke rehabilitation during the COVID-19 pandemic. Neurology Asia. 2023;28(4):907-915. doi:10.54029/2023kif

10. Zenone, Barbic. TikTok and public health: a proposed research agenda. BMJ Glob Health. 2021;6:10. doi:10.1136/bmjgh-2021-007648

11. Wang, Yao, Wang, Chen, Ouyang, Xie. Bilibili, TikTok, and YouTube as sources of information on gastric cancer: assessment and analysis of the content and quality. BMC Public Health. 2024;24:57. doi:10.1186/s12889-023-17323-x

12. Feng, Malloch, Kravitz, et al. Assessing the effectiveness of a narrative-based patient education video for promoting opioid tapering. Patient Educ Couns. 2021;104:329-336. doi:10.1016/j.pec.2020.08.019

13. Ozduran, Hanci, Erkin. Evaluating the readability, quality and reliability of online patient education materials on chronic low back pain. Natl Med J India. 2024;37:124-130. doi:10.25259/NMJI_327_2022

14. Zheng, Tong, Wan. Quality and reliability of liver cancer-related short Chinese videos on TikTok and Bilibili: cross-sectional content analysis study. J Med Internet Res. 2023;25:e47210. doi:10.2196/47210

15. Yang, Liu, et al. Quality and reliability of pediatric pneumonia related short videos on mainstream platforms: cross-sectional study. BMC Public Health. 2025;25:1896. doi:10.1186/s12889-025-22963-2

16. Liu, Chen, Lin, et al. YouTube/Bilibili/TikTok videos as sources of medical information on laryngeal carcinoma: cross-sectional content analysis study. BMC Public Health. 2024;24:1594. doi:10.1186/s12889-024-19077-6

17. Guan, Xia, Zhao, et al. Videos in short-video sharing platforms as sources of information on colorectal polyps: cross-sectional content analysis study. J Med Internet Res. 2024;26:e51655. doi:10.2196/51655

18. Luo, Shu, et al. Quality and educational content of Douyin and TikTok short videos on early screening of rectal cancer. JGH Open. 2023;7:936-941. doi:10.1002/jgh3.13005

19. Ding, Kong, Sun, et al. Health information in short videos about metabolic dysfunction-associated steatotic liver disease: analysing quality and reliability. Liver Int. 2024;44:1373-1382. doi:10.1111/liv.15871

20. Peng, et al. Comparative analysis of NAFLD-related health videos on TikTok: a cross-language study in the USA and China. BMC Public Health. 2024;24:3375. doi:10.1186/s12889-024-20851-9

21. Sun, Zheng. Quality of information in gallstone disease videos on TikTok: cross-sectional study. J Med Internet Res. 2023;25:e39162. doi:10.2196/39162

22. Singh PP. YouTube for information on rheumatoid arthritis–a wakeup call? J Rheumatol. 2012;39:899-903. doi:10.3899/jrheum.111114

23. Silberg, Lundberg, Musacchio RA. Assessing, controlling, and assuring the quality of medical information on the Internet: Caveant lector et viewor–let the reader and viewer beware. JAMA. 1997;277:1244-1245.

24. Bernard, Langille, Hughes, Rose, Leddin, Veldhuyzen van Zanten. A systematic review of patient inflammatory bowel disease information resources on the World Wide Web. Am J Gastroenterol. 2007;102(9):2070-2077. doi:10.1111/j.1572-0241.2007.01325.x

25. Alvis-Guzman, Alvis-Zakzuk, De la Hoz Restrepo. How possible is the elimination of viral hepatitis? An analysis based on the global burden of disease from Hepatitis B and C, 1990–2019. Microorganisms. 2024;12:25. doi:10.3390/microorganisms12020388

26. Lei, Liao, Zhu. Quality and reliability evaluation of pancreatic cancer-related video content on social short video platforms: a cross-sectional study. BMC Public Health. 2025;25:1919. doi:10.1186/s12889-025-23130-3

27. Xie, Chen, Zhou, Jin. Quality and accuracy of cardiopulmonary resuscitation teaching in short videos: an analysis across three major short video platforms. BMC Med Educ. 2025;25:631. doi:10.1186/s12909-025-06776-w

28. Zhang, Jie, et al. Analyzing dissemination, quality, and reliability of Chinese brain tumor-related short videos on TikTok and Bilibili: a cross-sectional study. Front Neurol. 2024;15:1404038. doi:10.3389/fneur.2024.1404038

29. Chen, Wang, Huang, et al. The quality and reliability of short videos about thyroid nodules on BiliBili and TikTok: cross-sectional study. Digit Health. 2024;10:20552076241288831. doi:10.1177/20552076241288831

30. Zeng, Zhang, Wang, Zhang, Zhu. Douyin and Bilibili as sources of information on lung cancer in China through assessment and analysis of the content and quality. Sci Rep. 2024;14:20604. doi:10.1038/s41598-024-70640-y

31. Donzelli, Palomba, Federigi, et al. Misinformation on vaccination: a quantitative analysis of YouTube videos. Hum Vaccin Immunother. 2018;14:1654-1659. doi:10.1080/21645515.2018.1454572

32. Gao, Liu, Gao. Echo chamber effects on short video platforms. Sci Rep. 2023;13:6282. doi:10.1038/s41598-023-33370-1

33. Wang, Liang, Qiu, Wang. Evaluating the content and quality of videos related to hypertrophic scarring on TikTok in China: cross-sectional study. JMIR Infodemiology. 2025;5:e64792. doi:10.2196/64792

34. Hancı, Özduran, Gökel. YouTube as a source of information about Percutan tracheostomy. Gazi Med J. 2023;34(4):372-378. doi:10.12996/gmj.2023.77

35. Ozduran, Akkoc, Büyükçoban, Erkin, Hanci. Readability, reliability and quality of responses generated by ChatGPT, Gemini, and perplexity for the most frequently asked questions about pain. Medicine. 2025;104:e41780. doi:10.1097/MD.0000000000041780