Sage Journals: Discover world-class research

Abstract

Objective

The growing rehabilitation demands driven by global aging and the increasing prevalence of chronic diseases underscore the limitations of conventional therapies, highlighting robot-assisted training as a promising alternative. However, public awareness remains limited, and the quality of health information on short-video platforms is highly variable. This study aimed to evaluate the quality and reliability of Chinese-language videos pertaining to robot-assisted rehabilitation on Douyin and BiliBili.

Methods

A cross-sectional analysis was conducted on 5 February 2025, selecting the top 100 videos related to “robotic rehabilitation” from each of two platforms: Douyin and BiliBili. The Global Quality Score (GQS) and modified DISCERN tool (mDISCERN) were employed to assess the quality and reliability of the video content. Videos were categorized by their source and content type. Interaction metrics (likes, comments, shares, and favorites) and basic characteristics (such as upload time and duration) were also extracted. Cohen's κ test was used to evaluate inter-rater reliability. Spearman correlation analysis was performed to explore the relationships between variables. Additionally, multiple linear regression analysis was conducted to identify factors influencing the quality and reliability of the videos.

Results

A total of 200 videos were included in the study (100 from each platform). Videos on Douyin were shorter in duration but garnered significantly higher user engagement (all p < 0.001 for likes, comments, shares, and favorites), whereas videos on BiliBili were longer and featured more academic resource citations. Median scores for video quality and reliability on both platforms were moderate (GQS = 2, mDISCERN = 2), yet BiliBili had a higher proportion of high-quality videos. Content created by experts exhibited greater informational value, while commercially promoted videos demonstrated lower credibility. Multiple regression analysis revealed that sources from academic and medical institutions, content types focused on science communication/education and expert explanations, video duration, and daily share rate were significant positive predictors of both GQS and mDISCERN scores.

Conclusion

Short-video platforms serve as valuable channels for disseminating information on robot-assisted rehabilitation; however, significant variability in content quality necessitates critical evaluation by users. Individuals should verify the scientific validity of such information before applying it in health-related decision making.

Keywords

Robotics, rehabilitation, short videos, information quality

Introduction

Robot-assisted training (RAT) is a cutting-edge, exercise-based technology that utilizes computer-assisted devices to provide patients with targeted functional exercise patterns. These include repetitive training, intensive practice, and task-specific training formats.¹ RAT has been demonstrated to promote the remodeling of the musculoskeletal system, spinal motor neurons, and intraneuronal systems, enhancing neuroplasticity through goal-directed training programs. Consequently, it facilitates motor relearning, increases muscle strength, and improves movement disorders.^2,3 Additionally, RAT's dynamic human–machine interaction capabilities offer real-time training feedback, showcasing significant advantages in enhancing patient compliance and motivation compared to traditional rehabilitation methods.⁴ With the global aging population, alongside an increase in chronic disease sufferers and disabled individuals, the demand for rehabilitation services is on the rise.^5–8 Traditional rehabilitation approaches suffer from inefficiencies and lack personalization, failing to meet the diverse needs of patients.⁹ Robot-assisted rehabilitation offers personalized and precise rehabilitation solutions, significantly improving recovery efficiency.^10–12 In the realm of cardiopulmonary rehabilitation, robotic technologies have shown substantial benefits. They can design tailored exercise programs based on individual patient conditions and monitor vital physiological data in real-time during training sessions. 
Guiding patients through aerobic exercises and respiratory training greatly enhances cardiopulmonary function while alleviating fatigue during rehabilitation.¹³ A systematic review and network meta-analysis revealed that RAT combined with conventional rehabilitation therapies effectively improves upper limb motor functions and activities of daily living for stroke survivors.⁹ Moreover, some rehabilitation robots integrate virtual reality technology, employing gamified training modes to significantly boost patient engagement and motivation.¹⁴

Despite the promising potential of robot-assisted rehabilitation in enhancing recovery efficacy and patient functionality, its clinical application faces several challenges. Public perception and acceptance of robot-assisted healthcare remain relatively low due to varying viewpoints.¹⁵ Currently, public understanding of this technology largely relies on media reports and online information.¹⁶ The advent of internet technology has revolutionized how people access information, shifting from paper-based to electronic formats, which are now mainstream.¹⁷ Engaging, interesting, and visually appealing video content is particularly popular compared to lengthy traditional texts, contributing to the rapid rise of short video sharing platforms. This trend has led to a significant increase in the availability of health-related short videos, facilitating more efficient and widespread dissemination of health knowledge.^18,19 For instance, a foreign study found that social media-based interventions, such as a liver cancer prevention program via Kakao Talk, could promote hepatitis B virus monitoring and liver cancer prevention.²⁰ However, the unregulated upload of videos without scientific scrutiny often results in variable quality and reliability on these platforms. Misleading or deceptive information can thus be disseminated, increasing the risk of users making health decisions based on inaccurate information.²¹ Douyin and BiliBili, as two major short video platforms in China with extensive user bases and influence,^8,22 were evaluated in our study. We assessed the top 100 RAT-related videos on each platform using the Global Quality Score (GQS) for video quality and a modified DISCERN (mDISCERN) score for reliability. Our analysis also explored the correlation between video quality and sources, content, duration, likes, comments, shares, and saves. 
This study aims to evaluate the content, quality, and reliability of robot-assisted training (RAT)-related videos on Douyin and BiliBili. To our knowledge, this study is among the first focused analyses of RAT-related video content on short video platforms.^23–25 As pioneering research in this field, it addresses a critical gap in understanding how rehabilitation robotics is portrayed in digital media. This work highlights the need for accurate, high-quality health information in robot-assisted rehabilitation, setting a benchmark for future studies and emphasizing its growing role in patient care and recovery.

Methods

Ethical considerations

This cross-sectional study was completed on 5 February 2025. All data were extracted from publicly available videos on Douyin and BiliBili; no clinical records, human or animal specimens were used, and no personally identifiable information was collected. As there was no interaction with users, ethical review was waived in accordance with the exemption provisions of the Declaration of Helsinki.

Search strategy and data collection

In this cross-sectional study, we searched for videos related to “机器人” (Robotics) and “康复” (Rehabilitation) on Douyin and BiliBili on 5 February 2025, aiming to identify the top 100 robot-assisted rehabilitation videos on each platform (Figure 1 in the attachment).

Figure 1.

Data collection flow chart.

To minimize personalized recommendation algorithm bias, new user accounts were registered and logged into each platform, with no search filters applied. Videos were ranked via the platforms’ “comprehensive ranking mechanism,” which integrates video completion rate (proportion of users who watched >5 s), like rate, comment rate, follow rate, and upload time—prioritizing recent and popular content.

During screening, non-Chinese videos, duplicates (identical content from different uploaders), videos without uploader info or titles, and those unrelated to robot-assisted rehabilitation were excluded. The top 100 videos per platform were finally selected for analysis; this sample size was based on previous studies confirming negligible impact of videos beyond the top 100 on results.^26,27

To reduce temporal bias from content updates, baseline information of all included videos was uniformly extracted on 5 February 2025, including source, theme, duration (seconds), days since upload, and engagement metrics (e.g., favorites, likes, comments, and shares). All data were recorded in Microsoft Excel (Microsoft Corp.) and collected in strict adherence to Douyin and BiliBili's API policies, with no content downloaded or personal information stored.

Video classification

We categorized the videos into five groups based on their sources and six groups according to their content. The classification criteria for video sources are as follows: (1) medical institutions and related personnel, (2) academic institutions and research teams, (3) individual users, (4) business and advertising, and (5) news reports. For content categorization, the videos were classified into the following categories: (1) introduction to rehabilitation technologies, (2) demonstration of rehabilitation training, (3) patient experience sharing, (4) expert explanations and recommendations, (5) product promotion and publicity, and (6) science communication and education. Detailed classification criteria are shown in Table 1 in the annex. This methodology ensures a systematic approach to analyzing the videos, providing clear distinctions based on both origin and subject matter. Such categorizations facilitate a nuanced understanding of the distribution and nature of robot-assisted rehabilitation content across different platforms, thereby enhancing the robustness of our analysis. The delineation of specific criteria for each category aims to uphold the standards of accuracy and reliability expected in scientific inquiry.

Table 1.

Video classification.

Classification	Description
Video sources
Healthcare institutions and professionals	Videos uploaded by officially certified accounts of hospitals and rehabilitation centers, as well as those created by healthcare professionals (e.g., physicians and therapists) who are explicitly identified through platform-verified credentials (e.g., “MD,” “PT,” and “OT”) or introduced with full affiliation details within the video
Academic institutions and research teams	Videos produced by universities, research institutes, or affiliated researchers, included if the account or video content clearly indicated institutional affiliation, academic titles, or research context
Individual users	Videos from accounts without institutional or professional affiliation, typically representing personal experiences, opinions, or nonprofessional demonstrations
Commercial and advertising entities	Content explicitly promoting commercial products or services, including videos from company accounts, sponsored creators, or those with purchase links/advertisements
News reports	Videos produced by official media organizations or news agencies reporting on robot-assisted rehabilitation
Video content
Introduction to rehabilitation technologies	Introduces the basic principles, application scenarios, and advantages of robot-assisted rehabilitation technology; shows the specific functions of rehabilitation robots, such as upper limb rehabilitation, lower limb rehabilitation, and cognitive rehabilitation
Demonstration of rehabilitation training	Shows the specific operation steps and precautions of rehabilitation training; records patients training with robotic rehabilitation equipment
Patient experience sharing	Records patients' feelings and outcomes after using robot-assisted rehabilitation equipment; shows patients' progress and results during the rehabilitation process
Expert explanations and recommendations	Explanations and suggestions from rehabilitation experts on robot-assisted rehabilitation
Product promotion and publicity	Equipment manufacturers introduce or promote rehabilitation robots
Science communication and education	Introduces relevant knowledge about rehabilitation robots to facilitate public understanding

Video quality assessment

The quality of information in the videos was assessed using the GQS, while reliability was evaluated with the mDISCERN tool, both of which have been validated in previous studies.^28,29 GQS evaluates dimensions such as quality, flow, comprehensiveness, and usefulness. It consists of five criteria rated on a scale from 1 to 5, with higher scores indicating superior quality.³⁰ The mDISCERN tool, designed to assess the reliability and quality of videos, comprises five questions, each scored with 1 point for “yes” and 0 points for “no,” resulting in a total score ranging from 0 to 5. Detailed grading criteria for GQS and mDISCERN are presented in appendix Tables 2 and 3, respectively. Scores from GQS and mDISCERN were categorized into five levels as presented in appendix Table 4. Video links were provided in tabular format to two raters, both rehabilitation physicians with expertise in therapeutic interventions. To minimize rating bias, the video links were presented in a randomized order. Before evaluating the videos, both raters thoroughly reviewed the scoring details of GQS and mDISCERN. Working independently and in parallel, they watched the videos, scored them, and classified them by source and content. Where the two raters' scores differed, the discrepancy was discussed with an additional observer until consensus was reached.

Table 2.

Global quality score (GQS) (1–5 points).

GQS score content	Score
Poor quality, poor flow of the site, most information missing, not at all useful for patients	1
Generally poor quality and poor flow, some information listed but many important topics missing, of very limited use to patients	2
Moderate quality, suboptimal flow, some important information is adequately discussed but others poorly discussed, somewhat useful for patients	3
Good quality and generally good flow, most of the relevant information is listed, but some topics not covered, useful for patients	4
Excellent quality and excellent flow, very useful for patients	5

Table 3.

mDISCERN (1 point for a yes answer and 0 points for a no answer).

mDISCERN score content
1. Are the aims clear and achieved?
2. Are reliable sources of information used?
3. Is the information presented balanced and unbiased?
4. Are additional sources of information listed for patient reference?
5. Are areas of uncertainty mentioned?

GQS: Global Quality Score; mDISCERN: modified DISCERN.
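The mDISCERN total described above is simply the number of “yes” answers to the five questions in Table 3. A minimal sketch of that tally follows; the function name `mdiscern_score` and the example answers are illustrative assumptions, not part of the study's materials.

```python
# The five mDISCERN questions, copied from Table 3 in order.
MDISCERN_QUESTIONS = (
    "Are the aims clear and achieved?",
    "Are reliable sources of information used?",
    "Is the information presented balanced and unbiased?",
    "Are additional sources of information listed for patient reference?",
    "Are areas of uncertainty mentioned?",
)


def mdiscern_score(answers):
    """Return the mDISCERN total: 1 point per "yes" (True), 0 per "no" (False)."""
    if len(answers) != len(MDISCERN_QUESTIONS):
        raise ValueError("expected one yes/no answer per mDISCERN question")
    return sum(1 for answer in answers if answer)


# Hypothetical rating of one video (not taken from the study data).
example_answers = [True, False, True, False, False]
example_total = mdiscern_score(example_answers)
print(example_total)
```

The total therefore ranges from 0 (all “no”) to 5 (all “yes”).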

Table 4.

Levels corresponding to GQS and mDISCERN scores.

Scale/score	Level
GQS
1	Very poor
2	Poor
3	Fair
4	Good
5	Excellent
mDISCERN
1	Unreliable
2	Less reliable
3	Fairly reliable
4	Relatively reliable
5	Reliable

GQS: Global Quality Score; mDISCERN: modified DISCERN.

Statistical analysis

The Shapiro–Wilk test was used to assess the normality of continuous variables. Normally distributed continuous variables are reported as mean ± standard deviation (SD), whereas non-normally distributed continuous variables are reported as median and interquartile range (IQR; 25th to 75th percentiles), with minimum and maximum values. Differences between groups were assessed with the Mann–Whitney U test. Cohen's κ was used to quantify agreement between the two raters, and Spearman correlation analysis was used to evaluate relationships between quantitative variables. P < 0.05 was considered statistically significant. All statistical analyses were performed in SPSS 26. Because videos differed in lifespan (“Days since published”), raw cumulative interactions (e.g., total likes) were biased. To control for this temporal confounding, we calculated daily average rates (e.g., likes per day = total likes / days since published), enabling a fairer comparison of engagement efficiency.
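The daily-rate standardization described above can be sketched in a few lines. This is only an illustration: the helper name `per_day_rate`, the example video, and the decision to reject videos with zero days since publication are assumptions, since the text does not say how same-day uploads were handled.

```python
def per_day_rate(total, days_since_published):
    """Daily average rate = cumulative count / days since published."""
    if days_since_published <= 0:
        # Assumption: same-day uploads are rejected rather than divided by zero.
        raise ValueError("days_since_published must be positive")
    return total / days_since_published


# Hypothetical video (values are illustrative, not from the study data).
video = {
    "likes": 600,
    "comments": 60,
    "shares": 150,
    "saves": 120,
    "days_since_published": 300,
}

rates = {
    metric: per_day_rate(video[metric], video["days_since_published"])
    for metric in ("likes", "comments", "shares", "saves")
}
print(rates)
```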

Results

Video features

In this study, a total of 200 videos were retrieved and analyzed based on keyword searches, with 100 videos sourced from Douyin and another 100 from BiliBili. As summarized in Table 5, the general characteristics of these videos revealed that Douyin videos garnered significantly more likes, comments, shares, and saves compared to those on BiliBili (all P < 0.001). In contrast, videos on BiliBili were notably longer in duration (P < 0.001) and had been published for a significantly greater number of days (P < 0.001) than those on Douyin. Additionally, the distribution of GQS was wider on BiliBili, suggesting a potentially higher proportion of videos with superior quality on this platform.

Table 5.

Characteristics of the videos in Douyin and BiliBili.

Variable	Douyin (n = 100), median (IQR)	BiliBili (n = 100), median (IQR)	Wilcoxon rank-sum test z score	Wilcoxon rank-sum test P value
Likes	614 (85.5–5018.5)	9.50 (3.00–30.75)	−10.316	＜0.001
Comments	66.5 (12–298.75)	1.00 (0.00–6.75)	−9.608	＜0.001
Shares	142 (23.25–509.5)	10.00 (2.00–27.75)	−7.656	＜0.001
Saves	149 (26.5–612)	20 (3.25–44.75)	−6.723	＜0.001
Days since published	251 (73–549)	815.50 (375.25–1292.25)	−6.458	＜0.001
Duration	48.5 (24–103)	158 (81.5–1573)	−7.069	＜0.001
GQS score	2 (1–2)	2 (2–3)	−2.118	＜0.05
mDISCERN score	2 (1–2)	2 (1–2)	−1.928	＜0.05

GQS: Global Quality Score; IQR: interquartile range; mDISCERN: modified DISCERN.

However, given the substantial difference in video longevity (“Days since published,” Table 5), direct comparison of raw interaction counts may be biased, as older videos inherently have more time to accumulate engagement. To account for this, daily average interaction metrics were calculated and compared (Table 6). After standardization by time, Douyin videos still exhibited significantly higher per-day averages in likes, comments, shares, and saves (all P < 0.001). This refined analysis confirms that user engagement with rehabilitation robot-related videos remains higher on Douyin even after adjusting for the time since publication.

Table 6.

Daily interaction metrics data across platforms.

Variable	Douyin (n = 100), median (IQR)	BiliBili (n = 100), median (IQR)	Wilcoxon rank-sum test z score	Wilcoxon rank-sum test P value
Per-day average likes	4.361 (1.965–8.957)	0.010 (0.007–0.026)	−11.682	＜0.001
Per-day average comments	0.308 (0.192–0.625)	0.001 (0–0.004)	−11.417	＜0.001
Per-day average shares	0.557 (0.381–0.948)	0.012 (0.004–0.032)	−11.918	＜0.001
Per-day average saves	0.724 (0.480–1.064)	0.026 (0.009–0.048)	−11.783	＜0.001

IQR: interquartile range.

An analysis of Tables 7 and 8 and Figure 2 in the attachment uncovers significant differences in video sources and content features between Douyin and BiliBili. On Douyin, individual users generate most videos (58%), while medical institutions, academic institutions, news reports, and business sources contribute far less. Conversely, BiliBili's content distribution is more balanced, with heavier contributions from medical and academic institutions. Douyin, with its core focus on short-duration, high-frequency interactions, excels particularly in sharing (with a median of 1499 shares for medical content) and saving (a median of 928 saves for medical content). The platform's content tends to favor lightweight, rapidly disseminated formats, typically ranging from 41.5 to 146 seconds in length, with a relatively brief lifecycle (median days since publication concentrated between 84 and 789 days), aligning well with its entertainment-oriented, fragmented-content positioning. In contrast, BiliBili places a greater emphasis on depth and professional engagement. Videos from academic and medical institutions are notably longer (with a median length of 2690 seconds for medical institution videos). Users on BiliBili demonstrate a stronger inclination towards commenting (a median of 13.5 comments for medical institution videos) and long-term saving (a median of 107.5 saves for medical content). Additionally, the quality of content, as assessed by GQS and mDISCERN scores, is generally higher on BiliBili (for instance, a median GQS score of 3 for academic institution videos), reflecting user recognition of professional content. Moreover, BiliBili exhibits a pronounced long-tail effect in content dissemination (with a median of 1536 days since publication for medical institution videos). The platform's ecosystem fosters community-based discussions and the dissemination of high-quality knowledge, whereas Douyin relies more on instant dissemination and high-efficiency interactions.

Figure 2.

Percentage of rehabilitation robot videos from different sources and with different content on Douyin and BiliBili. (A) Sources of Douyin videos; (B) sources of BiliBili videos; (C) content classification of Douyin videos; and (D) content classification of BiliBili videos.

Table 7.

Video features of video sources and content in Douyin.

Variable	Likes	Comments	Share	Save	Days since published	Duration	GQS score	mDISCERN score
Video source, median (IQR)
Individual user (n = 58)	431.5 (53–4319.5)	42.5 (9.75–251)	87 (10.25–389.75)	104 (11–451.25)	216.5 (56.75–500)	41.5 (17–86.75)	2 (2–2.25)	2 (1–2)
Business and advertising(n = 21)	112 (65–4626)	13 (10.5–298.5)	25 (16–495)	29 (15.5–546.5)	84 (67–537.5)	24 (18.5–99.5)	1(1–2)	1(1–2)
News report(n = 10)	3247.5(1158.75–22750)	199 (82.75–1256.25)	315.5 (157.75–1631.5)	347(196.25–1095.75)	436.5(285.75–823)	82(62.75–200.25)	2(1.75–2)	2 (1–2)
Academic institutions and research teams(n = 4)	2546.5 (1090.75–4617.25)	163(91.5–278)	259(161.75–468.75)	300(196.25–562.75)	398(279.5–534.5)	75(60–97.5)	2(1.25–2)	1(1–1.75)
Medical institutions and related personnel(n = 7)	17000 (1637–88000)	813 (101–3728)	1499 (175–8225)	928 (237–3941)	789 (306–1352)	146 (67–321)	2 (1–3)	2 (1–2)
Video content, median (IQR)
Product promotion and publicity(n = 16)	82.5(59–161.5)	12(10–17.5)	22(13.25–36.25)	26(13.25–48.75)	73(64–97.25)	23.5(18–31.5)	1.5(1–2.75)	2 (1–2)
Patient experience sharing(n = 14)	5347.5(1159.75–22000)	393(66.75–1230.75)	553(119.75–1621.75)	725(166–1060.5)	598.5(225.5–815.5)	116.5(51–200.25)	2(1.75–2)	2 (1–2)
Introduction to rehabilitation technologies (n = 13)	367(106–537.5)	38(12.5–55)	76(21.5–119.5)	98(25–143.5)	156(77.5–240)	40(23–47)	2(1.5–2)	1(1–2)
Demonstration of rehabilitation training (n = 37)	3452(901.5–6555.5)	235(81–485)	355(153.5–889.5)	374(183.5–815.5)	475(265–729.5)	85(58–128)	2 (1–2)	1(1–2)
Science communication and education (n = 19)	358(42–3025)	37(7–142)	75(7–266)	88(9–287)	150(30–376)	39(15–78)	2 (2–3)	2 (1–2)
Expert explanations and recommendations (n = 1)	53	9	8	11	50	17	2	2

GQS: Global Quality Score; IQR: interquartile range; mDISCERN: modified DISCERN.

Table 8.

Video features of video sources and content in BiliBili.

Variable	Likes	Comments	Share
Video source, median (IQR)
Individual user (n = 26)	14 (4.5–26.25)	1.5 (0–4.25)	17 (4.25–28.25)
Business and advertising (n = 21)	2 (1–12.5)	0 (0–1.5)	0 (0–16.5)
News report (n = 7)	6(5–30)	1(0–6)	8(5–43)
Academic institutions and research teams (n = 30)	10(3.75–19)	0.5(0–4.75)	3.5(3–16.25)
Medical institutions and related personnel (n = 16)	64(3–136.5)	13.5(0–34.25)	59.5(1.25–96.75)
Video content, median (IQR)
Product promotion and publicity (n = 23)	2(1–15)	0(0–2)	1(0–19)
Patient experience sharing (n = 4)	3(0–6.75)	0(0.5–1)	4.5(0–9.75)
Introduction to rehabilitation technologies (n = 15)	14(2–30)	1(0–6)	16(1–43)
Demonstration of rehabilitation training (n = 8)	23.5(6–53.25)	4(1–10.5)	27.5(8.5–54)
Science communication and education (n = 35)	14(4–81)	1(0–20)	12(3–71)
Expert explanations and recommendations (n = 15)	11(4–31)	1(0–13)	4(2–18)

GQS: Global Quality Score; IQR: interquartile range; mDISCERN: modified DISCERN.

Video quality and reliability assessment

In this study, the κ value used to assess interobserver reliability was 0.78, indicating a high degree of agreement between the two observers. Based on Table 9 and Figure 3 in the attachment, clear disparities are observed between Douyin and BiliBili in the GQS and mDISCERN ratings of short videos related to rehabilitation robots. In terms of GQS, 79% of videos on Douyin were categorized as low quality (“very poor” or “poor”), compared to 51% on BiliBili. BiliBili had more videos in the “fair,” “good,” and “excellent” categories, and Douyin had no videos in the “excellent” category. Regarding mDISCERN scores, 97% of videos on Douyin were rated as having low reliability (“unreliable” or “less reliable”), whereas this figure was 87% for BiliBili. Videos on BiliBili showed greater reliability, with more videos in the “fairly reliable,” “relatively reliable,” and “reliable” categories, while no videos on Douyin achieved the “relatively reliable” or “reliable” rating. Overall, BiliBili outperforms Douyin in both video quality and information reliability, possibly because of differences in content review mechanisms or user demographics.
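For readers unfamiliar with Cohen's κ, the sketch below shows how an unweighted κ is computed from two raters' scores. It is an illustration only: the text does not state whether a weighted or unweighted κ was used, and the function name `cohens_kappa` and the example ratings are assumptions, not study data.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: share of items given the same score by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal proportions per category.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1 - expected)


# Hypothetical GQS ratings for ten videos (not taken from the study).
rater_1 = [1, 2, 2, 3, 2, 1, 4, 2, 3, 2]
rater_2 = [1, 2, 2, 3, 1, 1, 4, 2, 2, 2]
kappa = cohens_kappa(rater_1, rater_2)
print(round(kappa, 3))
```

In this toy example the raters agree on 8 of 10 videos, chance agreement is 0.34, and κ = (0.80 − 0.34) / (1 − 0.34).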

Figure 3.

Statistical analysis of GQS scores and mDISCERN scores of short videos related to rehabilitation robots on Douyin and BiliBili: (A) comparison of GQS scores of videos on Douyin and BiliBili; (B) GQS scores of videos on Douyin and BiliBili; (C) comparison of mDISCERN scores of videos on Douyin and BiliBili; (D) mDISCERN scores of videos on Douyin and BiliBili. GQS: Global Quality Score; mDISCERN: modified DISCERN.

Table 9.

GQS and mDISCERN scores for Douyin and BiliBili videos related to rehabilitation robots.

Scale, score	Douyin (n = 100)	BiliBili (n = 100)
GQS
Very poor	27	22
Poor	52	29
Fair	19	28
Good	2	7
Excellent	0	4
mDISCERN
Unreliable	49	38
Less reliable	48	49
Fairly reliable	3	7
Relatively reliable	0	3
Reliable	0	3

GQS: Global Quality Score; mDISCERN: modified DISCERN.
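The low-quality and low-reliability percentages quoted in the text can be reproduced from the two lowest levels of each scale in Table 9. The sketch below assumes, as the text states, a denominator of 100 videos per platform; the variable and function names are illustrative.

```python
N_PER_PLATFORM = 100  # videos per platform, as stated in the text

# Counts in the two lowest levels of each scale, copied from Table 9:
# GQS "Very poor" and "Poor"; mDISCERN "Unreliable" and "Less reliable".
low_counts = {
    ("GQS", "Douyin"): (27, 52),
    ("GQS", "BiliBili"): (22, 29),
    ("mDISCERN", "Douyin"): (49, 48),
    ("mDISCERN", "BiliBili"): (38, 49),
}


def low_share(counts, n=N_PER_PLATFORM):
    """Percentage of videos falling in the two lowest levels of a scale."""
    return 100 * sum(counts) / n


low_percentages = {key: low_share(counts) for key, counts in low_counts.items()}
for (scale, platform), pct in low_percentages.items():
    print(f"{scale} on {platform}: {pct:.0f}% in the two lowest levels")
```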

Spearman correlation analysis

To investigate the associations between video metrics (interaction data, longevity, and duration) and quality scores (GQS and mDISCERN), Spearman correlation analyses were conducted on both the raw metrics (Table 10) and the standardized daily interaction data (Table 11). A Bonferroni correction was applied to adjust for multiple comparisons, resulting in a revised significance threshold of α’ = 0.0018. Only correlations with P-values below this threshold were deemed statistically significant.
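The corrected threshold follows from dividing α = 0.05 by the number of comparisons, k = 28. The sketch below assumes that k counts the unordered pairs among the eight variables in each correlation table, which is consistent with the reported k = 28 but is not stated explicitly in the text; the names are illustrative.

```python
from math import comb

ALPHA = 0.05
# Assumption: the eight variables in each correlation table (likes, comments,
# shares, saves, days since published, duration, GQS, mDISCERN).
N_VARIABLES = 8

n_comparisons = comb(N_VARIABLES, 2)    # number of unordered variable pairs
alpha_corrected = ALPHA / n_comparisons  # Bonferroni-corrected threshold


def is_significant(p_value, threshold=alpha_corrected):
    """True when a P-value falls below the Bonferroni-corrected threshold."""
    return p_value < threshold


print(n_comparisons, round(alpha_corrected, 4))
```

Under this threshold a correlation with P = 0.002 is not counted as significant, even though it would pass the uncorrected 0.05 level.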

Table 10.

Spearman correlation analysis of Douyin and BiliBili short video platform (N = 200).

Variable		Likes	Comments	Share	Save	Days since published	Duration	GQS score	mDISCERN score
Likes	r	-	0.988*	0.949*	0.916*	0.174	0.127	−0.015	0.021
Likes	P	-	＜0.001	＜0.001	＜0.001	＜0.05	0.073	0.838	0.771
Comments	r	0.988*	-	0.947*	0.914*	0.206*	0.158	−0.009	0.043
Comments	P	＜0.001	-	＜0.001	＜0.001	＜0.001	＜0.05	0.899	0.543
Share	r	0.949*	0.947*	-	0.990*	0.431*	0.390*	−0.031	−0.013
Share	P	＜0.001	＜0.001	-	＜0.001	＜0.001	＜0.001	0.666	0.853
Save	r	0.916*	0.914*	0.990*	-	0.509*	0.471*	−0.034	−0.023
Save	P	＜0.001	＜0.001	＜0.001	-	＜0.001	＜0.001	0.631	0.741
Days since published	r	0.174	0.206*	0.431*	0.509*	-	0.995*	0.065	0.146
Days since published	P	＜0.05	＜0.001	＜0.001	＜0.001	-	＜0.001	0.364	0.002
Duration	r	0.127	0.158	0.390*	0.471*	0.995*	-	0.078	0.099
Duration	P	0.073	＜0.05	＜0.001	＜0.001	＜0.001	-	0.267	0.165
GQS score	r	−0.015	−0.009	−0.031	−0.034	0.065	0.078	-	0.079
GQS score	P	0.838	0.899	0.666	0.631	0.364	0.267	-	0.267
mDISCERN score	r	0.021	0.043	−0.013	−0.023	0.146	0.099	0.079	-
mDISCERN score	P	0.771	0.543	0.853	0.741	0.002	0.165	0.267	-

After Bonferroni correction, the significance level is α’ = α/k = 0.05/28 ≈ 0.0018; “*” indicates statistically significant results.

GQS: Global Quality Score; mDISCERN: modified DISCERN.

Table 11.

Spearman correlation analysis of the daily interaction data of Douyin and BiliBili short video platform (N = 200).

Variable		Per-day average likes	Per-day average comments	Per-day average shares	Per-day average Save	Days since published	Duration	GQS score	mDISCERN score
Per-day average likes	r	-	0.967*	0.985*	0.983*	−0.233*	−0.009	−0.06	0.011
Per-day average likes	P	-	＜0.001	＜0.001	＜0.001	＜0.001	0.9	0.402	0.875
Per-day average comments	r	0.967*	-	0.954*	0.953*	−0.237*	−0.044	−0.075	0.015
Per-day average comments	P	＜0.001	-	＜0.001	＜0.001	＜0.001	0.533	0.292	0.832
Per-day average shares	r	0.985*	0.954*	-	0.987*	−0.241*	−0.029	−0.092	−0.029
Per-day average shares	P	＜0.001	＜0.001	-	＜0.001	＜0.001	0.684	0.194	0.687
Per-day average Save	r	0.983*	0.953*	0.987*	-	−0.249*	−0.037	−0.102	−0.04
Per-day average Save	P	＜0.001	＜0.001	＜0.001	-	＜0.001	0.599	0.152	0.57
Days since published	r	−0.233*	−0.237*	−0.241*	−0.249*	-	0.177	0.28*	0.281*
Days since published	P	＜0.001	＜0.001	＜0.001	＜0.001	-	0.012	＜0.001	＜0.001
Duration	r	−0.009	−0.044	−0.029	−0.037	0.177	-	0.406*	0.345*
Duration	P	0.9	0.533	0.684	0.599	0.012	-	＜0.001	＜0.001
GQS score	r	−0.06	−0.075	−0.092	−0.102	0.28*	0.406*	-	0.711*
GQS score	P	0.402	0.292	0.194	0.152	＜0.001	＜0.001	-	＜0.001
mDISCERN score	r	0.011	0.015	−0.029	−0.04	0.281*	0.345*	0.711*	-
mDISCERN score	P	0.875	0.832	0.687	0.57	＜0.001	＜0.001	＜0.001	-

After Bonferroni correction, the significance level is α’ = α/k = 0.05/28 ≈ 0.0018; “*” indicates statistically significant results.

GQS: Global Quality Score; mDISCERN: modified DISCERN.

Analysis of the raw interaction metrics (Table 10) revealed strong positive correlations among likes, comments, shares, and saves (all P < 0.0018). A notably high correlation was also found between “Days since published” and “Duration” (P < 0.0018). However, no significant correlations were observed between any raw interaction metric and the quality scores (GQS and mDISCERN) after correction. Additionally, raw interaction counts showed significant positive correlations with both “Days since published” and “Duration” (e.g., Shares vs. Days since published: P < 0.0018; Saves vs. Duration: P < 0.0018), indicating that videos available for a longer period accrued higher absolute interaction numbers.

Analysis of the standardized daily interaction data (Table 11), which accounts for variations in video lifespan, demonstrated that daily averages of likes, comments, shares, and saves remained strongly intercorrelated (all P < 0.0018). The correlation between “Days since published” and “Duration” remained significant (P < 0.0018). Importantly, after temporal standardization, no significant correlations were found between any daily interaction metric and the mDISCERN score—all corresponding P-values exceeded the corrected threshold. Similarly, no significant correlations emerged between the standardized daily metrics and the GQS score. A strong positive correlation was confirmed between GQS and mDISCERN scores (P < 0.0018).
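The coefficients in Tables 10 and 11 are Spearman rank correlations. As a rough illustration of the technique, the sketch below computes Spearman's ρ as the Pearson correlation of average ranks, which handles ties. The study itself used SPSS; the function names and example values here are hypothetical.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cov / (var_x * var_y) ** 0.5


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical durations (seconds) and GQS scores for five videos.
durations = [30, 45, 60, 120, 300]
gqs_scores = [1, 2, 2, 3, 4]
rho = spearman_rho(durations, gqs_scores)
print(round(rho, 3))
```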

Regression analysis

GQS score regression analysis

Based on multiple linear regression analyses, several factors were identified as significant predictors of GQS (Tables 12 and 13). Videos from academic institutions (β = 0.700, P = 0.003) and medical institutions (β = 0.615, P = 0.004) demonstrated significantly higher GQS scores compared to other sources. In terms of content type, science communication videos (β = 1.000, P < 0.001) and expert explanations (β = 1.090, P = 0.002) showed the strongest positive effects on quality scores. The analysis of standardized daily metrics revealed that the daily average share rate was a particularly strong predictor (β = 0.840, P < 0.001), indicating that frequently shared videos were associated with higher quality ratings. Video duration maintained a consistent positive relationship with GQS across both models (β = 0.00045, P < 0.001 for raw metrics; β = −0.000309, P = 0.000206 for daily metrics), while longer time since publication showed a negative association (β = −0.000848, P < 0.001 for raw metrics; β = −0.00076, P = 0.0022 for daily metrics). Analysis of variance (ANOVA) results confirmed the overall significance of these models (Tables 14 and 15), with platform (P = 0.000999), video source (P < 0.001), video content (P < 0.001), and duration (P < 0.001) all contributing significantly to explaining variance in GQS scores.

Table 12.

Multiple linear regression analysis of GQS scores.

Variable	Estimate (β)	Standard error	t	P	Significance
BiliBili	1.62E-01	1.72E-01	0.944	0.346184
Video Source-Business and Advertising	2.95E-01	1.99E-01	1.478	0.141012
Video Source-News Reports	1.97E-01	2.10E-01	0.941	0.347883
Video Source-Academic Institutions and Research Teams	7.00E-01	2.33E-01	3.004	0.003035	**
Video Source-Medical Institutions and Related Personnel	6.15E-01	2.09E-01	2.941	0.003697	**
Video Content-Patient Experience Sharing	6.87E-01	2.69E-01	2.558	0.011344	*
Video Content-Rehabilitation Technology Introduction	6.30E-01	2.45E-01	2.569	0.010993	*
Video Content-Rehabilitation Training Demonstration	4.32E-01	2.41E-01	1.796	0.074152
Video Content-Popular Science Education	1.00E+00	2.30E-01	4.366	2.12E-05	***
Video Content-Expert Explanations and Recommendations	1.09E+00	3.45E-01	3.166	0.001809	**
Likes	−1.33E-05	1.92E-05	−0.693	0.489056
Comments	1.06E-03	4.82E-04	2.197	0.029263	*
Shares	−3.36E-05	5.86E-05	−0.573	0.567318
Save	−4.06E-04	2.98E-04	−1.363	0.174662
Days since published	−8.48E-04	2.52E-04	−3.360	0.000949	***
Duration	4.50E-04	1.24E-04	3.623	0.000378	***

GQS: Global Quality Score.
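The tables do not define their significance markers. The rows are consistent with the widely used convention of three thresholds (0.001, 0.01, 0.05), which the following sketch encodes; treat the thresholds as an assumption rather than something the source states. The function name `significance_stars` is hypothetical.

```python
def significance_stars(p_value: float) -> str:
    """Map a P-value to the conventional star notation (assumed thresholds)."""
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("P-value must lie in [0, 1]")
    if p_value < 0.001:
        return "***"
    if p_value < 0.01:
        return "**"
    if p_value < 0.05:
        return "*"
    return ""


# P-values copied from Table 12 rows, paired with the printed markers.
examples = {
    0.003035: "**",   # Academic Institutions and Research Teams
    0.011344: "*",    # Patient Experience Sharing
    0.000949: "***",  # Days since published
    0.074152: "",     # Rehabilitation Training Demonstration
}
for p, printed in examples.items():
    print(p, repr(significance_stars(p)), repr(printed))
```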

Table 13.

Multiple linear regression analysis of GQS scores (average per day).

Variable	Estimate (β)	Standard error	t	P	Significance
BiliBili	0.139027	0.2307687	0.606	0.545102
Video Source-Business and Advertising	0.259605	0.1963468	1.323	0.187667
Video Source-News Reports	0.1544806	0.2073644	0.745	0.456428
Video Source-Academic Institutions and Research Teams	0.6069445	0.2291840	2.641	0.009204	**
Video Source-Medical Institutions and Related Personnel	0.6077945	0.2025735	3.001	0.003746	**
Video Content-Patient Experience Sharing	0.6409306	0.2721699	2.593	0.019588	*
Video Content-Rehabilitation Technology Introduction	0.6363946	0.2445497	2.603	0.009497	**
Video Content-Rehabilitation Training Demonstration	0.4240380	0.2420498	1.762	0.079438
Video Content-Popular Science Education	0.9832505	0.2297535	4.331	0.0001	***
Video Content-Expert Explanations and Recommendations	0.8623262	0.3358348	2.568	0.011515	*
Per-day average likes	0.0032092	0.0241088	0.126	0.900116
Per-day average comments	−0.0397308	0.3511320	−0.113	0.910191
Per-day average shares	0.8397260	0.1065923	7.880	0.0001	***
Per-day average save	−0.4298807	0.3649855	−1.178	0.240405
Days since published	−0.0007575	0.0002438	−3.108	0.002188	**
Duration	−0.0003090	0.00001032	−3.787	0.000206	***

GQS: Global Quality Score.
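In an ordinary least-squares table, the t statistic is the estimate divided by its standard error, so the three columns can be cross-checked by simple arithmetic. The sketch below applies that check to a few rows copied from Tables 12 and 13. The 2% tolerance and the names `implied_t` and `is_consistent` are assumptions chosen for illustration; a flagged row may reflect a transcription or typesetting issue in one of the three columns rather than a modelling problem.

```python
from typing import List, Tuple

# (label, estimate, standard error, reported t), copied from the tables.
ROWS: List[Tuple[str, float, float, float]] = [
    ("Table 12: Academic Institutions", 7.00e-01, 2.33e-01, 3.004),
    ("Table 12: Medical Institutions", 6.15e-01, 2.09e-01, 2.941),
    ("Table 12: Days since published", -8.48e-04, 2.52e-04, -3.360),
    ("Table 12: Duration", 4.50e-04, 1.24e-04, 3.623),
    ("Table 13: Duration", -0.0003090, 0.00001032, -3.787),
]


def implied_t(estimate: float, std_error: float) -> float:
    """t statistic implied by an estimate and its standard error."""
    return estimate / std_error


def is_consistent(estimate: float, std_error: float, reported_t: float,
                  rel_tol: float = 0.02) -> bool:
    """True if the implied t is within rel_tol of the reported t."""
    return abs(implied_t(estimate, std_error) - reported_t) <= rel_tol * abs(reported_t)


flagged = [label for label, est, se, t in ROWS if not is_consistent(est, se, t)]
for label, est, se, t in ROWS:
    print(f"{label}: implied t = {implied_t(est, se):.3f}, reported t = {t}")
print("Rows to double-check:", flagged)
```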

Table 14.

ANOVA results of GQS scores.

Variable	F	P	Significance
Platform	11.1875	0.000999	***
Video Source	10.6893	8.30E-08	***
Video Content	5.4905	9.82E-05	***
Likes	0.2992	0.5850613
Comments	2.928	0.0887517
Shares	1.2375	0.2674128
Save	0.8284	0.3639244
Days since published	0.4634	0.4969049
Duration	13.1237	0.0003776	***

ANOVA: analysis of variance; GQS: Global Quality Score.

Table 15.

ANOVA results of GQS scores (average per day).

Variable	F	P	Significance
Platform	11.4505	0.0008744	***
Video Source	15.7407	0.0001042	***
Video Content	4.6200	0.0005399	***
Per-day average likes	0.1324	0.7164302
Per-day average comments	10.8431	0.0011953	**
Per-day average shares	0.1529	0.6961336
Per-day average save	0.0148	0.9038772
Days since published	1.5545	0.2140653
Duration	10.9800	5.24e-08	***

ANOVA: analysis of variance; GQS: Global Quality Score.

mDISCERN score regression analysis

Based on the multiple linear regression analyses of mDISCERN scores (Tables 16 and 17), several significant predictors of information reliability were identified. Commercial and advertising sources showed a substantial positive effect (β = 0.452, P = 0.007), alongside academic institutions (β = 0.399, P = 0.041) and medical institutions (β = 0.447, P = 0.011). Among content types, expert explanations and recommendations had the largest positive coefficient (β = 1.230, P < 0.001), followed by science communication and education content (β = 0.863, P < 0.001). Video duration consistently showed a positive relationship with mDISCERN scores across both the raw (β = 0.000476, P < 0.001) and standardized models (β = 0.045, P < 0.001), while longer time since publication exhibited a negative association (β = −0.000845, P < 0.001 for raw metrics; β = −0.00078, P < 0.001 for daily metrics). In the model based on standardized daily metrics, daily average comments were a significant negative predictor of reliability scores (β = −0.635, P = 0.034). ANOVA results (Tables 18 and 19) confirmed the overall significance of these relationships, with platform (P < 0.001), video source (P < 0.001), video content (P < 0.001), and duration (P < 0.001) all contributing significantly to explaining variance in mDISCERN scores. These findings indicate that professionally sourced, educational content in longer formats tends to receive higher reliability ratings, while longer exposure time appears to negatively impact perceived reliability.

Table 16.

Multiple linear regression analysis of mDISCERN scores.

Variable	Estimate (β)	Standard error	t	P	Significance
BiliBili	0.2094	0.143	1.465	0.14471
Video Source-Business and Advertising	0.4519	0.1658	2.725	0.00705	**
Video Source-News Reports	0.1524	0.1744	0.874	0.38332
Video Source-Academic Institutions and Research Teams	0.3987	0.1937	2.058	0.04101	*
Video Source-Medical Institutions and Related Personnel	0.4469	0.174	2.569	0.01101	*
Video Content-Patient Experience Sharing	0.6467	0.2235	2.894	0.00426	**
Video Content-Rehabilitation Technology Introduction	0.4957	0.2042	2.428	0.01617	*
Video Content-Rehabilitation Training Demonstration	0.5359	0.2003	2.676	0.00813	**
Video Content-Popular Science Education	0.8625	0.1912	4.512	0.0000115	***
Video Content-Expert Explanations and Recommendations	1.23	0.2873	4.283	0.0000297	***
Likes	0.000009501	0.00001595	0.596	0.55209
Comments	0.0007917	0.0004014	1.972	0.05008
Shares	−0.00009159	0.00004877	−1.878	0.06198
Save	−0.000377	0.000248	−1.52	0.13022
Days since published	−0.0008445	0.0002099	−4.023	0.0000839	***
Duration	0.0004762	0.0001034	4.606	0.00000767	***

mDISCERN: modified DISCERN.
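When comparing categories within one model, it helps to rank the coefficients directly. The sketch below sorts the content-type estimates copied from Table 16 by size. The variable names are hypothetical, and the ranking reflects point estimates only; it takes no account of the standard errors.

```python
from typing import Dict, List, Tuple

# Content-type estimates (beta) copied from Table 16.
CONTENT_ESTIMATES: Dict[str, float] = {
    "Patient Experience Sharing": 0.6467,
    "Rehabilitation Technology Introduction": 0.4957,
    "Rehabilitation Training Demonstration": 0.5359,
    "Popular Science Education": 0.8625,
    "Expert Explanations and Recommendations": 1.23,
}

# Sort from the largest to the smallest coefficient.
ranked: List[Tuple[str, float]] = sorted(
    CONTENT_ESTIMATES.items(), key=lambda item: item[1], reverse=True
)
for position, (content_type, beta) in enumerate(ranked, start=1):
    print(f"{position}. {content_type}: beta = {beta}")
```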

Table 17.

Multiple linear regression analysis of mDISCERN scores (average per day).

Variable	Estimate (β)	Standard error	t	P	Significance
BiliBili	1.245e-01	1.919e-01	0.649	0.517684
Video Source-Business and Advertising	4.854e-01	1.633e-01	2.963	0.005014	**
Video Source-News Reports	4.276e-01	1.725e-01	2.479	0.015971	*
Video Source-Academic Institutions and Research Teams	4.078e-01	1.706e-01	2.390	0.017738	*
Video Source-Medical Institutions and Related Personnel	4.096e-01	1.911e-01	2.139	0.033634	*
Video Content-Patient Experience Sharing	5.870e-01	2.264e-01	2.593	0.010279	*
Video Content-Rehabilitation Technology Introduction	5.056e-01	2.000e-01	2.528	0.013081	*
Video Content-Rehabilitation Training Demonstration	5.197e-01	2.017e-01	2.596	0.010136	*
Video Content-Popular Science Education	8.135e-01	1.878e-01	4.331	.36e-05	***
Video Content-Expert Explanations and Recommendations	1.396e+00	2.872e-01	4.791	0.000129	***
Per-day average likes	3.060e-02	2.005e-02	1.526	0.128327
Per-day average comments	−6.350e-01	2.864e-01	−2.219	0.033563	*
Per-day average shares	−1.269e-01	9.413e-02	−1.454	0.153257
Per-day average save	−5.490e-01	3.036e-01	−1.808	0.072204
Days since published	−7.800e-04	2.027e-04	−3.847	0.000165	***
Duration	4.524e-02	8.583e-03	5.271	3.87e-07	***

mDISCERN: modified DISCERN.

Table 18.

ANOVA results of mDISCERN scores.

Variable	F	P	Significance
Platform	11.2217	0.0009818	***
Video Source	6.8938	0.00003434	***
Video Content	5.9471	0.00004027	***
Likes	5.099	0.0251202	*
Comments	3.424	0.0658656
Shares	7.1096	0.0083538	**
Save	2.6494	0.1053097
Days since published	0.1867	0.666157
Duration	21.2136	0.000007675	***

ANOVA: analysis of variance; mDISCERN: modified DISCERN.

Table 19.

ANOVA results of mDISCERN scores (average per day).

Variable	F	P	Significance
Platform	11.4937	0.0008554	***
Video Source	6.6612	0.0003406	***
Video Content	4.8243	5.347e-05	***
Per-day average likes	10.3811	0.0015071	**
Per-day average comments	9.5566	0.0023039	**
Per-day average shares	0.0754	0.7838714
Per-day average save	0.2139	0.6442932
Days since published	2.6249	0.1069220
Duration	30.1231	1.338e-07	***

ANOVA: analysis of variance; mDISCERN: modified DISCERN.

Discussion

This study systematically evaluated the content quality and reliability of videos related to “robot-assisted rehabilitation” on two major short-video platforms in China: Douyin and BiliBili. While our findings confirm the enormous potential of short-video platforms in health information dissemination, their content ecosystems also reveal structural issues closely linked to digital health literacy and the challenges of misinformation spread.

Content source and platform ecology shape information quality

Although Douyin videos garnered significantly more raw interactions (likes, comments, and shares), this higher engagement did not correspond to better quality or reliability. Regression analysis indicated that the daily share rate was a positive predictor of GQS (β = 0.840, P < 0.001), implying that content worth sharing tends to be of higher quality. However, no significant correlation was observed between daily interaction rates and mDISCERN scores after time-normalization. This disconnect emphasizes that popularity does not equate to credibility—a crucial consideration for public health education, as users may erroneously associate high visibility with trustworthiness.^21,31

Content type significantly influences perceived value

Videos featuring science communication (β = 1.000, P < 0.001) or expert explanations (β = 1.090, P = 0.002) were strong predictors of higher GQS scores. Similarly, content from academic and medical institutions received elevated mDISCERN scores. In contrast, commercially promoted videos consistently underperformed. These results underscore the importance of source credibility and communicative intent in health-related content.^32 They also point to a worrying trend: promotional materials often prioritize esthetic appeal and persuasive messaging over educational value, which may mislead audiences.

Duration and recency matter in content utility

Longer videos were consistently associated with higher GQS and mDISCERN scores, likely because they allow more comprehensive explanations—essential for complex topics like rehabilitation robotics. Conversely, videos that had been online for extended periods scored lower on both scales, possibly due to outdated information or algorithmic neglect. These findings suggest that platforms and creators should prioritize both informational depth and temporal relevance to enhance public understanding.

Educational deficiencies of commercial content and algorithmic amplification effect

Videos produced or promoted by commercial entities consistently underperformed in both GQS and mDISCERN evaluations. This can be attributed to a fundamental misalignment of incentives: whereas educational and scientific content aims to inform, commercial content is primarily designed to promote products or services, capture attention, and drive conversions. Typically centered on product marketing, brand exposure, or traffic conversion, such content prioritizes visual appeal and emotional arousal while neglecting scientific rigor and educational depth. For instance, some videos deliberately construct an image of “pseudo-authority” that misleads viewers by exaggerating rehabilitation efficacy, concealing indications and limitations, or employing marketing rhetoric in the form of “patient testimonials.”

More alarmingly, platform algorithms may inadvertently facilitate the dissemination of such content. Owing to their higher production quality, stronger emotional resonance, and clearer audience targeting, commercial videos often garner greater initial user engagement, thereby meeting the criteria for priority recommendation by algorithms. This occurs because recommendation systems are generally optimized for user retention and interaction, metrics that persuasive, emotionally charged, and well-produced commercial content readily achieves. As a result, algorithm-driven platforms often inherently favor promotional material over educational integrity. This “traffic-first” logic not only squeezes the survival space of high-quality educational content but also further undermines the public's ability to judge the authenticity of health information.

Implications for public health and policy

With rapid population aging and increasing rehabilitation needs, accessible and accurate information on technologies like robot-assisted training is essential. While short videos can effectively disseminate knowledge, unregulated content may perpetuate misinformation. Recent Chinese policies aimed at improving health information quality are a step in the right direction.^8,33 To further enhance the reliability and utility of health-related video content, we propose the following evidence-based strategies.

First, platforms should introduce a mandatory verification system in which content from medical institutions, accredited professionals, and academic sources receives visible authentication labels (e.g., “Verified Medical Source” and “Academic-Endorsed”). A traffic-weighting mechanism could additionally prioritize the distribution of verified content in user recommendations, increasing its reach and impact while demoting unverified or commercial promotional material. Second, detailed, platform-specific content guidelines for health science communication, covering citation standards, disclosure requirements, and balanced presentation of scientific information, can significantly improve quality. Platforms could collaborate with health authorities to offer training and certification programs for creators, especially those producing content in high-demand areas like rehabilitation and assistive technologies. Third, to address outdated or misleading content, platforms should implement timeliness alerts (e.g., “Uploaded over 2 years ago: content may not reflect latest standards”) and integrate rapid public feedback mechanisms such as “Accuracy Flags,” through which users or experts can highlight questionable claims for third-party review. Fourth, structured collaborations between healthcare professionals and digital content creators can bridge the gap between scientific accuracy and public engagement. Platforms can sponsor “creator-researcher pairing” programs, support the production of dual-version videos (both expert and public-friendly), and feature these collaborations prominently to set quality benchmarks. Fifth, embedding simple, interactive checklists or prompting questions next to health videos, such as “Is the source cited?”, “Are benefits and limitations explained?”, and “Is this advice applicable to your condition?”, can cultivate critical appraisal skills among viewers. These tools can be designed in a gamified format to encourage user participation without being overly burdensome.

By adopting these strategies, platforms, creators, and regulators can collectively foster a more trustworthy and educative digital environment—enabling patients, caregivers, and the general public to access robot-assisted rehabilitation information that is not only engaging but also scientifically sound and ethically communicated.

Limitations

The present study is subject to five main limitations. (1) Reliance on publicly available videos from the platforms exposed the sample to algorithmic popularity bias and survivorship bias: high-quality but slow-spreading content in the “long tail” was inevitably omitted. Although we normalized engagement metrics by video age (i.e., calculated daily average rates), the data obtained still only reflect the most visible rehabilitation robotics technology (RAT)-related content on the platforms at the time of data scraping. Adopting a “time-bounded data collection approach” (e.g., including all videos posted within a single year) would yield a corpus that is less tied to algorithms and more comprehensive. (2) The exclusive focus on Chinese-language videos limits the cross-linguistic and cross-cultural generalizability of the findings. Future research should conduct multilingual replications of the study to verify the conclusions. (3) Both the GQS and modified DISCERN (mDISCERN) were originally developed as text-based assessment tools and thus cannot fully capture the audiovisual attributes of videos, such as narrative coherence, visual clarity, and production sophistication. (4) Despite high inter-rater agreement (Cohen's κ = 0.78), the ratings remain inherently subjective. (5) The small sample size of videos from academic institutions and medical facilities undermined the precision of the associated subgroup analyses. Future studies should expand the sample size to include more of these “underrepresented” video sources.
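For readers unfamiliar with the agreement statistic cited in limitation (4), the following minimal sketch computes unweighted Cohen's κ for two raters. The ratings are synthetic and the function name `cohens_kappa` is hypothetical. Whether the authors used the unweighted or a weighted variant is not stated in this section, so the unweighted form is an assumption.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: share of items given the same score by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal proportions per category.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(counts_a) | set(counts_b)
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical category throughout
    return (observed - expected) / (1.0 - expected)


# Synthetic 1-5 scores for ten videos from two hypothetical raters.
scores_rater_1 = [1, 2, 2, 3, 3, 3, 4, 5, 2, 1]
scores_rater_2 = [1, 2, 3, 3, 3, 3, 4, 5, 2, 2]
kappa = cohens_kappa(scores_rater_1, scores_rater_2)
print(f"kappa = {kappa:.3f}")
```

Because κ discounts the agreement expected by chance, it is lower than the raw proportion of matching scores; a high κ indicates consistent raters but does not remove the subjectivity of the scale itself.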

Conclusion

This study is among the first focused analyses of RAT-related video content on Chinese short-video platforms. While both Douyin and BiliBili show potential for disseminating health information, the overall quality of content remains moderate. BiliBili demonstrated higher quality and reliability, likely due to its user base and content moderation mechanisms. Content creators are encouraged to emphasize accuracy, clarity, and scientific rigor. Users should critically evaluate information and consult healthcare professionals when making medical decisions. These findings highlight the need for improved content regulation and effective science communication strategies in the digital age.

Footnotes

ORCID iD

Chi Zhang

Ethical approval

This study analyzed publicly available video metadata (e.g., likes and shares) and did not involve direct interaction with human participants. Ethical approval was waived per national guidelines for noninterventional research using anonymized public data.

Informed consent

This cross-sectional study was completed on 5 February 2025. All study data were extracted from publicly accessible videos on the Douyin and BiliBili platforms. Notably, no clinical records, human specimens, or animal specimens were utilized in the study, and no personally identifiable information (PII) of any individual was collected during the data extraction process. Since there was no direct or indirect interaction with platform users and no involvement of human subjects as defined by ethical guidelines, ethical review was exempted in accordance with the exemption provisions outlined in the Declaration of Helsinki.

Contributorship

XRL proposed the project and wrote the manuscript; CMZ and YJX conducted data analysis; LW, SJW, and XL provided suggestions for the revision of the article; and CZ supervised the project and interpreted the data. All authors reviewed and edited the manuscript.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by the Special Project for Central Government-Guided Local Sci-Tech Development in Sichuan Province (2024ZYD0269), the Sichuan Provincial Science and Technology Department [grant number 2024YFHZ0050], the Luzhou City Science and Technology Bureau [grant numbers 2024LZXNYDJ035 and 2020LZXNYDJ14], and the Cooperation Project between the Second People's Hospital of Deyang and Southwest Medical University [grant number 2022DYEXNYD002].

Declaration of conflicting interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Data availability statement

All data are derived from public platforms (Douyin and BiliBili) and are available through their search functions. Raw video links and metadata are available upon reasonable request.

Guarantor

CZ.

Peer review

Dr. Gaoyang Pang, University of Sydney, reviewed this manuscript.

References

1. Duret, Grosmaire, Krebs. Robot-assisted therapy in upper extremity hemiparesis: overview of an evidence-based approach. Front Neurol 2019; 10: 412.

2. Cinnera, Bonnì, D'Acunto, et al. Cortico-cortical stimulation and robot-assisted therapy (CCS and RAT) for upper limb recovery after stroke: study protocol for a randomised controlled trial. Trials 2023; 24: 823.

3. Yang, Fengyi, et al. Effects of robot-assisted upper limb training combined with functional electrical stimulation in stroke patients: study protocol for a randomized controlled trial. Trials 2024; 25: 355.

4. Mehrholz, Pohl, Platz, et al. Electromechanical and robot-assisted arm training for improving activities of daily living, arm function, and arm muscle strength after stroke. Cochrane Database Syst Rev 2018; 9: CD006876.

5. Hua, Pan, Fang, et al. Integrating social, climate and environmental changes to confront accelerating global aging. BMC Public Health 2024; 24: 2838.

6. Xie, Bowe, Mokdad, et al. Analysis of the global burden of disease study highlights the global, regional, and national trends of chronic kidney disease epidemiology from 1990 to 2016. Kidney Int 2018; 94: 567–581.

7. Global burden of chronic respiratory diseases and risk factors, 1990–2019: an update from the global burden of disease study 2019. EClinicalMedicine 2023; 59: 101936.

8. Zhu, et al. When “Aging” meets “Intelligence”: smart health cognition and intentions of older adults in rural Western China. Front Psychiatry 2024; 15: 1493376.

9. Germanotta, Cortellini, Insalaco, et al. Effects of upper limb robot-assisted rehabilitation compared with conventional therapy in patients with stroke: preliminary results on a daily task assessed using motion analysis. Sensors (Basel, Switzerland) 2023; 23: 3089.

10. Kim, Lee. Effects of rehabilitation robot training on physical function, functional recovery, and daily living activities in patients with sub-acute stroke. Medicina (Kaunas, Lithuania) 2024; 60: 811.

11. Yan, Cui, Murong, et al. Effect of rehabilitation robot rehabilitation training synchronizing acupuncture exercise therapy on postoperative rehabilitation with hip fracture. Zhongguo Zhen Jiu 2021; 41: 387–390.

12. Barrientos, Del Cerro. Robotics in medicine. Med Clin (Barc) 2019; 152: 493–494.

13. Aburub, Darabseh, Badran, et al. The application of robotics in cardiac rehabilitation: a systematic review. Medicina (Kaunas, Lithuania) 2024; 60: 1161.

14. Ase, Honaga, Tani, et al. Effects of home-based virtual reality upper extremity rehabilitation in persons with chronic stroke: a randomized controlled trial. J Neuroeng Rehabil 2025; 22: 20.

15. McDonnell, Devine, Kavanagh. The general public's perception of robotic surgery - a scoping review. Surgeon: J Royal Colleges Surgeons of Edinburgh Ireland 2025; 23: e49–e62.

16. Ding, Gui, et al. Patient acceptance of medical service robots in the medical intelligence era: an empirical study based on an extended AI device use acceptance model. Humanities Social Sci Commun 2024; 11: 1495.

17. Dee, Muralidhar, Butler, et al. General and health-related internet use among cancer survivors in the United States: a 2013–2018 cross-sectional analysis. J National Comprehensive Cancer Network: JNCCN 2020; 18: 1468–1475.

18. Wang, Song, et al. The reliability and quality of short videos as a source of dietary guidance for inflammatory bowel disease: cross-sectional study. J Med Internet Res 2023; 25: e41518.

19. Sun, Zheng. Quality of information in gallstone disease videos on TikTok: cross-sectional study. J Med Internet Res 2023; 25: e39162.

20. Hong, Yee, Bagchi, et al. Social media-based intervention to promote HBV screening and liver cancer prevention among Korean Americans: results of a pilot study. Digital Health 2022; 8: 20552076221076257.

21. Zheng, Tong, Wan, et al. Quality and reliability of liver cancer-related short Chinese videos on TikTok and Bilibili: cross-sectional content analysis study. J Med Internet Res 2023; 25: e47210.

22. Sun, Guo, et al. Evolutionary game analysis of building a sustainable intelligent elderly care service platform. Sci Rep 2024; 14: 28653.

23. Liu, et al. Quality assessment of health science-related short videos on TikTok: a scoping review. Int J Med Inf 2024; 186: 105426.

24. Canatan. Assessing the quality and reliability of videos related to fibromyalgia on TikTok: a comprehensive analysis. Cureus 2024; 16: e64704.

25. Lai, Liao, et al. The status quo of short videos as a health information source of Helicobacter pylori: a cross-sectional study. Front Public Health 2023; 11: 1344212.

26. Ferhatoglu, Kartal, Ekici, et al. Evaluation of the reliability, utility, and quality of the information in sleeve gastrectomy videos shared on open access video sharing platform YouTube. Obes Surg 2019; 29: 1477–1484.

27. Mueller, Hongler VNS, Jungo, et al. Fiction, falsehoods, and few facts: cross-sectional study on the content-related quality of atopic eczema-related videos on YouTube. J Med Internet Res 2020; 22: e15599.

28. Charnock, Shepperd, Needham, et al. DISCERN: an instrument for judging the quality of written consumer health information on treatment choices. J Epidemiol Community Health 1999; 53: 105–111.

29. Karakoyun, Yildirim. YouTube videos as a source of information concerning Behçet's disease: a reliability and quality analysis. Rheumatol Int 2021; 41: 2117–2123.

30. Kyarunts, Mansukhani, Loukianova, et al. Assessing the quality of publicly available videos on MDMA-assisted psychotherapy for PTSD. Am J Addict 2022; 31: 502–507.

31. Chou, Hunt, Beckjord, et al. Social media use in the United States: implications for health communication. J Med Internet Res 2009; 11: e48.

32. Singh. YouTube for information on rheumatoid arthritis - a wakeup call? J Rheumatol 2012; 39: 899–903.

33. D'Souza, Strand, et al. YouTube as a source of medical information on the novel coronavirus 2019 disease (COVID-19) pandemic. Glob Public Health 2020; 15: 935–942.
