Abstract
Background
With the rise of short video platforms, TikTok and Bilibili have become major sources of health information for the public. This study aimed to evaluate the content, quality, and reliability of amblyopia-related videos.
Methods
Using “amblyopia” as the keyword, we collected the video content, engagement metrics, video duration, and uploaders identity of the top 150 default-ranked videos on both platforms. The Global Quality Score (GQS) and the modified DISCERN (mDISCERN) tool were used to assess video quality and reliability. Mann-Whitney U and Kruskal-Wallis tests were then used for group comparisons, and Spearman correlation was used for correlation analysis.
Results
A total of 199 videos were included. TikTok videos were significantly shorter than Bilibili videos. TikTok demonstrated considerably higher user engagement across likes, comments, collections and shares. Video content mainly focused on treatment (88.44%), while etiology (39.70%) and prevention (40.70%) were less discussed. The median GQS score was 3.00 (IQR: 2.00–3.00), and the median mDISCERN score was 3.00 (IQR: 2.00–3.00). No differences were found between platforms in GQS and mDISCERN (
Conclusion
The quality and reliability of amblyopia-related videos were suboptimal, with diagnosis and prevention receiving insufficient attention. Videos uploaded by specialists had the highest quality and reliability. Strengthening content review and oversight and encouraging greater participation of specialists in amblyopia science communication are needed to improve the quality of health information on short video platforms.
Introduction
Amblyopia is a visual disorder where there is no organic eye disease, but corrected vision remains below normal, affecting 1%–5% of the global population.1,2 This condition arises from abnormal binocular interaction, leading to impaired best-corrected visual acuity. In addition to affecting vision, amblyopia significantly impacts children’s academic performance, psychological development, and overall quality of life.3,4 If left untreated during the critical period of visual development, it may lead to permanent damage. 5 Therefore, raising public awareness is crucial for early detection and intervention, which helps improve visual outcomes and reduce the associated socioeconomic burden. 6
With the widespread use of the internet and social media, short video platforms such as TikTok and Bilibili have become essential tools in public health education. These platforms, with their large user bases, easily shareable formats, and algorithm-driven distribution mechanisms, have become key carriers of health information.7,8 However, the rapid dissemination of medical content on these platforms has raised serious concerns regarding its quality and reliability. Studies on conditions such as gastroesophageal reflux disease, 9 thyroid eye disease, 10 and hypertension 11 show that while user engagement is high, the quality and reliability of content are often suboptimal, leading to the spread of misinformation, delays in appropriate treatment, and negative impacts on health behaviors.12,13 Given the impact of amblyopia on patients, particularly the long-term damage to children’s academic performance, psychological development, and quality of life,14,15 it is essential to assess the quality and reliability of amblyopia-related videos.
This study evaluated the content, quality, and reliability of amblyopia-related videos on TikTok and Bilibili platforms, aiming to analyze the current status and shortcomings of amblyopia-related videos, and provide preliminary insights for optimizing health information dissemination strategies.
Materials and methods
Data sources and search strategy
This study employed a cross-sectional design to evaluate the quality and reliability of amblyopia-related short videos on two major Chinese platforms, Bilibili (https://www.bilibili.com) and TikTok (https://www.douyin.com). On October 1, 2025, the Chinese keyword “弱视” (amblyopia) was used as the primary search term. To minimize bias from personalized recommendation algorithms, all searches were conducted using newly created accounts in incognito mode to ensure that historical data or personalization did not influence the results. At the initial retrieval stage, the top 150 videos from the default search results on each platform were collected.
Inclusion and exclusion criteria
Inclusion criteria: Videos directly related to amblyopia, including information on etiology, clinical features, diagnosis, treatment, or prevention. Exclusion criteria: 1) Duplicate content or re-uploaded videos; 2) Videos unrelated to amblyopia or those that merely mention the condition without providing substantial information; 3) Advertisements or promotional material; 4) Private or inaccessible videos.
Data extraction and classification
For each video meeting the inclusion criteria, the following variables were systematically extracted: platform source (Bilibili or TikTok), upload time, video duration (in seconds), likes, comments, shares, and collections. Video content was categorized into etiology, symptoms, diagnosis, treatment, and prevention. The etiology of amblyopia refers to the common causes, such as refractive errors, strabismus, and congenital conditions. 16 The symptoms involve the main clinical features, including visual impairment, differences in vision between the eyes, and strabismus. 17 Diagnosis includes the techniques used to detect amblyopia, such as visual acuity tests and cover tests. 18 Treatment focuses on the methods available to manage amblyopia, including corrective lenses, patch therapy, and vision training. 19 Prevention emphasizes early screening and intervention strategies to identify and address amblyopia in children, aiming to prevent its progression. 1 Uploaders were grouped as follows: specialist group (ophthalmologists and related medical professionals), non-specialist healthcare practitioner group (healthcare providers from other disciplines), and individual user group.
Video quality assessment
The Global Quality Score (GQS) quality criteria.
The modified DISCERN (mDISCERN) quality criteria.
Statistical analysis
Descriptive statistics were used to summarize basic video characteristics. Non-normally distributed data are presented as median and interquartile range (IQR), and categorical variables as frequencies and percentages. Two group comparisons were conducted using the Mann–Whitney U test, and comparisons among three or more groups were performed with the Kruskal–Wallis H test. Spearman’s rank correlation coefficient was used to assess associations between video quality scores (GQS and mDISCERN) and engagement metrics (likes, comments, shares, and collections). A two-sided
Result
Video characteristics
After applying the exclusion criteria, a total of 199 videos meeting the inclusion criteria were analyzed, including 100 from Bilibili and 99 from TikTok. The detailed selection process is shown in Figure 1, and the video characteristics are summarized in Table 3. The median engagement metrics were as follows: likes 91.00 (IQR: 16.50–557.50), comments 19.00 (IQR: 3.00–184.00), collections 86.00 (IQR: 14.00–344.00), and shares 37.00 (IQR: 4.25–204.75). The median video duration was 134.00 seconds (IQR: 67.00–307.50). Regarding video quality, the median GQS score was 3.00 (IQR: 2.00–3.00), and the median mDISCERN score was also 3.00 (IQR: 2.00–3.00). Inter-rater consistency was excellent, with Cohen’s Kappa values of 0.88 for GQS and 0.89 for modified DISCERN scores. The flow chart of this study. General characteristics, quality, and reliability of the videos.
Video content
Treatment was the most frequently discussed topic, appearing in 176.00 videos (88.44%), followed by clinical features in 127.00 videos (63.82%). Etiology, diagnosis, and prevention were discussed in 79.00 (39.70%), 84.00 (42.21%), and 81.00 (40.70%) videos, respectively (Table 3).
Cross-platform comparison
General information, quality, and reliability scores of amblyopia videos on TikTok and Bilibili.

Information about amblyopia-related video content from TikTok and Bilibili.

(a) Comparison of GQS scores between TikTok and Bilibili platforms. (b) Comparison of mDISCERN scores between TikTok and Bilibili platforms.
Comparison among different uploaders
Figure 4(a) shows the overall distribution of uploader categories. Specialists accounted for 34.67%, individual users for 43.72%, and non-specialists for 21.61%. Figure 4(b) compared the proportions of different uploader types on TikTok and Bilibili. Individual users made up the largest proportion of uploaders, followed by specialists, with non-specialists representing the smallest group. Both TikTok and Bilibili exhibited similar distribution patterns. By analyzing the relationship between video data and uploader category, we found no statistically significant differences in engagement metrics (likes, comments, collections, and shares) among videos uploaded by specialists, non-specialists, and individual users (Table 5). However, when analyzing video quality, we found that videos uploaded by specialists had significantly higher quality, with a median GQS score of 4.00 (IQR: 3.00–4.00) and a median mDISCERN score of 3.00 (IQR:3.00–4.00). Figure 5(a) illustrates the distribution of GQS scores across different uploader types. Specialists scored significantly higher than other uploader groups in both GQS ( (a) Uploaders on amblyopia-related videos from TikTok and Bilibili. (b) Distribution of uploaders for amblyopia-related videos on TikTok and Bilibili. Characteristics, quality, and reliability of amblyopia videos by different uploaders on TikTok and Bilibili. (a) Violin plots illustrating the distribution of GQS scores among different uploader groups. (b) Violin plots illustrating the distribution of mDISCERN scores among different uploader groups.

Correlation analysis between video features and quality
Figure 6 presents the correlation analysis between video characteristics and quality metrics. A strong positive correlation was found between GQS and mDISCERN scores ( Correlation analysis between video characteristics and quality metrics.
Discussion
Systemic gaps in content coverage
Analysis of amblyopia-related video content revealed a clear imbalance in topic coverage. Treatment was the most frequently discussed theme, whereas coverage of etiology, diagnosis, and prevention was notably insufficient. Most videos provided only brief introductions to rehabilitation methods, lacking in-depth explanations of pathogenesis, standardized diagnostic criteria, and long-term outcomes. This imbalance may result from creators’ tendency to focus on content that is practical and easy to present, while giving less attention to more abstract or complex topics such as etiology, diagnosis, and prevention. Furthermore, the limited duration of short videos may contribute to selective presentation of content. 22 Such systematic information gaps may pose serious public health risks: insufficient knowledge of etiology can delay early treatment; inadequate understanding of diagnostic criteria may result in misinterpretation of screening results, missing optimal intervention opportunities; and lack of prognostic information can reduce treatment adherence and rehabilitation effectiveness, potentially causing strabismus, visual impairment, or loss of stereopsis.23–25 Previous studies have shown that the completeness of health information is positively associated with public disease awareness and treatment adherence. 26 Based on these findings, platforms are encouraged to establish systematic content review standards and promote the production of educational videos that comprehensively cover the entire disease process, including prevention, etiology, diagnosis, treatment, and rehabilitation, to ensure the thorough dissemination of essential medical information. In addition, short video creators should prioritize diagnostic and preventive content to improve the scientific accuracy and completeness of their videos.
Platform characteristics and content quality
This study identified significant differences in the dissemination of amblyopia-related content between Bilibili and TikTok. Bilibili predominantly features long-form videos, which are conducive to systematic explanations, yet exhibit relatively low levels of user interaction. Conversely, TikTok’s shorter, more concise, and fast-paced videos elicit substantially higher user engagement across metrics such as likes, comments, favorites, and shares. Notably, both platforms demonstrate comparable levels of scientific reliability and overall content quality, generally at a moderate level. 27 These differences appear to stem primarily from variations in platform content mechanisms, user demographics, and modes of content presentation. To enhance the effectiveness of amblyopia-related science communication, it is recommended that platforms optimize content formats, for instance, by incorporating chapter segmentation or interactive elements in long-form videos and providing appropriate scientific substantiation in short form videos. 28 Furthermore, content creators should be encouraged to translate complex concepts into accessible and comprehensible formats while maintaining scientific rigor, thereby leveraging the high interactivity characteristic of short videos without compromising informational completeness.
Deterministic influence of uploader professional background
Analysis of uploader composition revealed a similar distribution pattern of amblyopia-related video creators across both platforms, with individual users accounting for the largest proportion (43.72%), followed by medical specialists (34.67%) and non-specialists (21.61%). Quality assessment demonstrated significant differences in content quality among creators with varying professional backgrounds. Videos produced by specialists achieved the highest median GQS (4.00) and mDISCERN (3.00) scores, underscoring the decisive influence of professional qualifications on content reliability. This finding aligns with previous studies on diseases such as monkeypox 29 and chronic obstructive pulmonary disease, 30 which reported that healthcare professionals adhering to evidence-based principles are more likely to produce scientifically standardized and credible content. The disparity in content quality may primarily stem from non-professional creators’ limited medical knowledge and their tendency to simplify or sensationalize information to attract viewer engagement, thereby compromising scientific accuracy. 31 Nevertheless, even specialist-generated videos exhibited only moderate overall quality (median GQS and mDISCERN = 3.00), indicating substantial room for improvement in professional health communication. Therefore, it is recommended that platforms strengthen medical content review and professional verification systems, implement algorithmic mechanisms to promote high quality evidence-based materials, and encourage broader participation of healthcare professionals in short video production. Meanwhile, viewers should be guided to prioritize medically verified sources and remain cautious when interpreting information from non-professional creators. 32
Analysis of the correlation between video metrics and scientific content
Notably, correlation analysis revealed strong positive relationships among the engagement metrics, indicating that videos performing well in one dimension of interaction tend to perform well across other dimensions, which is consistent with previous research.27,33 However, high levels of engagement do not necessarily correspond to increased scientific rigor or content quality. This may be attributed to users’ preference for videos that are entertaining or easily understandable rather than those with the highest scientific accuracy. 32 Therefore, it is recommended that platforms incorporate professional content quality assessment mechanisms into their recommendation algorithms and establish medical content review and traffic support systems to promote the dissemination of high quality science communication. At the same time, users should be fully aware that high engagement does not equate to high quality or highly reliable medical information. 34
Although the videos included in this study were designed for the general public and primarily educational, some medical terminology was inevitably used. 35 Such terms are relatively understandable to professionals with an ophthalmic background but may pose comprehension challenges for patients without medical training, particularly for concepts such as “refractive error” and “amblyopia”. 36 To enhance public accessibility and educational effectiveness, we strongly recommend that video creators use plain language and provide clear explanations when introducing medical terms, thereby helping patients accurately understand and utilize the health information.37–39
This study systematically evaluated amblyopia-related short videos on Bilibili and TikTok, revealing that platform characteristics, uploaders professionalism, and content focus significantly affect the quality of health information. Videos produced by medical professionals demonstrated higher reliability, whereas content uploaded by non-professionals lacked scientific rigor,34,40 although overall quality still has room for improvement. 41 Platform differences were evident, with Bilibili videos being longer and TikTok videos exhibiting higher user engagement,33,42 yet overall content quality was comparable. Content analysis identified systematic gaps, including insufficient coverage of etiology, diagnosis, and prevention,43,44 while treatment-related information accounted for the largest proportion. These findings provide empirical support for optimizing health information dissemination strategies.
This study has several limitations. First, this study focused solely on domestic platforms, which limits the external generalizability of the findings. Additionally, most of the videos are in Chinese, which may hinder understanding for non-Chinese-speaking readers. Second, although validated scoring scales were used, the evaluation process was inherently subjective, as the observers were not blinded to the uploader’s identity, which may have introduced bias. Third, although validated general quality scoring systems were used, these tools were not specifically designed for amblyopia-related content, which represents a limitation. The study also relied on a fixed set of search terms, which may restrict the comprehensiveness of retrieved videos and limit broader applicability. Future studies should include more diverse and commonly used search terms, such as lay expressions equivalent to “lazy eye” in different languages, and may benefit from developing more disease-specific scoring criteria. Furthermore, the ophthalmic background of the assessors may have introduced potential bias into the evaluation. Future studies could employ dedicated evaluation tools that explicitly assess plain-language use, avoidance of excessive medical jargon, and public understandability, thereby better ensuring health information is accessible and comprehensible to the general population. Finally, as a cross-sectional study, it does not capture the dynamic changes in video quality and reliability over time.
Conclusion
This cross-sectional study evaluated 199 amblyopia-related short videos on Bilibili and TikTok using the mDISCERN tool and GQS. Treatment was the most frequently discussed topic, while diagnosis and prevention were inadequately addressed. The overall quality of videos on both platforms was moderate, with Bilibili favoring long-form videos and TikTok demonstrating higher user engagement. Videos uploaded by specialists showed the highest quality and reliability. Video duration had a weak positive correlation with quality scores, while no significant association was found between engagement metrics and quality scores. These findings highlight the need for platforms to establish professional review mechanisms, optimize recommendation algorithms, and encourage comprehensive, evidence-based content creation. Additionally, users should prioritize information provided by specialists to obtain higher quality amblyopia-related online health information.
Footnotes
Acknowledgements
The authors express their gratitude to the people who participated in the study.
Ethical considerations
All data were sourced from publicly available TikTok videos and did not include any personally identifiable information or involve human participants. Accordingly, in line with national and international ethical guidelines for digital health research, this study was exempt from institutional ethics review.
Consent to participate
Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.
Author contributions
YZ conceptualized and designed the study. Data collection was performed by NZ and LJ. YZ and NZ conducted the data analysis. Manuscript drafting was carried out by YZ, with critical revisions by LN. LN supervised the study and managed the project. All authors have read and approved the final manuscript.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Nursing Association Scientific Research Project of Wenzhou (WHZD202504). Science and Technology Project of Wenzhou (Self-financed Funds) (RC20250118).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
The data supporting the findings of this study are available from the corresponding author upon reasonable request.
