Abstract
Importance
Clinicians face great challenges in diagnosing dizziness/vertigo disease due to its subjectivity. Currently, there is an absence of machine learning model that could make full use of the information gained from both medical history and physical signs.
Objective
To develop and validate a machine learning model based on medical history and physical signs for dizziness/vertigo disease diagnosis, relieving the burden of diagnosis for clinicians.
Design
A retrospective cohort study.
Setting
Tertiary referral center.
Participants
This study included 1003 patients conformed to the inclusion criteria at the neuro-otologists’ clinics.
Exposures
Thirty-one medical history items, and 9 bedside examination signs recorded by routinely performing a detailed ocular motor examination using video goggles.
Main Outcome Measures
The accuracy, precision, recall, F1 scores, and Matthews’ correlation coefficient of disease diagnosis.
Results
On the collected dataset of 16 categories of dizziness/vertigo diseases, the proposed model achieved an accuracy of 98.11% and an F1 score of 95.43%. The model demonstrated its optimal robustness when tested with datasets containing added noise. Additionally, an analysis of the correlation between medical history and signs was conducted, along with several case studies.
Conclusions
A machine learning-based model was proposed for the diagnosis of dizziness/vertigo, which effectively combined patients’ medical history and signs. In terms of diagnostic accuracy, it outperforms models that rely solely on either medical history or signs for diagnosis.
Relevance
The proposed method can effectively combine the patient’s medical history and physical sign information to make the diagnosis of dizziness/vertigo disease, which has the potential to relieve the burden of diagnosis for clinicians to a certain extent.
Key Messages
Previous works remain constrained to isolated analysis of either medical history data or physical signs, lacking integration of both information.
A machine learning-based model is developed to diagnose dizziness/vertigo diseases by analyzing the information from both medical history and physical signs.
In comparison to utilizing either information alone, the combined use of medical history and physical signs demonstrates a substantial improvement in diagnostic performance.
Introduction
According to large population-based studies, the prevalence of dizziness/vertigo among adults varies from 15% to over 20% and rises along with age. 1 Numerous conditions can underlie dizziness/vertigo. A thorough history combined with classic signs make bedside diagnosis possible in most patients with dizziness/vertigo. 2 Informative history that helps differentiate confusing diagnosis, should include frequency, duration, and triggers of the attacks, previous medical history, and accompanying symptom. Whereas, typical signs may point directly to a specific diagnosis, such as spontaneous, gaze-evoked, head-shaking-evoked, positional-testing evoked nystagmus, and decreased vestibulo-ocular reflex (VOR). Both medical history and body signs are crucial for physicians to make the correct diagnosis, overlooking or over-reliance on either one can easily compromise the accurate assessment. 3
Specialists for dizziness/vertigo can reach a preliminary diagnosis of the disease based on detailed medical history and accurate bedside examination results. However, untrained emergency or general practitioners may have a hard time making a diagnosis. In the absence of reliable history or correctly measured physical signs, even experienced doctors may have difficulty making accurate diagnosis. Artificial intelligence algorithms trained on datasets can minimize the effects of missing data in terms of inaccurate medical history or sign judgments; thereby better assisting physicians who are not specialized in dizziness/vertigo with clinical decision-making. When applying machine learning model for dizziness/vertigo diagnosis, most of the methods proposed in the current related literature are limited to either information extracted from the medical history, or data on physical signs obtained from examinations.4 -6 However, to the best of our knowledge, few works have attempted to make full use of the information gained from both medical history and physical signs. In addition, the current models perform relatively poorly in differentiating the dizziness/vertigo diseases with precision, which rarely includes diagnosis other than the most common 6 to 9 subtypes, such as vestibular migraine (VM), Menière’s disease (MD), benign paroxysmal positional vertigo (BPPV), and vestibular paroxysmia (VP).7 -9
The aim of this study is to develop and validate a machine learning model based on medical history and physical signs for dizziness/vertigo disease diagnosis, serving as an auxiliary diagnosis tool for clinicians. Specifically, to understand the correlation between medical history and signs, canonical correlation analysis (CCA) was adopted to seek canonical variables, which also establish a shared representation subspace for them at the same time. Inspired by the good performance of ensemble learning, an ensemble prediction model was proposed to collaboratively exploit the information from the original medical history and signs, as well as the potential commonality information between them.
Materials and Methods
Dataset
To develop data-driven machine learning model for dizziness/vertigo disease diagnosis, the dataset used throughout this study was collected from a tertiary referral center. Specifically, in compliance with the medical ethic requirement and privacy regulations, we collected a total of 1003 cases at the neuro-otologists’ clinics. For each case, the medical history and bedside examination signs were recorded, by routinely performing a detailed ocular motor examination using video goggles. In addition, diagnosis of these cases falls into 16 categories, as shown in Table 1, including BPPV, MD, VM, VP, persistent postural perceptual vertigo, hemodynamic orthostatic dizziness/vertigo, severe sudden sensorineural hearing loss with vertigo (excluded from central original). For medical history, 31 typical attributes are extracted to characterize the patients’ baseline health condition. Meanwhile, for physical examination, the nystagmus tests are performed to measure the signs, where 9 associate attributes were extracted. The detailed information about the feature attributes and distribution of demographic characteristics is described in Supplementary Table S1, S2, and Figure S1.
Subtypes of Dizziness/Vertigo Diseases in Dataset.
Model Overview
As shown in Figure 1a, we develop a machine-learning based model to diagnose dizziness/vertigo diseases by analyzing the information from both medical history and physical signs of patients. Overall, the model mainly consists of 3 parts as follows:

(a) Overall framework of data-driven prediction and diagnosis of dizziness and vertigo diseases. (b) Schematic representation of feature importance based on CED. CED, case-level explainable paramedical diagnosis model.
Embedding for categorical data is to obtain the numerical type embeddings of the medical history and signs composed of categorical attributions.
CCA attempts to seek canonical variables for correlation measurement between them, which also form a shared representation.
Ensemble prediction is adopted to promote the prediction performance by collaboratively exploiting the information from the original medical history and signs, as well as the potential commonality information between them.
Preliminaries
Let’s first give some notations for use in the following description. Suppose that there are
CED-Based Embedding for Medical History and Signs
For the given medical history
In our previous work, we have proposed a decision tree based case-level explainable paramedical diagnosis model (CED).
10
To implement backtracking of the decision path, the bi-side mutual information is exploited in CED to obtain the contribution of each feature to the final decision. By aggregating feature contributions across
where
In fact, the effects of the embeddings
CCA for Medical History and Signs
As a medical consensus, it is deeply accepted in dizziness/vertigo disease diagnose by clinical doctor that the medical history and signs information from a patient are not independent of each other but take on some kind of correlation to a certain extent; in other word, they generally play a complementary role to each other in making clinical decision. To measure their correlation, the CCA is adopted. 11
For multivariate data analysis, CCA aims to seek for multiple pairs of projection vectors that can maximize the correlation between the projections of 2 set of multivariables. Specifically, given the embeddings
where
Given the canonical correlation subspace supported by
Ensemble Prediction by Voting
As shown in Figure 1a, an ensemble scheme with soft weighting based on gradient boosting decision tree (GBDT) is designed to exploit the information of the original medical history and signs, as well as the potential commonality information between them. Specifically, for a patient
where
Results
Experiment Setting and Evaluation Metrics
To validate the effectiveness of our proposed model for dizziness/vertigo disease diagnosis, the collected dataset as described in dataset section is divided randomly into training and testing sets in a ratio of 8:2. Meanwhile, the following metrics including accuracy, precision, recall, F1, and Matthews correlation coefficient are adopted for performance evaluation.
Main Results for Dizziness/Vertigo Disease Diagnosis
To evaluate the performances of machine learning models for dizziness/vertigo disease, some classical methods are used as baselines, including logistic regression, multilayer perceptron, categorical boosting, and GBDT. In addition, to make the performance comparison more fair, the random division of the dataset as mentioned above is carried out 5 times, and the average results are reported.
In Table 2, we use “mh,” “ps,” and “mh + ps” to indicate that the medical history, physical signs, and their concatenation, respectively, are used for disease diagnosis. As we can see, the medical history information is more useful for disease diagnosis than physical signs. When taking GBDT as classifier, 15.62% improvement from medical history over physical signs in accuracy is obtained. Meanwhile, in comparison to utilizing either information alone, the combined use of medical history and physical signs demonstrates a substantial improvement in performance across all baseline models. It has shown that our method has achieved the best result in Precision of 96.25%, closely rivaling the best baseline GBDT in other performance metrics, showing the proposed method can effectively integrate medical history and physical signs for diagnosis of dizziness and vertigo diseases.
Results for Dizziness/Vertigo Disease Diagnosis by Using Medical History, Physical Signs, and Their Combination, Respectively. Best and Second-Best Results Are Highlighted in Bold and Italics, Respectively.
Abbreviations: CatBoost, categorical boosting; GBDT, gradient boosting decision tree; LR, logistic regression; MCC, Matthews correlation coefficient; mh, medical history; MLP, multilayer perceptron; ps, physical signs.
To demonstrate the model’s capability in distinguishing rare types, Figure 2 shows the overall confusion matrix of prediction results from 5 repetitive experiments. Through careful observation of the matrix, it can be seen that our model achieves good classification results for rare disease types as well, such as hemodynamic orthostatic dizziness/vertigo (Index 6) and Hunt syndrome (Index 8). This indicates that our model is not only applicable to common vertigo/dizziness diseases but also capable of accurately identifying and classifying rarer types.

The overall confusion matrix of prediction results.
Robustness of Our Model
For a good machine learning model, it should not only show advantages in overall prediction performance but also have good robustness. In the clinical examination of signs, due to its subjective nature, there will inevitably be incorrect descriptions of certain attributes, which can especially happen to some inexperienced clinicians. To simulate the above situation that may occur in clinical examination of patients, some sort of perturbation to the observed data is carried out.
Compared to medical history, it is more likely to appear as some imprecise judgments in the examination of nystagmus signs. For this reason, we only consider adding some noise to the examination of physical signs. Specifically, we implemented random perturbations on all attributes in the examination results of the nystagmus signs. For each physical sign data of all patients in the test dataset, we randomly replaced the values with one of the other values of the same attribute with a probability of P to simulate the effect of data noise.
To observe the robustness of the decision model under different intensities of interference, we set P = .1 and P = .2, respectively. As we can see from Table 3, the performances of the baseline methods have decreased significantly, whereas our model maintains a notable level of stability and achieves the best result. In real-world scenarios, the inherent limitations of medical conditions and facilities make it inevitable to encounter misjudgments in interpreting patient signs. Our method can effectively reduce the impact of such misjudgment on diagnosis and improve medical convenience and accuracy.
Results for Dizziness/Vertigo Disease Diagnosis After Adding Noise to the Examination of Physical Signs. Best and Second-Best Results Are Highlighted in Bold and Italics, Respectively.
Abbreviations: CatBoost, categorical boosting; GBDT, gradient boosting decision tree; LR, logistic regression; MCC, Matthews correlation coefficient; MLP, multi-layer perceptron.
Correlation Analysis Between Medical History and Signs
In order to provide doctors with more information worthy of reference, it is necessary to quantitatively evaluate the correlation between them. As mentioned above, CCA can not only be used to reduce the dimensions of medical history and signs to obtain the shared canonical related variables but also be used to calculate and quantify the correlation of medical history and signs. Concretely, we compute the cosine similarity between the embeddings of medical history and physical signs for each sample and obtain the correlation between the medical history and signs by averaging samples within each disease.
As shown in Figure 3, the correlation between medical history and physical signs for various diseases are consistently positive and exhibit an overall high level. One-sample t-tests (null hypothesis: the correlation in the population is 0) are conducted to evaluate the similarity between the embeddings of medical history and physical signs for each disease. Across all disease cohorts, it can be observed that P < .001. This suggests a strong association between medical history and physical signs within the same disease, facilitating the integration of both feature types for accurate patient diagnosis.

Correlation analysis between medical history and signs of different subtypes.
Discussion
More than 30 years ago, researchers started to launch studies on the application of artificial intelligence in the field of dizziness/vertigo diagnosis. Since 2019, owing to the pandemic of the COVID-19, artificial intelligence techniques have been increasingly applied to establish diagnosis of dizziness/vertigo in both the emergency room and the remote setting. This change effectively reduced the offline visits, and therefore, significantly lowered the risk of exposure and infection.12 -16 In 2019, Choi predicted that remote consultation based on artificial intelligence might be the future practice mode that enabled real-time diagnosis of emergency dizziness and dizziness. 17
Machine learning technology has been popularly used in medical record analysis18 -20 and disease diagnosis21 -24 throughout the past years. By discovering the underlying features and patterns, machine learning models have shown a great potential for medical-aided diagnosis. As far as dizziness/vertigo disease is concerned, machine learning can be employed for automated detection of nystagmus patterns.25 -27 Furthermore, both the etiology and treatment options can be determined based on historical data and patients’ clinical manifestations.28,29 Physicians can, in turn, refer to these prediction results to enhance the accuracy of diagnosis and to optimize the treatment options.
In clinical practice, physicians should integrate information obtained from the medical history with physical signs to establish the diagnosis of dizziness/vertigo. Knowledge of the frequency and duration of dizziness attacks, accompanying symptoms during attacks, patient’s age, precedent attacks of dizziness, past medical history of blood pressure, blood glucose, and lipid-related diseases, past medical history of headache, and hearing loss are all crucial to the diagnosis. Therefore, we constructed a dataset to include the 31 selected attributes concerning the medical history.
All patients underwent a battery of bedside examinations, among which 9 signs were selected as attributes for our model, based on high diagnostic sensitivity and specificity for different vestibular lesions. Specifically, spontaneous nystagmus, gaze-evoked nystagmus, head-shaking nystagmus (HSN), and head-shaking tilt suppression test distinguish central lesions from the peripherals 30 ; head-impulse test; and dynamic visual acuity are typical signs of VOR abnormalities. Furthermore, nystagmus characterizes certain position-related disorders (such as benign paroxysmal positional vertigo (BPPV) and central paroxysmal positional vertigo (CPPV)), including the direction, latency, duration, and attenuation mode of nystagmus. Abnormal HSN results suggested potential vestibular dysfunction, which has the most diagnostic value in distinguishing 2 paroxysmal vestibular lesions with similar clinical symptoms, for example, MD and VM. 31 Finally, a total of 1003 patients at the neuro-otologists’ clinics was collected.
In this study, we developed a machine learning model based on medical history and physical signs for dizziness/vertigo disease diagnosis, serving as an auxiliary diagnosis tool for clinicians. We conducted experiments using medical history, physical signs, and their combination, respectively. The results showed that diagnosis based on medical history achieved higher accuracy than that based on physical signs alone, while the combination of both achieved superior diagnostic performance compared to using either information source independently. Furthermore, we attempted to add noise to physical signs to simulate the distraction from establishing precise diagnosis in clinical scenario. Despite the background noise, as shown in Table 3, the diagnostic accuracy of our model was as high as 95.12% and 90.25% with random perturbation probability P = .1 and P = .2, respectively. In contrast, baseline methods exhibited marked performance degradation, struggling to adapt to potential physical sign misjudgments in real-world scenarios. It means the proposed model can help those general practitioners or emergency physicians, who are not specialists in dizziness, reach preliminary diagnosis.
As discussed above, diagnostic accuracy ought to be doubted when relying solely on medical history or physical signs in specific clinical scenarios. In contrast, our model is competent to comprehensively integrate both medical history with physical signs, thereby reaching precise diagnosis of the disease. We further demonstrate this through case studies, with specifics detailed in Table 4.
Case 1. A patient diagnosed with central positional vertigo. Due to the metastatic tumor involving his cerebellum, this patient suffered clinical symptoms resembling closely to that of BPPV. It is hard to avoid misdiagnosis if medical history was the only evidence to be considered. Signs including consistent spontaneous downbeat nystagmus, directional changing gaze-evoked nystagmus, and atypical direction of nystagmus evoked by positional tests all pointed to the central origin.
Case 2. The clinical manifestations of different subtypes of BPPV are the same, characterized by spinning sensation provoked by changes in head position. However, for different subtypes of BPPV there are different liberatory maneuvers. In case 2, diagnosis of posterior BPPV can only be confirmed via direction, latency, and duration of the patient’s nystagmus, which allowed for the administration of appropriate treatment.
Case 3. For this case, it indicates the importance of medical history to reaching an accurate diagnosis. MD and VM, the 2 most common diseases of episodic vestibular syndrome, might share similar physical signs, such as horizontal HSN. However, accompanying symptoms might offer insights into differential diagnosis, for example, ear muffling with fluctuating hearing loss suggests MD, while either headache or the complication of photophobia and phonophobia indicates VM.
Although the proposed method has achieved promising results in dizziness/vertigo disease diagnosis, there are several limitations in this study. First, its retrospective, single-center design may inherently restrict the model’s generalizability. In the future, we plan to establish multicenter collaborations to conduct external validation of our model in a larger cohort. Moreover, our current work lacks integration with domain knowledge in medicine. In future work, we intend to construct a specialized medical knowledge graph, leveraging structured medical expertise to enhance the representations of medical history and physical signs. More broadly, medical artificial intelligence faces some ethical challenges. First, regarding privacy and security, model training relies on large-scale datasets that contain sensitive patient information and health records. Unauthorized disclosure or misuse of such data could compromise individual privacy and pose significant risks. Second, regarding the reliability of the results, although the model achieves high accuracy, the decision-making process by which it generates its final outputs remains opaque and unintuitive, which may engender patient concern and distrust. Additionally, biases embedded in data collection or algorithmic design may lead to unfair outcomes for specific populations, potentially leading to erroneous decisions and even undermining social justice and stability.
Case Studies of Employing Medical History and Physical Signs for Predicting Disease Diagnosis.
Abbreviations: BPPV, benign paroxysmal positional vertigo; GBDT-mh, gradient boosting decision tree solely based on medical history; GBDT-ps, gradient boosting decision tree solely based on physical signs.
Conclusion
In this article, we propose a machine learning model to realize accurate diagnosis of dizziness/vertigo disease, thereby providing auxiliary diagnosis. The proposed model collaboratively exploits the information from both the original medical history and physical signs, as well as the potential commonality information between them. The experimental results indicated that our model achieved excellent performance and also demonstrated certain robustness. It means that the developed model can relieve the burden of diagnosis for clinicians to a certain extent.
Supplemental Material
sj-docx-1-ohn-10.1177_19160216251375034 – Supplemental material for Machine Learning-Based Dizziness/Vertigo Disease Diagnosis by Combining Medical History and Signs
Supplemental material, sj-docx-1-ohn-10.1177_19160216251375034 for Machine Learning-Based Dizziness/Vertigo Disease Diagnosis by Combining Medical History and Signs by Yiwen Zhao, Xumeng Tian, Haiyan Wu, Muhao Xu, Ruizhe Yang, Jinlin Xiao and Zhenfeng Zhu in Journal of Otolaryngology - Head & Neck Surgery
Footnotes
Author Contributions
Y.Z. contributed to software, investigation, methodology, writing—original draft. X.T. contributed to investigation, methodology, visualization, writing—original draft. H.W. contributed to conceptualization, data curation, project administration, writing—review and editing. M.X. contributed to formal analysis, validation. R.Y. contributed to writing—review and editing. J.X. contributed to software, visualization. Z.Z. contributed to supervision, formal analysis, writing—review and editing.
Data Availability Statement
The dataset used during the current study are available from the corresponding author on reasonable request.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was in part by National High Level Hospital Clinical Research Funding (2022-PUMCH-C-041), Beijing Natural Science Foundation (7222313), and Non-profit Central Research Institute Fund of Chinese Academy of Medical Sciences (2023-RW320-03).
Ethical Considerations
This study was exempted from the Institutional Review Board review by the Medical Ethics Committee of Peking Union Medical College Hospital due to the retrospective design of the study (No I-22PJ211).
Consent for Publication
Not applicable.
Supplemental Material
Additional supporting information is available in the online version of the article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
