Abstract
Background
Telemedicine support virtual consultations and evaluations in hand surgery for patients in remote areas during the COVID-19 era. However, traditional physical examination is challenging in telemedicine and it is inconvenient to manually measure the hand range of motion (ROM) from images or videos. Here, we propose an automatic method using the hand pose estimation technique, aiming to measure the hand ROM from smartphone images.
Methods
Twenty-eight healthy volunteers participated in the study. An eight-hand gestures measurement protocol and the Google MediaPipe Hands were used to analyze images and calculate the ROM automatically. Manual goniometry was also performed according to the guideline of the American Medical Association. The correlation between the automatic and manual methods was analyzed by the intraclass correlation coefficient and Pearson correlation coefficient. The clinical acceptance was testified using Bland-Altman plots.
Results
A total of 32 parameters of each hand were measured by both methods, and 1792 measurement results were compared. The mean difference between automatic and manual methods is −2.21 ± 9.29° in the angle measurement and 0.48 ± 0.48 cm in the distance measurement. The intraclass correlation coefficient of 75% of parameters was higher than 0.75, the Pearson correlation coefficient of 84% of parameters was over 0.6, and 40.6% of parameters reached well-accepted clinical agreements.
Conclusions
The proposed method provides a helpful protocol for automatic hand ROM measurement based on smartphone images and the MediaPipe Hands pose estimation technique. The automatic measurement is acceptable and comparable with existing methods, showing a possible application in the telemedicine examination of hand surgery.
Introduction
The range of motion (ROM) is fundamental in the physical examination and the follow-ups in clinical practice.1,2 For hand surgeons and therapists, measurement of hand ROM is an objective and quantified way to assess the retained or improved motor function of digits, and provide evidence for estimation of the outcome or the progress of surgery or rehabilitation therapy. 3 Traditionally, manual goniometry has been the widely accepted method for ROM measuring in clinical practice, 4 which could be traced back to the early 1900s. 5 However, the dependence on experienced raters, face-to-face measurement, and physical contact brings difficulties, especially in COVID-19 and beyond.
At present, smartphones have become valuable and acceptable tools in telemedicine. 6 It allows patients to interact and consult with experts more easily through images or videos, significantly reducing the travel burden for patients in remote or pandemic lockdown areas. Additionally, in hand and upper extremity surgery, previous studies have reported the utility and benefit of telemedicine for diagnosis and postoperative follow-up.7,8 Patients also show high interest and preferences for virtual visits. 9 However, it has been concerned that virtual consultation only permits primary interaction between clinicians and patients without physical contact. It hence highly restricts the application of physical examination in traditional ways. An available solution is visually estimating the ROM by giving instructions and asking the patient to demonstrate their hand movement. Despite visual assessments showing excellent agreement with conventional measurements for the elbow and wrist joints,10,11 how to measure the complex movement of digit joints via images or videos remains difficult for researchers.12–14 The other solution is photographic goniometry, which manually measures the joint angles from images. This method has been investigated in healthy volunteers and patients with fractures of the phalanges, tendon rupture repair, and Dupuytren disease with good agreements.15–18 However, both methods are still subjective and time-consuming to obtain the data. A more convenient and automatic technique would be valuable in telemedicine examination, particularly for analyzing the function of digits in hand surgery.
With continuous development in computer vision, the hand pose estimation technique offers a contactless solution to predict the articulated joint locations from images or videos for hand motion analysis.19–21 Commonly, hand pose estimation is based on images with depth information or just ordinary RGB images. The former requires depth camera devices, like Microsoft Kinect, 22 Intel RealSense, 23 and Leap Motion, 18 and have been evaluated in home care and telerehabilitation of the upper limb.24–26 However, these devices may not be available for every patient. The other method could extract key points from a single image via general cameras or smartphones. The MediaPipe Hands is a deep learning-based hand-tracking solution with only RGB input, and it can predict 21 key points coordinates of a hand. 27 It is one of the state-of-the-art algorithms trained on some of the largest and most diverse datasets from Google. 28 Like other pose estimation methods, the MediaPipe Hands are popular in gesture recognition and control for human–computer interaction.27–29 However, it remains unknown whether it would be helpful in measuring hand ROM for medical purposes.
We aim to propose an automatic hand ROM measurement method based on MediaPipe Hands and test whether it would achieve an acceptable agreement for telemedicine applications via a simple smartphone device.
Materials and methods
According to the pilot study, we calculated the sample size of 26:26 for an equivalent test (
In this study, we followed the guides for hand impairment evaluation set by Swanson and the American Medical Association (AMA),30–32 which suggest the required ROM parameters. According to the references, the following movement should be included: (1) The extension/flexion of the metacarpophalangeal (MCP) joints, proximal interphalangeal (PIP) joints and distal interphalangeal (DIP) joints of the index, middle, ring and little finger on the sagittal plane, as well as the MCP joint and interphalangeal (IP) joint of the thumb on the coronal plane; (2) The radial-abduction/ radial-adduction of the thumb carpometacarpal (CMC) joint on the coronal plane. (3) The opposition/adduction movement of the thumb. In total, 32 parameters for ROM measurement were required to be recorded. The measurement parameters and methods are shown in Table 1.
The 32 ROM parameters and the measurement of MediaPipe and manual methods.
The key point number of MediaPipe measurement is demonstrated in Figure 2B.
For automatic measurement, we proposed a standard protocol based on eight hand gestures as instructions to guide participants in demonstrating the required ROM movements independently (Figure 1). We used a smartphone (iPhone XS Max, Apple Inc., image resolution: 1980*1080p) to take the images and mimic the setting of telemedicine visits. All images were taken from the palmar view for better key points detection and prediction. For manual measurement, we followed the AMA guideline and measured the ROM with a universal goniometer. 32 Measurement was performed by two hand surgeons in our department. For both methods, the zero position was defined as the neutral position of a fully extended joint. 33 All data were expressed as positive values, except hyperextension was recorded in negative values.

The eight-hand-gesture automatic measurement protocol with examples.
A simple experimental setup was prepared for automatic measurement (Figure 2A). The participant was asked to sit by the test table and place a forearm and wrist on a holding fixture (16.5*21.5*17.5cm) to reduce fatigue and shaking of the upper limb. The transverse crease of the wrist was aligned to the edge of the fixture to standardize the hand's position. The smartphone was installed 40 cm above the fixture, and the palm was kept in the center of the image. The forearm was kept in a supination position, and the palm was parallel to the table. A laptop was set in front of the participant to provide visual instructions of the targeted gesture and display the performing gesture in real-time.

The illustration of the automatic measurement. (A) The setup of the automatic measurement; (B) The 21 key points of the MediaPipe Hands model. The red arrow shows the measuring scale for distance calculation.
Before the experiment, participants were requested to wash and clean their hands. Jewellery, watches, and clothing on the wrist or hand were removed during the investigation. Then, participants were asked to perform the required gestures in order and hold the testing gesture for about 3 seconds. A hand surgeon took three images continuously for each gesture once the participant indicated the gesture had been accomplished thoroughly. A total of 48 images (24 images per hand) of each participant were taken. Breaks were allowed if the participants felt their hands were uncomfortable. The time of image acquisition was recorded. There was a five-minute interval after image acquisition. Then another hand surgeon performed the manual goniometry and recorded the parameters. The time taken for manual measurement was also recorded.
The images were analyzed using a Python-based (Version 3.8) procedure. Briefly, we obtained the 3D coordinates of the 21 key points (Figure 2B) from each image using MediaPipe Hands (Version 0.8.9) and then calculated the parameters using the following equations:

The diagram of modified MCP joint angle calculation. The palm plane is determined by two MCP points and the wrist point. PIP’ is the projection point of PIP on the palm plane. θ represents the modified MCP joint angle.
Measurement results were expressed by the mean and standard deviation (Mean ± SD). The reliability of the automatic methods was analyzed in the following steps: (1) The deviation between the two measurement methods was assessed for each parameter. Then the 95% confidence interval (CI) was calculated and compared with the range of clinical acceptance. The acceptance for angle parameters was defined as ± 5° since it was acceptable in clinical practice.26,35 Acceptance for distance parameters was defined as ± 1 cm based on the impairment algorithm proposed by Swanson. 31 (2) The correlation of the two methods was assessed by intraclass correlation coefficient (ICC) and Pearson correlation coefficient. ICC value explanation was defined as follows: 0–0.20: unacceptable, 0.20–0.40: questionable, 0.41–0.60: good, 0.61–0.80 very good, 0.81–1.00: excellent. In the Pearson correlation coefficient, 1 indicates a total positive linear correlation, 0 means no linear correlation, and −1 shows a total negative linear correlation. (3) The clinical agreement of the two methods was assessed by Bland-Altman plots to visualize the data and compare the limits of agreement (LOA). 36 In this study, a well-acceptable clinical agreement was defined as the LOA was entirely within the range of clinical acceptance. As the key point coordinates were constant for the same image using MediaPipe Hands, the experiment for test–retest was unnecessary. Statistical analysis was performed by IBM SPSS v25 (IBM, Armonk, NY, USA).
Results
In the study, 56 hands of 28 volunteers were measured. The mean age of volunteers was 26.9 ± 2.9 years, the female-to-male ratio was 1:3, and the BMI was 23.2 ± 3.1 (Table 2). All participants were right-handed and completed both measurement methods. The mean completion time for image taking and manual measurement were 1.5 ± 0.4 min and 11.8 ± 1.0 min. All images were successfully detected and analyzed by MediaPipe Hands. There was no missing data for all 1792 (56 × 32) parameters.
Demographics of participants.
The results of the automatic and manual measurements are summarized in Table 3. The mean difference is −2.21 ± 9.29° for all angle parameters and 0.48 ± 0.48 cm for distance parameters, respectively. According to the 95% CI, 81.25% of the parameters had a deviation less than ± 5° or ± 1 cm, and both were considered acceptable. The extensions of all digits, the thumb flexion, and the thumb radial-adduction achieved the best results, with deviations of less than 2°. The maximum difference was found in the flexion of the index MCP joint (mean: −25.01°, 95% CI: −27.485 to −22.531). Similarly, differences were also evident in the flexion of the middle, ring, and little finger MCP joints (mean, 14.33°, 15.23°, 11.64°).
Results of two ROM measurement methods.
*The opposition and adduction are recorded in centimeters, and the rest parameters are recorded in degrees.
ROM: range of motion; SD: standard deviation; CI: confidence interval; _E: extension; _F: flexion; _ab: abduction; _ad: adduction.
In ICC and the Pearson correlation coefficient (Table 4), 75% of parameters (24/32) achieved excellent or very good correlation, except the MCP joint flexion of the four fingers, which was regarded as questionable correlation (ICC: 0.126, 0.157, 0.209, 0.195). 84% of parameters (27/32) resulted in a Pearson correlation coefficient over 0.6, showing a strong positive correlation.
Summary of the correlation between the two measurement methods.
Correlation is significant at the 0.01 level (two-tailed).
Correlation is significant at the 0.05 level (two-tailed).
ICC: intraclass correlation coefficient; _E: extension; _F: flexion; _ab: abduction; _ad: adduction.
The clinical agreement was analyzed using Bland–Altman plots (Figure 4). In general, 40.6% of parameters (13/32) achieved a well-acceptable clinical agreement when comparing the LOA and the clinical acceptance. Some parameters like thumb MCP flexion, index finger PIP extension, middle finger PIP extension, ring finger MCP and PIP extension, and little finger DIP extension nearly reached clinical acceptance. The rest 13 parameters were considered to have failed in the clinical agreement.

Bland–Altman plots for analyzing clinical agreement. There are 56 dots for every parameter. In each plot, the x-axis represents the mean and the y-axis represents the difference between automatic and manual methods. The bold black line represents the mean difference, the dotted lines represent the limit of agreement, and the light red area represents the range of clinical acceptance (±5° or ± 1 cm).
Discussion
Hand ROM measurement is a critical component in the examination of motor function, disease diagnosis, and impairment evaluation. 37 Unfortunately, acquiring the parameters in the telemedicine visit was inconvenient. Our goal was to provide a new method that uses smartphones and images to automatically obtain and calculate hand ROM, trying to alleviate the manual measurement burden and achieve acceptable clinical agreement.
This study provided an eight-gesture-based measurement protocol to guide users in taking the required images with a smartphone. Compared with other related works,26,38,39 the study offered a protocol with the comprehensive movement of the thumb in gestures 2–6 and separate digit flexion movement into gestures 7 and 8. In our study, we noticed a measurement improvement in DIP flexion that difference with the manual measurement was only from −1.75° to 0.08°. In the study of Nizamis et al., 38 all digits were flexed together (like gesture 8), and only MCP flexion of the index finger achieved satisfactory agreement. Pham et al. also used a “fist position” for measuring the flexion of fingers using a Creative Senz3D camera. 39 They found that DIP had a more noticeable difference and variance than the other joints. We believe the improvement was caused by gesture 7, in which the key points of the fingertip, DIP and PIP joints were not severely obstructed and led to a better coordinate prediction of MediaPipe Hands. Therefore, the angle of DIP joints could achieve a more reasonable result. However, the unsatisfied PIP and MCP flexion measurement had not been solved. Our contribution was providing a clinical guideline-based ROM measurement protocol for hand pose estimation and automatic ROM measurement. It contained 32 indispensable parameters for hand functional evaluation, which were also recommended in telemedicine examination. 12
The reliability of automatic methods was assessed by comparing the result with manual measurement. Although the quality of manual measurement relied on the rater's experience, it had been a cheap, simple, and well-accepted method in clinical practice. It was an essential benchmark for clinical acceptance and was more well-known among hand surgeons. In traditional goniometry, Ellis et al. reported an inter-rater variance of 7° to 9° and 0.7 to 0.9 cm between therapists, with an intra-rater variance of 4° to 5° and 0.5 to 0.6 cm for repeated measurement. 40 In manual photographic measurement, Georgeu et al. compared lateral image measurement of the ring or little fingers with standard goniometry in 20 patients. 41 In their study, the average difference was 1° (SD: 6°, 95% Cl: 0° to + 2°), and the 95% LOA was −11° to + 13°, which was beyond the range of clinical acceptance. Meals et al. found that photo-based measurements were reliable within and between observers, but only 27.3% (3/11) of photo-based measurements were in acceptable agreement (r≥0.70) with manual goniometry. 42 We found our results comparable to these measurements. In our study, even if we set a ± 5° as the level of clinical acceptance, 40.6% of parameters had strictly reached the clinical agreement, and another 18.8% of parameters had almost achieved the agreement. We have also noticed that the key point coordinates remain unchanged using MediaPipe Hands for the same image, which means our method has perfect repeatability. It also indicated that the automatic method could assist less experienced examiners in the challenging visual measurement of the hand, promote intra-rater reliability and facilitate the process of tele-examination.
There has been an increasing interest in hand pose estimation techniques for medical research and application. For example, Leap Motion has been used in hand rehabilitation assessments with 3D skeletal data from RGB-D images.43,44 Wang et al. developed a Leap Motion-based virtual reality system to improve the motor function of subacute stroke patients. 45 Butt et al. testified an automatic classification of Parkinson's disease by assessing the movement of the hand and forearm. 46 Martini et al. used OpenPose software to evaluate the Hand-To-Nose test for stroke patients. 21 Their works focused more attention on the kinematics feature of the upper limb, and the implementation was complicated. In the research field of automatic hand ROM measurement, the clinical agreement was unfavorable in related studies.38,47–49 The reason might be the lack of large datasets for hand pose estimation compared with face or body, 20 and the occlusion of the hand is still a challenging task for manual annotation. 50 Moreover, hand datasets for medical applications or clinical settings did not exist. Further development in the 3D annotation of the hand, more clinical datasets, and more robust algorithms would hopefully offer better key points prediction and improve the automatic ROM parameters calculation.
The proposed method has several advantages: It can reduce the workload for clinicians, and the measurement time was approximately reduced to one-sixth of traditional manual goniometry. Meanwhile, it requires no measuring tools such as tape, caliper, or goniometer, even no necessity of experienced raters. It is simple enough that all images were taken successfully in a short time without supervision. Our method has probably provided a promising solution as a constant, objective, and convenient measurement tool in telemedicine. It can evaluate whether a joint was extended or flexed, even if the digit flexion measurement has not reached clinical acceptance. By qualitatively analyzing the abnormal movement of the particular joint(s), our method is helpful for the primary evaluation or screening of diseases or injuries, such as the radial, ulnar, or median nerve injury,51–53 tendon rupture, or severity classification of stroke and Parkinson disease.
Our study has several limitations. Firstly, the new method was only testified in healthy and young volunteers who had no obvious ROM limitation. Similarly, whether MediaPipe Hands is sensitive enough to predict key points with limited ROM is uncertain to answer. Secondly, the images were taken in a relatively standardized environment rather than by the subjects themselves at home or other places. Further research should include more complex application environments. Also, the comparatively small number of participants might reduce the robustness and generalizability of the proposed method.
Conclusion
The measurement of hand range of motion is challenging in telemedicine examination. This study demonstrates a new thought for automatic measurement using RGB images with MediaPipe Hand pose estimation via a smartphone. A comprehensive and clinical-based protocol with eight hand gestures was provided to bridge the traditional manual and automatic measurements. Our method obtained good clinical agreements and is suitable for other hand pose estimation studies. It indicates excellent potential for telemedicine application with the benefits of convenience, acceptable reliability, and simple smartphone implementation.
Ethics approval and informed consent
Ethical approval to report this case was obtained from the institutional review board of The First Affiliated Hospital of Sun Yat-sen University (ID: [2021]387).
Verbal informed consent was obtained from the patient(s) for their anonymized information to be published in this article.
Footnotes
Acknowledgments
The authors thank Mr Bai Leng, Mr Zhenguo Lao, Ms. Jianwen Zheng, and Guangzhou HuanTek Co., Ltd for technical help, writing assistance, and general support.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Authorship confirmation
We have confirmed that all the authors meet the authorship criteria.
Author biographies
Fanbin Gu is a PhD student in Clinical Medicine. His research interest is artificial intelligence for hand surgery.
Jingyuan Fan is a PhD student in Clinical Medicine. His research interest is artificial intelligence for hand surgery.
Zhaoyang Wang is a PhD student in Clinical Medicine. His research interest is artificial intelligence for hand surgery.
Xiaolin Liu is a professor at the Department of Microsurgery, Orthopaedic Trauma and Hand Surgery, The First Affiliated Hospital, Sun Yat-sen University. His research interest is peripheral nerve repair and biofabrication of nerve substitutes.
Jiantao Yang is an associate professor at the Department of Microsurgery, Orthopaedic Trauma and Hand Surgery, The First Affiliated Hospital, Sun Yat-sen University. His research interest is peripheral nerve repair and biofabrication of nerve substitutes.
Qingtang Zhu is a professor at the Department of Microsurgery, Orthopaedic Trauma and Hand Surgery, The First Affiliated Hospital, Sun Yat-sen University. He majors in reconstructive microsurgery for extremity and is interested in management of severed extremity injury, peripheral nerve repair, biofabrication of nerve substitutes, and artificial intelligence for telemedicine.
