Abstract
Perioperative teaching and feedback of technical performance are essential during surgical training but are limited by competing demands on faculty time, resident work-hour restrictions, and desire for efficient operating room utilization. The increasing use of high-definition video microscopy and endoscopy in otolaryngology offers opportunities for trainees and faculty to evaluate performance outside the operating room but still requires faculty time. Our hypothesis is that automated motion tracking via video analysis offers a way forward to provide more consistent and objective feedback for surgical trainees. In this study, otolaryngology trainees at various levels were recorded performing a cortical mastoidectomy on cadaveric temporal bones using standard surgical instrumentation and high-definition video cameras coupled to an operating microscope. Videos were postprocessed to automatically track the tip of otologic dissection instruments. Data were analyzed for key metrics potentially applicable to the global rating scale used in the Accreditation Council for Graduate Medical Education’s Objective Structured Assessments of Technical Skills.
Evaluation of surgical resident performance has traditionally relied on informal and subjective perioperative feedback from attending surgeons. 1 Unfortunately, this method of assessment often occurs sporadically, especially in the context of increasing demands on faculty time and work-hour restrictions.2-5 There has been a greater focus on surgical simulation, retrospective video analyses, and other methods to provide feedback in a less resource-intensive setting.4,6,7 Motion tracking technology has shown promise and has been successfully used to follow surgical stroke motion and predict technical expertise.8-10 However, studies to date have required tagged instruments, custom simulators, or high-speed cameras.
Modern otolaryngology, with its widespread use of high-definition (HD) cameras integrated with endoscopes and microscopes, is well positioned to take advantage of video-based motion tracking. Otologic surgery, especially cortical mastoidectomy, represents an ideal use case, given a largely stable field of view and extensive use of a single instrument (otologic drill). The aim of this pilot study was to assess the utility of commonly used motion tracking software to assess surgical skills with standard dissection instruments and HD cameras coupled to a binocular microscope during cortical mastoidectomy.
Materials and Methods
Subjects were recorded performing cortical mastoidectomies on cadaveric fixed temporal bone specimens with standard HD cameras (Karl Storz, Tuttlingen, Germany) coupled to binocular microscopes (Olympus Corp, Tokyo, Japan) in the otolaryngology surgical training laboratory. Subjects included those at the novice level (medical students), midlevel (postgraduate year 2/3 residents), and advanced level (postgraduate year 5, neurotology fellow). In 3 cases, trainees were recorded performing a simple cortical mastoidectomy intraoperatively under the direct supervision of a senior attending neurotologist with a comparable HD camera integrated to the operative microscope (OH4; Leica, Wetzlar, Germany). All videos were recorded under 30 Hz and 720 p. Representative segments 20 to 30 seconds long were then selected where the otologic drill was active and remained within the field of view of the recording camera. Videos were postprocessed with ProAnalyst Motion Tracking Software (Xcitex, Woburn, Massachusetts) to automatically track the tip of the drill across the surgical field. User intervention was provided when it was clear that the tracking deviated from the tip of the drill for >3 frames. Tracking data were then exported into Matlab (2016a; MathWorks, Natick, Massachusetts) and analyzed for key metrics. Linear regression and analysis of variance with these statistics (cadaveric only) were performed with Microsoft Excel. This study was deemed exempt by our Massachusetts Eye and Ear Institutional Review Board (1040003-1, 1045566-1).
Results
Seven subjects performed mastoidectomies in cadaveric specimens with the motion tracking software, and 18 discrete video segments were analyzed ( Figure 1 , Video 1 ). Three additional trainees were recorded performing segments of mastoidectomies intraoperatively. The motion tracking adequately followed the tip of the otologic drill as it moved across the surgical field without any user intervention in the majority of frames (and in the case of the higher-quality intraoperative videos in >99.7% of frames).

Automated tracking of standard otologic instruments. The figure shows the automated tracking of a selected instrument (an otologic drill) as it moves across the surgical field of a cortical mastoidectomy performed in a cadaveric specimen. The green box represents the feature selection (tip of the drill), and red dots represent the location of the drill in each sequential frame.
The tracking data were then analyzed for key metrics potentially applicable to the global rating scale used in the Accreditation Council for Graduate Medical Education’s Objective Structured Assessments of Technical Skills (OSATS). Specifically, we aimed to use metrics that could help define the “time and motion” or “instrument handling” criteria of the OSATS ( Table 1 ). The metrics chosen included speed (mean ± SD: 12.1 ± 9.1 pixels/s), acceleration (11.1 ± 10.3 pixels/s2), total distance (7436.2 ± 4863.5 pixels), and total stoppage time (7.0 ± 4.9 seconds; Table 2 ). Total distance traveled, speed, and acceleration trended with level of training, although this did not achieve significance.
Comparison of Relevant Measures Obtained from Motion Tracking and the Global Rating Scale Used in the OSATS.
Abbreviation: OSATS, Objective Structured Assessments of Technical Skills.
Adapted from original description of OSATS global rating scale from Martin et al. 12
Mean Values from Metrics Extracted from Motion Tracking Software. a
Values are presented as mean ± SD.
Discussion
This pilot study explores the utility of using video-based motion tracking software to track instruments in otologic surgery post hoc and quantify operative skills of surgical trainees. We found that in the specific case of a cortical mastoidectomy, using standard dissection instruments and HD cameras coupled to an operative microscope was sufficient to enable automated tracking of an otologic drill as it moved across the surgical field. This is a significant finding, as using recording equipment that is analogous to what is already commonly found in otolaryngology operating theaters removes a crucial barrier to wider adoption of motion tracking technology in surgical training.6,8-10
It is important to note that this motion tracking was not fully automatic, requiring users to select their instrument of interest and presegment videos into 20- to 30-second clips, in which the drill burr remained within the field of view of the microscope. Moreover, especially fast movements of the instrument occasionally required the user to reorient the tracking cursor. These limitations may be overcome in the near future by incorporating artificial intelligence models, which have already shown promise in surgical instrument detection in simulated procedures. 11
We also attempted to extract key metrics from the tracking data potentially applicable to the “time and motion” or “instrument handling” sections of the OSATS scale for surgical trainees. 12 We based these in part on simplified versions of the metrics used by McGoldrick et al 8 in their study of motion tracking in microvascular surgery. Total distance traveled, speed, and acceleration trended with level of training, but this should be interpreted with caution given the relatively small size of our initial pilot cohort (eg, a single outlier resulted in midlevel trainees having higher speed/acceleration than advanced-level participants). Future work will focus on expanding our study to a larger well-controlled cohort and careful validation and contextualization of these measures by expert surgeons.
Author Contributions
Disclosures
Footnotes
Sponsorships or competing interests that may be relevant to content are disclosed at the end of this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
