Modeling variation of clinical team processes with multiple sequence alignment

Abstract

Our objective was to model process variation of Emergency Medical Service teams responding to simulated pediatric emergencies and determine if sequence alignment distinguishes performance quality. We performed a retrospective process analysis by watching and coding activities in videos from standardized simulations of 42 Emergency Medical Service teams. Teams were classified into high- or low-performing groups based on the Clinical Teamwork Scale™. Activities were coded according to resuscitation tasks, performer, and times. We used ClustalG to align task sequences within and between groups, and measured similarity. Teams within and between performance levels had an average sequence similarity of 52 ± 7% and 50 ± 7%. Teams performed clinically appropriate tasks that varied in prioritization, for example, performing compressions or connecting the EKG monitor early. There was no statistical difference in gross similarity between groups but specific differences in prioritization may have had clinically meaningful implications. Alignment could improve by accounting for task duration and concurrency.

Keywords

Clinical practice patterns/guidelines/resource use/evidence-based practice observational data/quasi-experiments ambulatory/outpatient care quality of care/patient safety (measurement)

Introduction

Effective teamwork is considered important for managing and preventing adverse events that could harm patients (Manser, 2009; Schmutz and Manser, 2013). An important prerequisite for improving teamwork is being able to define and measure it. Teamwork is a multifaceted concept that includes technical performance and non-technical behaviors (Salas et al., 2005; Xyrichis and Ream, 2008). Technical performance refers to the physical task work that is performed to achieve a shared goal, such as cardiopulmonary resuscitation/chest compressions (CPR) and ventilation in responding to cardiac arrest. Non-technical behaviors include social interactions, such as leadership, closed-loop communication, performance monitoring, and orientation. The standard for evaluating teamwork has been to rate a combination of technical and non-technical behaviors with respect to performance (Dietz et al., 2014; Mishra et al., 2009; Undre et al., 2007). This approach allows experts to assign numerical values to observed behaviors for comparison. The drawback, however, is that it does not provide insight into how teams’ interactive processes lead to particular outcomes (Hickey, 2012).

Sequence alignment (SA) has been recommended as a method for studying team dynamics (Herndon and Lewis, 2015). SA is a visual analytical technique that lines up matching symbols across ordered lists to assess regions of similarity. It has predominantly been used in the biological sciences to measure the similarity between nucleic acid and amino acid sequences, identify functional conservation, and infer evolutionary relationships between species (Edgar, 2004; Felsenstein, 1988; Notredame et al., 2000; Sankoff and Kruskal, 1983; Thompson et al., 1994). Researchers in the social sciences have adapted this technique to describe similarity of activity performed by individuals, such as the sequence of chores performed in daily routine (Wilson et al., 1999; Wongchavalidkul and Piantanakulchai, 2015), order of locations visited by tourists (Shoval and Isaacson, 2007), and tasks in business processes (Bose and van der Aalst, 2010). SA could be applied in a similar manner to explore patterns of team activity that are associated with performance.

The goal of this study was to explore the suitability of using SA to model process variation and distinguish performance levels in EMS teams responding to simulations of pediatric trauma patients. Ambulance and Fire & Rescue teams are trained to follow highly structured, practice-based protocols (Ralston, 2006). While these protocols provide guidance on treating patients with different signs and symptoms, they leave task management to the discretion of the EMS providers. The motivation for using SA is that it would provide an objective framework for describing similarities and dissimilarities in activity sequences between teams for a given situation.

Materials

We analyzed 42 videos of EMS providers responding to a pediatric simulation. Teams had members from public fire departments and private transport agencies in a major metropolitan city. They reflected typical response teams, where there is a non-tiered response of Advanced Life Support from both public and private agencies. Fire crews had 3–4 members and transport crews had 2 members on average. These professionals had training that ranged from Emergency Medical Technicians (EMT) to paramedics, and all teams had at least one paramedic.

Simulations were conducted in situ using high-fidelity patient simulators, scene design, and professional actors playing parents and bystanders. All crews responded to the same simulated clinical case, in which a 6-month-old was unconscious and unresponsive after “falling” from a couch. Vital signs were set to indicate elevated intracranial pressure (ICP), consistent with “shaken baby syndrome.”

Clinical experts rated performance using the Clinical Teamwork Scale (CTS^™). The CTS is a validated instrument that evaluates team skills along five dimensions: communication, situational awareness, decision-making, role responsibility, and patient friendliness (Guise et al., 2008). Clinical experts who use this instrument do not explicitly evaluate task sequence; they rate team behaviors along the five dimensions and assign an overall score, ranging from 0 (unacceptable) to 10 (perfect). Team behaviors are distinct from, but may contribute to, process variation.

Methods

We used ClustalG (Wilson et al., 1999) to align multiple activity sequences between groups of EMS teams. Ten (10) teams were labeled as high-performing (CTS ⩾8) and another 10 as low performing (CTS ⩽4). These groupings were established to evaluate if alignment techniques could distinguish teams based on technical performance. Activity sequences were created by coding patient-centric tasks that reflected EMS teamwork.

Coding teamwork

SA involves lining up matching symbols across ordered lists. In our analysis of simulation profiles, symbols correspond to resuscitation tasks performed and the lists contained the set of tasks performed by EMS teams. The coding framework, in Table 1 is based on tasks from pediatric resuscitation guidelines (Ralston, 2006) and feedback from clinical experts.

Table 1.

Patient-centric tasks.

Task	Code
First arrival	Fst
Second arrival	Snd
Expose	Exp
Check physical status	Phy
Check mental status	Men
Check breathing	Bre
Check pulse	Pul
Measure length	Len
Attach pulse oximeter	Pox
Attach end-tidal CO2 monitor	Ent
Attach EKG (pads or leads)	Ekg
Attach blood pressure cuff	Bpc
Maintain cervical spine	Cer
Intubate	Int
Ventilate with bag valve mask	Bvm
Cardiopulmonary resuscitation	Cpr
Establish an IO/IV line	Iov
Administer drugs	Dru
Transport the patient	Tra

One researcher coded tasks, performers, and times. A second researcher independently reviewed a sample to coding reliability. Agreement was defined as two codes specifying the same task, performer, and times (±2 seconds). The Jaccard coefficient was used to measure raw agreement as the intersection of agreed codes divided by the union of codes. Physicians board certified in pediatric emergency medicine provided the gold standard of expected tasks and times (mm: ss): check responsiveness at 0:00, check breathing at 0:10, check pulse at 0:15, start ventilation at 0:40, attach monitors at 1:30, and obtain intraosseous or intravenous (IO/IV) access at 2:30.

SA

The alignment, in Figure 1, illustrates how care processes can vary and be similar between teams. The matching symbols for checking pulse and breaths (Pul-Pul and Bre-Bre) suggest that teams perform these tasks in the same order. The mismatching symbols performing CPR or ventilation with bag valve mask (BVM) [Cpr-Bvm] represent a substitution and suggest that the teams performed different but comparable tasks. The symbol for establishing an intraosseous (IO) or intravenous (IV) route (Iov) is matched to a gap. This alignment is an indel, or insertion/deletion. It suggests that either team 2 inserted an Iov task or team 1 deleted an Iov task in their respective process.

Figure 1.

The process of aligning EMS tasks from observation.

We used ClustalG to perform multiple alignments on the activity sequences. Multiple alignments occur in three steps: (1) compute distance matrix for similarity between all sequences pairs, (2) construct guide tree from the distance matrix, a hierarchical data structure that groups sequences by similarity, and (3) progressively align sequences according to the guide tree. Wilson et al. (2005) created ClustalG to align generic activity sequences in the social sciences. It works on user-defined symbols in addition to symbols that represent nucleic and amino acids. In the literature, guide trees have been used to classify types of tourists (Shoval and Isaacson, 2007) and the multiple alignments to highlight conserved and missing activities (Bose and van der Aalst, 2010).

We analyzed the guide tree clusters to see if they could classify different care delivery strategies. This involved visually inspecting the guide trees and using internal validation metrics from clValid (Brock et al., 2011) to identify teams with similar patterns of activity. After aligning the sequences within and between performance levels, we extracted and compared conserved tasks, or tasks that are aligned across ⩾50% of the teams. These tasks describe the general protocol teams followed and differences based on levels of teamwork.

Results

Coding teamwork

Inter-observer raw coding agreement was 66% (62/94 codes) across a sample of simulations. Of the 32 disagreements, 11 corresponded to minor timing differences that had no effect on the order of tasks or subsequent alignment. Of the 32 disagreements, 5 occurred due to differences in granularity, primarily for CPR compressions and BVM ventilations. For example, a paramedic may stabilize the cervical spine between BVM ventilations. This activity could be interpreted as one contiguous application of BVM or two applications of BVM, interrupted. In all, 16 disagreements occurred due to insertions and deletions. Some tasks, such as pulse and breath checks, were difficult to detect because they were occluded from view or occurred implicitly through visual observation. Considering only those disagreements that could affect the sequence alignment, the raw agreement is 78% (73/94 codes). We considered this an acceptable level of agreement for subsequent analysis.

The 42 simulations had a median of 29 ± 3 activities, with a minimum of 16 and a maximum of 51. Low-performing teams had a median of 28 ± 5 activities and the high-performing teams had a median of 27 ± 3 activities. The activities were not normally distributed and there was no significant difference between performance levels according to the Mann–Whitney U test.

Table 2 shows the frequency of activities in all teams, and the low-performing and high-performing groups. Teams performed the same tasks with the same frequency, with slight differences. Low-performing teams made more attempts, attaching the end-tidal CO2 monitor and applying CPR. High-performing teams exposed the child for physical assessment and maintained cervical spine more frequently. These differences were within one deviation of the other across performance groups. In other words, performance could not be attributed to task frequencies alone.

Table 2.

The median ± median average deviation of task frequencies for all teams, low-performing (LP) teams, and high-performing (HP) teams.

Activity	Freq all teams	Freq LP teams	Freq HP teams
First arrival	1 ± 0	1 ± 0	1 ± 0
Second arrival	1 ± 0	1 ± 0	1 ± 0
Expose	1 ± 0	1 ± 0	2 ± 1
Check physical status	1 ± 0	1 ± 0	1 ± 0
Check mental status	1 ± 0	1 ± 0	1 ± 0
Check breathing	3 ± 1	3 ± 1	3 ± 1
Check pulse	4 ± 1	3 ± 1	3 ± 1
Measure length	1 ± 0	1 ± 0	1 ± 0
Attach pulse oximeter	1 ± 0	1 ± 0	1 ± 0
Attach end-tidal CO2 monitor	1 ± 0	2 ± 0	1 ± 0
Attach EKG (pads or leads)	1 ± 0	1 ± 0	1 ± 0
Attach blood pressure cuff	1 ± 0	1 ± 0	1 ± 0
Maintain cervical spine	1 ± 0	1 ± 0	2 ± 1
Intubate	1 ± 0	1 ± 0	1 ± 0
Ventilate with bag valve mask (BVM)	4 ± 1	4 ± 1	4 ± 2
Perform cardiopulmonary resuscitation (CPR)	4 ± 2	4 ± 2	2 ± 0
Establish an IO/IV line	1 ± 0	1 ± 0	1 ± 0
Administer drugs	1 ± 0	1 ± 0	1 ± 0
Transport the patient	2 ± 1	1 ± 0	1 ± 0

Note. Bold values highlight potential variances in care according by performance level.

SA

Task sequences were highly varied. Intragroup similarity for 10 high- and 10 low-performing teams were 52 ± 7%. Intergroup similarity scores was 50 ± 7%. There were no meaningful clusters within performance level according to the internal validation metrics: connectivity, Dunn Index, and silhouette widths. This means that the teams were not performing a distinguishable set of strategies in treating the simulated pediatric patient. These results suggest that team processes varied regardless of performance and that major differences could be attributed to a few key tasks.

Figure 2 presents a three-part excerpt of the multiple alignments between low- and high-performing teams. Panel 1 shows the raw alignment between task sequences. Panel 2 shows that many tasks, highlighted in orange, varied across teams. Panel 3 shows that some tasks, highlighted in green, are aligned across ⩾50% of the teams. These aligned tasks appear to have a conserved order and represent an underlying protocol that teams try to follow. We used these conserved tasks as the basis for comparing processes in the high- and low-performing teams.

Figure 2.

A three-panel excerpt showing (left) the raw alignment, (middle) varied tasks, and (right) conserved tasks across low- and high-performing teams.

Figure 3 describes tasks conserved in ⩾50% of teams, times they were performed and occurrence with respect to the gold standard. Low-performing teams set up the EKG monitor, exposed the patient, and checked breathing earlier than high-performing teams. On average, low-performing teams started CPR before BVM, with BVM starting much later in some cases. These tasks occurred in opposite order according to the recommended pediatric care guidelines. High-performing teams applied BVM before CPR, established an IO/IV line, and transported the patient, once stabilized. Half of the low-performing teams, in contrast, transported the patient early in the simulation. They had more conserved tasks and most tasks were performed earlier than the low-performing teams.

Figure 3.

Simplified version of multiple alignment showing conserved activities and average time at which they occurred.

Discussion

We explored the suitability of using alignment techniques to distinguish the levels of performance between EMS teams responding to pediatric trauma simulations. We did so by (1) observing and coding the treatment they provide, (2) mapping the codes onto a sequence, and (3) aligning the sequences across teams. The teams perform approximately the same number of tasks and teamwork scores were not associated with the observed process variation. Gross differences were not apparent, but SA helped identify the prioritization of particular tasks that were clinically meaningful.

For example, chest compressions are emphasized in resuscitation for adults, but ventilation is given priority over compressions in children. The multiple alignments highlighted tasks that appeared to be conserved across teams in terms of task sequence order. Both high- and low-performing teams checked pulse and breathing before performing cycles of CPR and BVM ventilation. However, high-performing teams performed BVM earlier, perhaps due to correctly recognizing and treating the underlying cause of increased ICP due to trauma or understanding that ventilation is the first response to pediatric arrests. The low-performing teams focused on other tasks such as setting up the EKG monitor for preliminary assessment and delayed BVM ventilation. This suggests that the low-performing teams may have been following adult guidelines instead of pediatric guidelines.

We identified several limitations in using SA to operationalize teamwork and to associate it with performance. First, we observed substantial process variation, which added noise to the alignments. This could be attributed to the degrees-of-freedom by which teams can observe symptoms, diagnose problems, and treat underlying causes. For example, any combination of team members can check pulse, breathing, physical status, mental status, or attach equipment to track vitals. Once an anomalous reading is observed, they can then choose to treat, medicate, or transport. These systems of activity have a certain tolerance for variation in behavior, but can be sensitive to isolated events. For example, pulse checks occur frequently and dominate the alignment, but delayed ventilation can cause the patient to deteriorate and require more care to stabilize. SA was not developed to analyze complex in which tasks can be performed concurrently over time.

A second limitation is that not all aspects of teamwork are encoded in the task sequence. We identified pairs of teams that performed a similar sequence of tasks but had different levels of performance on the CTS. In one case, Teams A and B had a similarity of 71% but CTS of 1 (poor) and 5 (fair), respectively. Team A, rated as having poor teamwork displayed little communication, whispered, and had BVM errors. Team B, rated as having fair teamwork had adequate communication, but administered a 10-fold overdose of epinephrine. In another example, Teams C and D had 53% similarity with CTS scores of 8 (good) and 4 (fair) respectively. Team C started BVM and ETCO2 early, while team D used an oxygen mask instead of BVM and intubated without attaching the pulse oximeter, which is used to monitor oxygen intake. In these cases, indicators of performance depended on communication and task quality, which were not encoded in the task symbols. Future research in comparing team processes could benefit by encoding errors and quality of performance.

In healthcare, there is an underlying theory that compliance with practice-based guidelines limits process variation and risk of patient safety events (Sutton et al., 2014). Compliance can be difficult to evaluate as the diversity of teams allows for work to be carried out in many different ways. SA is a promising method that can be used explicitly and systematically compare clinical processes. However, there are some limitations as the underlying algorithms are not calibrated for human or systems-based activity. More work is needed to adapt it to the analysis of healthcare activities.

Conclusion

SA is a promising tool for describing variation in team behavior. The alignment revealed distinct patterns of activity that could be used to explain the difference in performance levels across teams. We find that it can be used to identify conserved tasks and points of deviation that could explain the difference in performance across teams. Furthermore, it can be used to develop more accurate process models based on actual data. However, SA requires improvements to account for the temporality, concurrency, and quality of team activities. Once refined, the SA method could provide a more objective approach toward studying complex activity in clinical systems.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Institutes of Health (R01 HD062478) and the National Library of Medicine (T15M007088).

ORCID iD

Nathan J Bahr

Author biographies

Nathan J Bahr is a postdoctoral fellow of the Department of Medical Informatics and Clinical Epidemiology at OHSU. His research focus is on adapting informatics tools to understand and improve care in healthcare systems.

S Herzberg is an MD, PhD student at Vanderbilt University School of Medicine. Her research focuses on Medical Informatics and Public Health.

W Lambert is an associate professor of Epidemiology and Environmental Systems & Human Health in the OHSU-PSU School of Public Health where he teaches epidemiologic research methods. His health services research has included the evaluation of intervention programs in clinical settings, communities, and public health agencies.

M Hansen is an assistant professor of the Departments of Emergency Medicine and Pediatrics at Oregon Health & Science University and he is board certification in both emergency medicine and pediatric emergency medicine. His primary research is in improving pediatric outcomes during high-risk clinical scenarios, specifically in the EMS setting.

JJ McNulty is the Director of Operations for the AA. Center for Advancement of Resuscitation Education at Oregon Health & Science University. He designs and conducts emergency simulations for training in high risk situations.

A Cohen is a professor of the Department of Medical Informatics and Clinical Epidemiology at Oregon Health & Science University. His research focuses on designing and applying text and data mining algorithms within the clinical and biomedical informatics problem domains. He has an extensive publication history demonstrating the success of these methods in such diverse areas as systematic literature review, electronic medical record data extraction, cohort discovery, proteomics, and microRNA analysis.

PN Gorman is professor of the Department of Medical Informatics and Clinical Epidemiology at Oregon Health & Science University is board certified in Internal Medicine, is a Fellow of the American College of Physicians, and a Fellow of the American College of Medical Informatics. He presently serves as Thread Director for Health Systems Sciences in the School of Medicine at OHSU, implementing a major new thread concerned with systems thinking and systems based practice in the OHSU YourMD curriculum. He also serves as assistant dean for Rural Medical Education, working with the Campus for Rural Health and others to expand and improve OHSU’s rural medical education programs to help build Oregon’s rural physician workforce.

JM Guise is professor of Obstetrics & Gynecology, Emergency Medicine, and Medical Informatics and Clinical Epidemiology in the School of Medicine at Oregon Health & Science University and in the OHSU-PSU School of Public Health. She is PI of the NIH-funded study (R01HD062478) that conducted the simulations used for this study and is mentor to Dr. Bahr in his AHRQ-funded F32.

References

Bose

RJC

van der Aalst

(2010) Trace alignment in process mining: Opportunities for process diagnostics. BPM 6336: 227–242. Available at: http://link.springer.com/content/pdf/10.1007/978-3-642-15618-2.pdf#page=239

Brock

Pihur

Datta

et al . (2011) clValid, an R package for cluster validation. Journal of Statistical Software. Available at: http://cran.us.r-project.org/web/packages/clValid/vignettes/clValid.pdf

Dietz

Pronovost

Benson

et al . (2014) A systematic review of behavioural marker systems in healthcare: What do we know about their attributes, validity and application? BMJ Quality & Safety, 23(12): 1031–1039.

Edgar

(2004) MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research 32(5): 1792–1797.

Felsenstein

(1988) Phylogenies from molecular sequences: Inference and reliability. Annual Review of Genetics 22(1): 521–565.

Guise

J-M

Deering

Kanki

et al . (2008) Validation of a tool to measure and promote clinical teamwork. Simulation in Healthcare 3(4): 217–223.

Herndon

Lewis

(2015) Applying sequence methods to the study of team temporal dynamics. Organizational Psychology Review 5(4): 318–332.

Hickey

(2012) Evaluating health care teams. In: Hickey

Brosnan

(eds) Evaluation of Health Care Quality in Advanced Practice Nursing. New York: Springer, pp. 177–208.

Manser

(2009) Teamwork and patient safety in dynamic domains of healthcare: A review of the literature. Acta Anaesthesiologica Scandinavica 53(2): 143–151.

10.

Mishra

Catchpole

McCulloch

(2009) The Oxford NOTECHS System: Reliability and validity of a tool for measuring teamwork behaviour in the operating theatre. BMJ Quality & Safety 18(2): 104–108.

11.

Notredame

Higgins

Heringa

(2000) T-coffee: A novel method for fast and accurate multiple sequence alignment11Edited by J. Thornton. Journal of Molecular Biology 302(1): 205–217.

12.

Ralston

(2006) Pediatric Advanced Life Support, vol. 1. Dallas, TX: American Heart Association.

13.

Salas

Sims

Burke

(2005) Is there a “big five” in teamwork? Small Group Research 36(5): 555–599.

14.

Sankoff

Kruskal

(1983) Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison (eds Sankoff

Kruskal

). Reading: Addison-Wesley Publication. Available at: http://adsabs.harvard.edu/abs/1983twse.book…..S

15.

Schmutz

Manser

(2013) Do team processes really have an effect on clinical performance? A systematic literature review. British Journal of Anaesthesia 110(4): 529–544.

16.

Shoval

Isaacson

(2007) Sequence alignment as a method for human activity analysis in space and time. Annals of the Association of American Geographers 97(2): 282–297.

17.

Sutton

French

Niles

et al . (2014) 2010 American Heart Association recommended compression depths during pediatric in-hospital resuscitations are associated with survival. Resuscitation 85(9): 1179–1184.

18.

Thompson

Higgins

Gibson

(1994) CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research 22(22): 4673–4680.

19.

Undre

Sevdalis

Healey

et al . (2007) Observational Teamwork Assessment for Surgery (OTAS): Refinement and application in urological surgery. World Journal of Surgery 31(7): 1373–1381.

20.

Wilson

Harvey

Thompson

(1999) ClustalG: Software for analysis of activities and sequential events. In: IATUR conference proceedings. Available at: https://www.researchgate.net/profile/Andrew_Harvey6/publication/228816916_ClustalG_Software_for_Analysis_of_Activities_and_Sequential_Events/links/02e7e52dd4fec71678000000/ClustalG-Software-for-Analysis-of-Activities-and-Sequential-Events.pdf

21.

Wilson

Harvey

Thompson

(2005) ClustalG: Software for analysis of activities and sequential events. In: Paper presented at the workshop on sequence alignment methods, Halifax, NS, Canada, October. 2005

22.

Wongchavalidkul

Piantanakulchai

(2015) The integration of classification tree and Sequence Alignment Method for exploring groups of population based on daily time use data. Applied Soft Computing 34: 106–119.

23.

Xyrichis

Ream

(2008) Teamwork: A concept analysis. Journal of Advanced Nursing 61(2): 232–241.