Abstract
This study applies risk management strategies from commercial aviation to better understand when it is safe to send unarmed responders to 911/999 calls. By analyzing three years of community-generated call data from Seattle, Washington (727,423 calls across 356 types), this research assesses risk, or the “credible worst-case scenario” for each call type. Findings reveal nearly half (48.9%) of all call types could be safely addressed by diversified, non-police responders rather than the traditional “all-hazards” police approach. Observations that the all-hazards responder is often necessary to identify such opportunities, or what we term “Ratcliffe's Paradox,” are confirmed. This study indicates a systematic approach to risk management could enable such a diversified responses, potentially freeing up to 26% of police resources currently allocated to non-police calls. A Risk Managed Demand (RMD) could improve public safety outcomes, reduce the collateral harms of armed responses, and operationalize Alternative First Responder programs resulting in an optimal allocation of resources.
Introduction
The “demand for order in a civil society” (A. Silver, 1967) requires maintenance. Silver describes a common experience for anyone who has lived in a city over the last “two centuries” (now two and a half), as well as an “extensive and relevant journalistic and historical literature concerning reactions to urban crime, violence and riot” as de facto demand. The occurrence of crime clearly demands order, but Silver additionally underscores the threat posed to the “center” by “mobs and riots” formed of “dangerous classes” on the periphery. Naturally the result of income, wealth, and opportunity inequality in an evolving society, these political expressions necessitate the formation, application, and maintenance of a “bureaucratic police.” That society expresses support and upholds expectations of service for formal social control (i.e., police) is tacit endorsement of the demand for order (“the demand”).
The tension between safety and apparently desirable qualities for living (e.g., fairness, equity, social justice, etc.) complicates “the response.” 1 The murder of George Floyd, typical of reactions under the “Second Great Awakening” (Sherman, 2017), reinvigorated fundamental questions about the utility, the appropriate use and ultimately, the necessity of police. There is anecdotal evidence to suggest alternatives to police could be effective; however, an extensive and growing body of research suggests that implementation may be more complex (Lum et al., 2020, 2022; Neusteter et al., 2019, 2020; Ratcliffe, 2021). Although “opportunity” for a non-police response appears significant, particularly with the aid of hindsight, but what is the “opportunity cost?”
This study demonstrates the use of conventional risk/safety management principles to optimize the opportunity for a specialist response. We leverage an objective framework adapted from commercial aviation to imagine a “diversified response” (defined below) to emergency calls received by government (i.e., 911/999). This paper finds compelling evidence that non-police responders can address the myriad other, ultimately non-police, requests for service, and do so safely. In so doing we establish a framework for operationalized risk management supporting strategic and tactical decision-making.
With dwindling police resources and sustained public pressure to reduce the collateral harms associated with an enforcement-forward approach to public safety, this study finds a “Risk Managed Demand” (RMD) essential to modern policing. Under the proposed model, lessons from the study of industrial accidents (Perrow, 1984), “human factors” (Edwards, 1985; Perezgonzalez, 2009), and common risk and safety management principles from commercial aviation (FAA, 2010; ICAO, 2013; Müller et al., 2014) are translated to determine the safety and efficacy of a diversified response. RQ-1: Is it safe to send a diversified response? When? RQ-2: What mitigations (i.e., “safety policy”) are necessary to a diversified response? RQ-3: What is the prospective impact (i.e., opportunity) of a full-scale diversified response?
We find an evidence-basis indicating the opportunity for a diversified response can be realized safely, if managed properly.
This work suggests that well-formed “safety policy” (as defined below) would facilitate a safe, specialist response to nearly 50% of service requests (“call types”) currently handled by the Seattle Police Department (as all-hazards responders). In its mature state, a diversified emergency response system could return nearly 27% of police capacity, creating significant efficiency 2 as well as improved service. This analysis strongly indicates a systematic approach to risk management, and in so doing evolves our conceptual understanding of policing, with broader implications for public safety policy.
Literature Review
Alternatives to the “all-hazards responder” model (i.e., police) have been a common focus of reform under the Second Great Awakening but have yet to gain widespread acceptance. WACS, Durham HART, etc.) have attempted a more nuanced response to emergency calls for service (911/999). Comprised of “alternative first responders,” these programs seek to divert (before police response) or refer specialist response opportunities (e.g., Mental Health Professional, social worker, non-police “outreach” worker) to satisfy the request more completely.
Although significant strides have been made, most jurisdictions have progressed little beyond the pilot stage. Alternative first responder programs are limited by the same issues as conventional alternatives. Police “co-responder” programs (police and a specialist) and “differential police response” (Mcewen et al., 1986) have been contemplated or in development for nearly fifty years. While some technological gaps are evidence, more critically absent is a framework for risk related decision-making. Without some common agreed rules, standards, and concept of “the good,” progress toward a “diversified response” will continue to stall.
The Problem
Awareness of the efficiency (i.e., use of police for non-police service) is not new. Scholars like Reiss (1971) made note of the opportunity in observational studies conducted in the 1960s. Modern call data analysis confirms, a police responder is not always necessary to successfully and safety resolve a call for service (Amos & Eisenberg, 2023; Gilsinan, 1989; Horspool et al., 2016; Lum et al., 2020; Neusteter et al., 2020; Ratcliffe, 2021; University of Chicago Health Lab, 2022). There are, however, a number of barriers to implementation.
Ratcliffe’s (2021) examination of public health calls in Philadelphia represents the problem. While the opportunity for a specialist response (i.e., Mental Health Provider) to “persons with mental illness” is real and verifiable by observation, the information necessary to discriminate such opportunities, safely, is lacking at the “Public Safety Answering Point” (911 call center). An in-person assessment by an all-hazards responder is necessary to safety affect a specialist response, at present. Although the opportunity for a non-police response is apparent and substantial, paradoxically, it requires a police response to identify it. What we term: “Ratcliffe's Paradox.”
Violent outcomes are rare but exert an outsized influence on the discussion. Our tendency to catastrophize, particularly in an era of deregulated media, amplifies these events. In September of 2020, two employees of a Seattle apartment complex were attacked by a tenant who had been served with eviction notice earlier that day. One of the employees died. According to the Seattle Times, a “mental health caution” was set in the Seattle Police Departments records system (Green, 2020a). In November of 2020, five blocks from the scene of the September attack, a resident of a low-income housing facility attacked and killed a resident case worker, in her office (Green, 2020c).
Evidence of mental illness is present in both cases. The man said he stabbed the case worker on the delusional belief 3 he was going to be evicted (Green, 2020d). Charging documents in the September attack indicate the attacker was evicted, despite a statewide moratorium on evictions, after having threatened to stab a private security guard employed by the property, earlier the same day (Green, 2020b). In both cases, there is evidence to suggest the person was “in crisis.” 4 While it is not immediately clear why police were not involved in attempting to remove the resident involved in the September attack, it is reasonable to assume the ongoing moral panic over police use of force was present in the minds of those involved. 5
The tension between the desire for police alternatives and the need to protect responders is real. The violence demonstrated above could just as easily have befallen a “generalist first responder” (Friedman, 2021, p. 925), with the predictably chilling effect on the adoption of alternatives. Society, thusly, finds itself trapped in a perseverating cycle between catastrophization and moral panic. This describes a kind of analysis paralysis in which it is simultaneously true that most calls for service do not require the police, and it is unsafe to send anyone else. An apt predicament for a jurisdiction which has famously labeled said paralysis, “the Seattle Process” (Bennett & Giloth, 2007).
Our generally impoverished understanding of risk in policing (both conceptual and theoretical) has resulted in the formation and perpetuation of a highly risk averse posture. The current system evolved organically to rely on the professional judgement of “street level bureaucrats” (Prottas, 1978) and “streetcorner politicians” (Muir, 1979) acting as “gatekeepers” (Lum et al., 2020) to the myriad services available from government. The 911 systems in the United States receive approximately 240 million calls annually (National Emergency Number Association, 2021b). Those calls are triaged with a combination of “call takers” and what we term “all-hazards” responders (i.e., police). These professionals gather information, assess need, risk, and apply “minimax” thinking (Muir, 1979) to optimize the delivery of service. This manual process is the current accepted practice. It is demonstrably effective but still carries the inherent risks of an armed police response.
The Opportunity and the Opportunity Cost
Decision-makers are often called to consider what economists refer to as the “opportunity cost” when optimizing investments or forming public policy. Some cost-benefit analyses are simple and objective. Some require a more complex framework to standardize and contextualize the considerations. In this section we review the literature to better understand the opportunities and opportunity costs of a diversified response, as well as gaps in our theoretical and empirical understanding of these concepts in policing.
Loss of efficiency is a substantial opportunity cost incurred under the current model. A recent multi-site survey of call outcomes reveals “Most of the time (between 62% and 83% of calls received) they [police] are likely providing either assistance, advice, or peacekeeping functions” (Lum et al., 2022, p. 15). A similar analysis of calls for service in Seattle estimated slightly more calls were apparently
6
“criminal in nature, approximately 20%, while only about 6% related to felonies.” (National Institute for Criminal Justice Reform - NICJR, 2021, Figure 1). Between 60% and 80% of the time, police are not necessary but are deployed out of an abundance of caution. The all-hazards responder, employing generalist street-level bureaucrats equipped and authorized to defend themselves and other, is inefficient; however, the resulting potential for collateral harms appear to drive the current focus on alternative first responders.
It could be argued that the all-hazards responder model is an effective risk management strategy. Of the 240 million calls received by 911 in the US, approximately 1 in every 244,000, or .00041%7, 8 result in that the Washington Post refers to as “fatal force.” According to statistics maintained by the Washington Post, 80% of those fatal force encounters involve an armed subject, 9 suggesting a large portion of those incidents represent some risk to those involved. Indeed, not all scholars agree the “footprint of police” needs to be shrunk (Thacher, 2022). However, if recent public reaction to incidents involving fatal force is any indication, even this opportunity cost may be more than the public is willing to accept.
In the wake of the murder of George Floyd, several American cities engaged serious attempts to disband, “defund,” or otherwise divest from the all-hazards responder model (i.e., police). In Seattle (WA) and Portland (OR), public demonstrations persisted for more than 100 consecutive nights (Groover, 2020). Those attempts largely failed to appreciably reduce (directly), 10 reform, or reshape police (Cummins, 2023; Lum et al., 2022; Sinclair et al., 2021; Thacher, 2022; Travis, 2021; Vaughn et al., 2022; Weichselbaum & Lewis, 2020), but the effect on the crime rate, public policy, and the urgency to “reimagine” policing appears to reverberate.
While there is much that we do know about the opportunity, and the opportunity cost, we lack a conceptual framework for systematic engagement of these considerations. There is evidence to suggest police perform well as all-hazards responders, serving as street-level bureaucrats and street-corner politicians to effectively optimize outcomes (“minimax strategy” (Muir, 1979, p. 169)) across a diverse set of problem-solving and service functions. The opportunity cost, however, remains too high. There exists a political will to disaggregate the police function (Friedman, 2021), but questions of safety and practicality remain gaps in our understanding.
Current Practice
The systems used to route telephone, text, and email to the call centers, and those systems used to document and coordinate the response are increasingly more sophisticated, integrated, and robust. Yet, public safety still depends on a member of the public observing something “… - that-ought-not-to-be-happening-and-about-which-somebody-had-better-do-something-now” (Bittner, 1970, p. 29), and a street-level bureaucrat interpreting descriptions such as “they is clowning tough” (Gilsinan, 1989), to affect the “just right” response. This model, capable of safely handling millions of calls for service across a disparate and fractured system, is nevertheless inefficient and vulnerable to the rare but catastrophe, or “normal accident” (Perrow, 1984).
In 1938, 11 the first “emergency number” (999) in London, UK (The Communications Museum Trust, n.d). A similar service (911) was launched in the United States in 1968 (National Emergency Number Association, 2021a) 12 . While technical systems (e.g., Computer Aided Dispatch), standardization (e.g., policies, procedures, guides) and training have established a robust service, these systems still rely heavily on a professional to “extract, interpret, and classify” information (Gillooly, 2020; Lum et al., 2020, 2022; Neusteter et al., 2019, 2020; University of Chicago Health Lab, 2022). Call handling is still largely dependent on professional judgement and is thus limited by the perspective of the observer.
As Gillooly (2020) observes, call takers identify the type of service being requested and establish priority but after a rough risk assessment and when triage is complete, an all-hazards responder is still apparently necessary to fine tune the response (i.e., Ratcliffe’s Paradox). Even the mature diversified response competencies (Albuquerque Public Safety, Durham HEART, Denver STAR) rely on a system of manual triage (e.g., field responders monitoring 911, a senior responder in the call center), 13 and referrals from other service providers (e.g., police, fire, social services, etc.). The call triage system in Seattle functions similarly.
Early identification of safety opportunities for a diversified response (call triage) is key and powerful when well governed. An analysis of three years of Seattle call data finds more than 42,000 combinations of how the call was received, how it was classified initially, prioritized, classified finally and “cleared” (disposition) (Atherley, 2024). This analysis considers a single dimension, final call type, and the more than 350 distinct ways to classify a request, after service is complete. A systematic cost-benefit analysis enables strategic development of a diversified response to optimize the benefit. Other industries form strategy in a similar process referred to as “risk management.”
Risk Management
A similar low-frequency-high-impact risk profile is managed by other industries. These systems can be found in a broad spectrum of public and private industrial applications from food safety (Panghal et al., 2018) to nuclear power (Ylönen & Björkman, 2023), the national airspace system and wildland firefighting (Black & Baldauf McBride, 2013). They are as multi-disciplinary as the complex industries they are designed to manage, often reflecting engineering, systems theory, and psychology. This analysis utilizes principles from the highly successful commercial aviation implementation.
According to a 2013 article assessing “problems, challenges, and opportunities” in aviation safety, 2011 was the safest year on record for commercial aviation, with just one out of every 7.1 million passengers suffering a fatality (Oster et al., 2013). Although demonstrably safer than all other forms of transportation, by far, aviation maintains a very low tolerance for mishaps. Much the same is true for policing. A proactive approach to accident prevention has been essential to achieving the current state of airline safety (Oster et al., 2013, p. 3). A 2018 publication by the Federal Aviation Administration credits the maturity of safety systems with a 95% reduction in already low fatalities during the preceding two decades (FAA, 2018). Today, so called “Safety Management Systems” exist to implement “defenses or preventive controls to lower the severity 14 and/or likelihood 15 of a hazard’s projected consequence” (ICAO, 2013, p. xii).
Fundamental to success of these systems is the concept of the “credible worst-case scenario.” In a system processing a high volume of activity, even rare probabilities may eventually be realized. These rare events represent real individual risks but are not typical of the system; however, the relative frequency of these events provides important context. From this credible (likely) worse-case scenario, an evidence-basis is formed to accept, mitigate, or reject the “hazard” 16 . Rather than fixate on the rare catastrophic failure, to the detriment of an objective good, safety management systems emphasize minimax thinking to optimize opportunity.
The current state of safety in policing is like that of commercial aviation in the late 1970s and 1980s. For much of the 20th Century the safety focus was on equipment (FAA, 2021; Müller et al., 2014), or “Hardware” in the human factors SHEL-L model (“Software, Hardware, Environment and Liveware-Liveware interactions”) (Edwards, 1985). It was not until failure was all but engineered out of the hardware that a focus on human factors and organization was embraced (FAA, 2018). A similarly myopic focus is embodied in application of “streetcorner politicians” to “minimize the maximum risk [minimax]” (Muir, 1979, p. 40) as they deliver government service. Empowered to manage physical risk as well as less tangible outcomes (e.g., safety, trust, legitimacy), police discretion and decision-making has long been a trusted bulwark against the permutable harms inherent in the service delivery role they occupy. Now during a crisis of confidence fundamental to the Second Great Awakening, an approach similar to what was applied in commercial aviation nearly a half-century ago, is necessary to affect safety and legitimacy, systematically.
Commercial Aviation: A Model Process
Commercial aviation shares a familiar risk profile with policing. In both cases failure is (1) statistically rare, (2) catastrophic and generally regarded as (3) the result of a cascading series of failures in what Perrow called “complex and coupled systems” (Perrow, 1984). The international consensus on the best practice management of these systems prescribes four-pillars: safety policy, risk management, safety assurance, and safety promotion (FAA, 2010; ICAO, 2013). This paper focuses on two of the three, “risk management” (2010, Section 6.b) and “safety policy” (2010, Section 6.a), critical practices for the development and operation of a diversified response.
We focus on the standard methodology for assessing and controlling 17 risk (i.e., safety risk management). Investigation, analysis, auditing, and promotion of safety are all worthy of their own study, far beyond the scope of this paper. Together, these processes serve to classify and eliminate or mitigate (lesson) any given hazard 18 . These are complementary strategic or “design” processes representing the tolerance(s) for acceptable risk or loss, given a cost-benefit analysis, sometimes referred to as a “value proposition.” This synergy is essential to evidence-based decision-making.
A comprehensive overview, Figure 1 (below) depicts guidance from the International Civil Aviation Organization and US Federal Aviation Administration describing the “Design” and “Performance” of safety. This diagram outlines a system where design decisions are informed of the best available information (see “Design” left of Figure 1, outlined in green), and feedback is used from surveillance of actual operations (see “Performance” right of Figure 1, outlined in blue) to inform continuously improving safety practices, feedback (solid lines between Design and Performance).
Throughout the workflow (Design and Performance), a continuous evolution of data collection, analysis and action (see Figure 1, “swim lanes” 19 depicted far right, red arrow) forms the safety management process.
How is Risk Assessed?
Mature Safety management systems pursue a comprehensive awareness of all hazards, mitigated to an acceptable level (i.e., all credible worst-case scenarios are acceptable), and actively managed to improve continuously (FAA, 2010; ICAO, 2013). Successful risk management, however, does not require the absence of risk. Opportunities come at a cost. It is the substantive significance of the risk assessment, what is described in The cult of statistical significance (McCloskey & Ziliak, 2010) as “oomph” guiding the decision to accept.
Credibility is important to regulators, who decide whether the flying public will be safe. It is equally important to public safety policy makers, for many of the same reasons. For these purposes, risk is defined as the “The composite of predicted severity (how bad) and likelihood (how probable) of the potential effect of a hazard in its worst credible (reasonable or believable) system state.” (FAA, 2010, p. 24) As mentioned previously, given the volume and diversity of services requested of police, anything and everything CAN happen. The credibility of that worst-case scenario is the key difference for policy makers.
Sample Severity Scale (ICAO, 2013, pp. 2–29) Severity Levels With Definitional Criteria (“Meaning”) and Their Value to be Plotted on the Matrix
Scores range from “Negligible” to “Catastrophic,” and can include as many steps as can be discriminated by the evidence.
Sample Likelihood Scale (ICAO, 2013, pp. 2–28) Likelihood Defined (“Meaning”) and Their Value to be Plotted on the Matrix
Sample Risk Matrix (ICAO, 2013, pp. 2–29) Combined Severity and Likelihood plots From Definitions in Tables 1 and 2
The decision to accept risk is contextual and relative. A commercial air carrier may elect to suspend service where any phase of the operation includes any probability (i.e., likelihood) of the loss of an airframe (i.e., severity). 22 In contrast, military operations under the same conditions (loss of an airframe) are acceptable. The eventual loss of an airframe (crew and cargo) may be a near certainty; however, the prospective benefit to the enterprise (a military objective) requires the risk to be accommodated. The goal of any organization managing risk is to establish an “As Low As Reasonably Achievable” (ALARA) condition, while still being able to conduct its core function.
How Does the Assessment Translate into Action?
How much risk is acceptable is a matter of first principles. In the example (above), a commercial airline may find the “loss of hull” and all aboard to be prohibitively risky; however, from the perspective of military flight operations, the objective (e.g., benefit) may not be achievable in safe airspace. A similarly skewed cost-benefit relationship exists in public safety.
The utilitarian need to provide the most public safety to the most people often outweighs the needs of the one. “Active shooters” (lone actor terrorists) arguably suffer an acute behavior condition. Yet, when an active shooter is threatening public safety the use of lethal force to stop (mitigate) that threat may be reasonable.
The risk matrix is a logic model and is both an assessment tool (demonstrated above), and a policy driven guide to action. In visualizing the dimensions of risk (i.e., severity and likelihood), encoded to reflect the specific tolerances (acceptable and unacceptable) of the organization, a evidence basis to manage the credible worst-case scenario results. Severity, risk, and risk tolerance (safety policy) are controls to be manipulated. Consider the sample risk matrix, below (see Figure 2). Sample “Risk Matrix” Where Red Indicates the Area of Unacceptable Risk, Yellow “Acceptable With Mitigation” and Green, “Acceptable” Without Mitigation (FAA, 2010, p. Appendix 3, Page 3)
This Figure both classifies hazards according to risk, and differentiates between “Acceptable”, “Acceptable with Mitigation”, and “Unacceptable” risks. The risk matrix visually correlates control inputs (severity and likelihood) to system outputs (risk). These objective criteria are levers to control risk. To achieve a “good” (i.e., a specialist response) otherwise considered an unacceptable risk, either (1) the credible worst-case (i.e., severity) or (2) its probability of occurring (i.e., likelihood), or both might be altered to achieve a lower risk assessment (from red to yellow).
Interactions with those experiencing a mental health crisis
23
are of particular interest to those seeking to disaggregate the police function. As discussed, interactions with this population are not without risk. Consider the previously discussed risk management framework, in the context of public safety. The recommended action to mitigate risk prescribes altering hazard to reduce severity, likelihood or both (see Figure 3, below). Sample Mitigation Where a Reduction in the Credible Worst-Case Scenario is Achieved by Adjustment to “Severity” and/or “Likelihood.” Hypothetical Scenarios are Labeled “H-A0” Thru “H-A3”and Boxed (Dashed Lines) With Movement of Severity and/or Likelihood Annotated (Blue Boxes)
Tiered Response Model as an Example of Safety Policy. Risk, as Defined by Severity and Likelihood and Acceptable Tolerances, Corresponds to Policy Governing Handling of That “Response Tier”
This is considered a best practice to mitigate the risks associated with serving this population. Many cities, including Seattle, follow the Memphis Model of police crisis intervention, widely considered to the standard in the United States (Compton et al., 2008). A search of Seattle Police Department records and other sources of local history 26 reveals no examples, documented or in recent memory, of an injury to a responder (sworn or civilian) during a co-response. 27 However organic, the development of a specialized response/responders to address an emerging risk suggests policing is inclined to a safety management.
The Role of Safety Policy
“Hedging” 28 may be the key to managing risk in emergency response. Pursuant the order to “reimagine policing” (Durkan, 2020), the City of Seattle commissioned the National Institute for Criminal Justice Reform (NICJR) to evaluate opportunities for a diversified response. The resulting proposed “Tiered Response Model” (see Table 4, below) hedges “the bet” (i.e., risk assessment) that a given service type may be safe for a specialist response by attending those marginal opportunities with a police officer, for safety. This set of standard instructions for managing risk constitutes “safety policy.”
Those calls classified as Tier 1 are service relate to obvious life/health/safety threats and/or criminal activity (in progress or having recently occurred). Much of the opportunity for a diversified response resides within Tier 2 and Tier 3 classifications (e.g., co and alternate response). In a Tier 2 response, a “Co-Responder Assisted Police Response,” the police officer takes primary responsibility for the contact until the credible worst-case scenario (e.g., injury to the responder) is no longer credible. Events assessed to be of lower risk are classified Tier 3, or “Police Assisted Co-Response.” In these situations, police may attend or be staged nearby for rapid intervention, but the credible worst-case scenario is safe enough to allow a more flexible problem-solving. Those events and even reports of crime not requiring an immediate response (e.g., theft from a car discovered hours after it occurred) are classified Tier 4 and are candidates for a differential police response (e.g., online reporting, appointment, etc.) or no police response (e.g., community service officer, outreach worker, etc.).
Safety policy standardizes and governs actions whether in the context of commercial aviation or the delivery of emergency response service. Under the current all-hazards responder model, every phase of the response is controlled by professional judgement. Call takers follow quality assurance and classification protocols, and they extract, interpret and classify information. All-hazards responders (i.e., police) observe, interpret and reclassify/recalibrate the response. All of these actions are governed by professional judgement. Safety policy standardizes those actions and can accommodate additional information (e.g., active risk assessment, call triage, etc.) to realize opportunities while mitigating opportunity cost(s).
Method
This method demonstrates the principles of risk management to assess the opportunity for a diversified response in Seattle, Washington, USA. Data from police calls for service were assessed using a risk matrix formed by the Seattle Police Department to represent risk tolerance(s) (e.g., acceptable, acceptable with mitigation, unacceptable) for the 356 types of service observed during the study period (2017 - 2019). The Tiered Response Model, proposed by the National Institute for Criminal Justice Reform 29 and contemplated above, was used to evaluate a diversified response where the credible worst-case scenario is mitigated to be, hypothetically, acceptable. The resulting estimates of the opportunity for a diversified response are expressed in the analysis as (1) the proportion of call types deemed to be acceptable/acceptable with mitigation risk for alternatives to police and (2) the sum and proportion of police “service hours” that might not have been expended on non-police service types, had these alternatives been available.
Data
Data for this study were formed from data representing police calls for service and fire medical or “aid run” taking place during the study period (2017 - 2019). For each response the most severe outcome was coded for the 727,423 records generated in response to a request for service from a member of the public (“dispatched” 30 ). Calls initiated by a police officer (i.e., “onview”), as well as those associated with any reportable police use of force were removed to control for any recursive effects (see Controls).
Severity was scored from documentation of injury made by Seattle Fire Department personnel. The observed frequency of these outcomes was fit to a curve (see Table 6, “Likelihood,” below) and final call type classifications were plotted on an amended risk matrix. That risk matrix was used to scale the 356 call types observed during the study period and summarize them across the four response tiers discussed, above.
Severity was coded based on objective criteria independent of the police response. These observations were made independent of the Seattle Police Department response and governed by the standard of care guidance established and supervised by licensed medical professionals. The independence of this observation was essential to the legitimacy of the project, as any potential biasing influence resulting from the police response was scrutinized carefully by stakeholders and members of the community. 31
Each observed response during the study period was coded “Severity 1” through “Severity 5” (see) based on the medical provider necessary to render aid to the person injured in the response. For events where the person was indicated “deceased” or transferred to the medical examiner’s office in the patient medical record the event was coded Severity 5 defining the top of the scale. Those events where a paramedic provided care to the patient, a standard referred to in the United States as Advanced Life Support, were coded “Severity 4.” Responses where an Emergency Medical Technician (EMT) 32 cared for the patient (i.e., Basic Life Support) and they were transferred to another medical provider (e.g., transported to the hospital by ambulance), were coded “Severity 3.” For aid rendered by a firefighter EMT where the patient did not require transfer to another medical provider, or for whom it was deemed safe for them to transport themselves (e.g., First Aid), the event was coded “Severity 2.” If no injury or aid was documented by a medical provider 33 the event was coded Severity 1.
Seattle RMD Severity Scale
The severity scale adopted by this project is configured to reflect the (1) relative safety of the opportunity and (2) management of collateral harms (e.g., use of force, constitutional seizure) associated with the delivery of police service. This scale does not address other objectives of interest to public safety managers (e.g., civil unrest, property damage, escalating series/spree, etc.). The proposed framework and decision-making system could, however. The sample severity scale provided above (see Table 1, above) demonstrates the agility of safety management. While this project focuses on objective criteria specifically tailored to the question of responder safety, other industries have taken a more subjective approach.
A Reasonable Approach
Likelihood or the Rate That the Outcome Occurs in the Universe of Observations. Definitions Arrived at Thru a Consensus Built “Reasonableness Test” (See Raters)
A plot of Likelihood fits an exponential form (y = 0.0062e0.8814x), well (R2 = .99),
35
suggesting a compounding effect as the probability of the worst-case scenario approaches one. This figure and its depiction of the non-linear implications for decision-making was presented to decision-makers and subject matter experts for validation of the Likelihood classification matrix. The scale was confirmed to reasonably reflect the values of the organization and was adopted by this project (Figure 4). Likelihood Plot
Like all safety policy, these definitions reflect the values and tolerances of the Seattle Police Department and are expected to be adapted to each individual use case. To the authors knowledge, no established standard or heuristic exists for the definition of severity and/or likelihood. The figures provided here are offered to demonstrate the concept but are not the definitive example. The author recommends that organizations seeking to adapt such an approach to their operational context do so in a way that is simple and transparent, allowing stakeholders and users of the system to fully understand and engage.
Raters
What a reasonable person might consider “acceptable” is critical to risk management (i.e., the legal fiction of the “reasonable man”). Not only does this reasonable standard reflect the collective values of a given organization, but it is also the same standard by which actions may be judged in a court of law. Questions of liability or even criminal responsibility often come down to what a reasonable person might have done in similar circumstances. The selection of and structure for rendering agreement (rating) are therefore essential governance.
For this exercise, three raters convened representing (1) department leadership (a member of command staff authorized to accept risk on behalf of the department), (2) operations (a ranking police officer with more than seventeen years of operational experience), and (3) analytics (the author). For both the initial risk assessment parameters (i.e., Severity and Likelihood), as well as the risk mitigation necessary to establish the final risk management decision (see Risk Management, below), absolute agreement was achieved in a consensus process (debate until agreement).
Seattle Police Risk Matrix. Color Coding Indicates Risk. Red is Unacceptable. Orange and Yellow are Acceptable With Mitigation and Those Service Types Coded Green are Always Acceptable
For all of the apparently quantitative rigor, safety management is a subjective exercise. Risk is relative and what is acceptable to the military is decidedly unacceptable to a commercial enterprise. Those values are best exercised by human raters under a systematic framework. The resulting safety policy, consisting of definitions (e.g., Severity, Likelihood, acceptable, unacceptable, etc.), and standardized actions (e.g., Tiered Response Model), affords strategic and tactical guidance for decision-makers.
Controls
Risk to non-police responders is difficult to ascertain from existing data because alternatives to police have not been widely fielded in Seattle. This “out of sample” condition (N. Silver, 2012) is a critical but common limitation in formative stages of innovation. For the purposes of this proof-of-concept or prototype risk matrix, the initial risk assessment employs injury to anybody associated with the response (excluding “onviews” and police user of force, see Data, above) as a proxy for the risk or danger inherent in the situation. This assessment is a starting point, a “datum,” intended to aid decision-making. The framework proposed here is designed to be enduring, however. As a diversified response matures, generating data as it does so, assumptions about the nature of risk (e.g., posteriors or priors) are updated as a direct reflection of risk.
Police data represent a unique and powerful source for certain public safety insights but poses some risk. Some real and some speculative. Their application, for example, in guiding future public safety policy comes with a risk of compounding/perpetuating the effects of systemic racism. Additionally, stakeholders engaged in early formative phases of this analysis expressed concern that this analysis originated from inside a police service and does not consider direct community input. In response to these concerns, efforts were made to (1) control conceivable recursive effects in the fundamental design of the system (i.e., independent raters, no police initiated activity, no injuries resulting from the police response), (2) produce a conservative initial estimate of risk, while establishing (3) a responsive framework (i.e., methodology) that could adapt as new observations are made (posteriors), risk is recalculated, and norms/values evolve.
In addition to only considering requests for service from the public, injuries resulting from the police response were controlled. Out of an abundance of caution, all events where a police use of force was reported (600 to 900 a year or between .24% and .37%) were removed, including those resulting in an injury to the officer, preserving the independence of outcome measures (i.e., severity). Additionally, calls associated with a documented assault on a police officer, regardless of a reported police use of force, were controlled. On the assumption that serious crime, “Crimes Against Persons” (FBI, 2012), represent an imperative and would always receive a police response, calls resulting in a report documenting an offense of this type were additionally removed.
Finally, the SARS-CoV-2 pandemic was observed to have significantly disrupted the response ecosystems in ways yet to be fully understood. To align with the National Institute for Criminal Justice Reform call analysis and control the biasing effects of abnormal operations, the data were restricted to three complete years (2017, 2018, and 2019) on the assumption that activity would return to pre-pandemic levels and findings would be generalizable.
Procedure
This analysis followed published guidance for safety management systems commonly employed to manage risk in commercial aviation (described above). All calls (n = 727,423) were scored from one to five based on the observed most severe outcome (see “severity,” Table 4, above). The rate at which the most severe was observed (see “likelihood,” Table 5, above) was calculated 36 for each of the call types. These data were then summarized by plotting the final call classification (n = 356) at the intersection of the worst-case outcome (severity) and its observed rate (likelihood) on the risk matrix (see below).
Once plotted, stakeholders at the Seattle Police Department determined what risks were acceptable (green), acceptable with mitigation (orange and yellow), and unacceptable (red) for non-police responders. The risk matrix was color coded to reflect the risk tolerance(s) of the organization and the desired mitigation under the Tiered Response Model (i.e., safety policy). Raters (see Ra) determined the risk of death or serious injury was unacceptable (Table 7, red). The Very Likely (≥50%) occurrence of injury requiring treatment (Severity 3), was also deemed unacceptable (Table 7, red); however, occurring between 10% and 25% was deemed acceptable, with-mitigation-high (Table 7, orange). The remainder of these outcomes, occurring less than 10% of the time, were assessed acceptable, with-mitigation-low (Table 7, yellow). Any call type for which the worst-case scenario injury was not observed to exceed basic first aid (Severity (2) more than 5% of the time or for which the observed worst-case outcome is the documentation of an offense or infraction, was coded green (see Table 7).
Risk Management
Risk Management Results. Table Depicts the Count of Call Types Either “Downgraded,” “Upgraded” or Left Unchanged (“No Change”) in the Course of Risk Management (See Risk Management). Results of Risk Management Reclassification are Depicted in Figure 5, Following
The visual comparison of risk models, pre and post risk management, depicted in Figure 5 (below), demonstrates the systematic management of risk. Distinct from and possibly indicative of government over industrial/commercial risk profiles is government’s apparent intolerance of significant injury or death. Commercial risk management often considers serious injury and even death to be, at least at some level, acceptable in the “opportunity space”
37
(dashed outline). Calculated/Managed Risk Comparison. Table on the Top Represents Raw Calculated Risk. Table on the Bottom Represents the Result of Risk Management “Recoding.” Concentration of Call Types Classified as “Unacceptable” (Red) Classification Depicts “Polarization,” Discussed Below
In industrial applications, this area is where prospective benefit (profit) may be realized. A commercial entity that can operate in this space, safely, has an “edge” on the competition. In government, a similar benefit (just right response) can be realized in this space. The result of that cost-benefit contemplation is depicted in the “Risk Managed” matrix (bottom) and serves as the foundation for continuous improvement. The decision-making process described above (see Raters) reflects the reality of such a contemplation and is familiar to myriad processes observed in any bureaucracy.
Analysis
Risk Managed Demand Resource Estimates by “Call Type” and “Service Hours”
Under the current model the risk matrix is polarized (see Figure 5, above). Call types are heavily loaded on the “Unacceptable” risk classification (red), reflecting a historically conservative approach to risk management. The all-hazards response model effectively mitigates the risk posed but what would otherwise be a dangerous operation (e.g., “BOMB THREATS…”, “ESCAPED PRISONER…”, “ROBBERY…”). Similarly, some responses were calculated high risk and received an urgent police response because someone was injured or killed, despite being accidental, in nature (e.g., “INJURED - PERSON/INDUSTRIAL ACCIDENT”, “ANIMAL - INJURED, DEAD HAZARD, OTHER”). As a diversified response is fielded, the data populating the risk matrix will begin to reflect the reality of a differential response and not the prospective risk as measured by proxy.
Opportunity exists for a more satisfying and appropriate response. Between Tier 2 and Tier 3, eighty-five (forty-two and forty-three respectively) call types are co-response candidates. Tier 4 responses represent 25% (n = 89) of call types and approximately 12.3% (56,095 hours) of all service hours expended during the study period (see Table 9, below). Combined, Tier 3 and Tier 4 responses suggest the potential to divert 15.2% of resource demand. It can be assumed most Tier 2 responses will revert to a Tier 3, allowing police to clear from the call after any life/health/safety threat is mitigated (e.g., a weapon in the environment is removed), increasing the potential resource recovery to as much as 26.8%. 39
While the analysis suggests significant opportunity, specialized triage systems and specialist resources are necessary to realize this potential. In some cases, municipal governments will have the competency (e.g., animal control, social workers) and will simply need to coordinate their response. Many of the types of specialist response indicated here, by virtue of the state of the social welfare state in the US (Garland, 2016), will need to be built from the ground up. Finally, as mentioned in the literature review, although this strategic analysis is promising, a confident triage system is essential. So called “intelligent call centers” employ several promising technologies which enable low/unskilled laborers to consistently serve customers requesting assistance. These systems have yet to be adapted to public safety, however.
Conclusion
This analysis indicates a framework adapted from commercial aviation can answer some key questions and contribute to the sustained operation of a diversified response. Importantly, a diversified response can be managed, safely (RQ-1). With clear, transparent, and objective criteria, the Seattle Police Department and stakeholders employed this approach to conceptualize and accept the opportunity costs of sending the right resource for the service being requested. Where necessary, this model demonstrated value in aiding decision-makers to optimize the opportunities for a right sized response (RQ-2), systematically addressing risk through a tiered-response model, until residual risks were acceptable. Nearly 50% of call types and approximately 26% of the total service hours expended during the study period could have been handled through a diversified response (RQ-3). In a time when police resources are scarce, such a system may realize efficiencies not possible under the all-hazards/street-level bureaucrat/streetcorner politician model.
The safety implications for systematic management of risk are substantial. The Second Great Awakening has sparked excitement, a flurry of serious discussion, and fresh research examining the viability of alternatives to police. While not all observers are convinced that at least some response diversification is necessary, the aftermath of the murder of George Floyd and the continuing calls to defund or otherwise radically reform American policing suggests otherwise. Although the adoption of a decision-making framework may not satisfy calls for reform and is likely to be viewed, reductively, as incrementalism or bureaucratic sleight of hand, Risk Managed Demand is an important step forward.
Public safety policy is responsive so called “sentinel events.” Statistical outliers or “edge cases” can stimulate extreme responses. These events are the worst-case scenario; however, the credibility (that they may recur) is less certain. Between the lag in reactive public safety policy and the “catastrophizing” reflex (Hilhorst, 2013), rational decision-making is challenging. Risk Managed Demand establishes a rubric by which events can be assessed, and safety policy formed to address a probable future condition, a credible worst-case scenario. The effect on government is to facilitate the utilitarian focus fundamental to collective efficacy (MacKay, 2018). Rather than fixate on an improbable limiting condition, Risk Managed Demand allows public policy to develop to manage the most risk, minimizing the maximum risk. As a system of thinking, Risk Managed Demand is both responsive to the inevitable failure and supportive of the critical thinking necessary to achieve an ideal future state, safely. Further, the use of an established system affords a level of indemnification critical to political decisions.
Adverse outcomes are inevitable. Even under a well-managed system, death and serious injury associated with a diversified response are a near certainty. These statistical realities do little to satisfy the expectation that a seriously injured person be attended to by a trained medical professional, expeditiously. Yet, the generalist first responders proffered by Friedman (2021) will inevitably become casualties. Without a system to assess, manage and accept these risks, police will continue to aggregate, rather than disaggregate function. For this reason, the development of “safety culture,” like that accompanying implementation of Safety Management Systems in commercial aviation, is essential. Although this exercise implements risk management for the purpose of responder safety, the safety of all involved is affected. Safety, as an outcome, is not exclusive or discriminating any more than aviation safety exists for the exclusive benefit of the crew.
This analysis demonstrates the use of Risk Managed Demand to manage risk in designing a diversified response. Each call type is assessed and managed (mitigated) to an acceptable level of risk. Tactically, Risk Managed Demand serves to configure technology designed to aid in human decision-making. So called “Intelligent Risk Management” has demonstrated viability in a proof-of-concept phase (Atherley, 2024). The “labels” generated by Risk Managed Demand can be used to “train” a machine to discriminate the potential risk of criminal violence in near-real-time. More importantly, when these decisions are called into question, Risk Managed. Demand serves to memorialize an organizations established safety policy and sanctioned decision-making guidance. Should a specialist response be deployed under this guidance, the decision is indemnified.
Given what appears as increasingly wild “oscillation between over-policing and under-policing” (Sherman, 2022), an evidence-based approach is essential to equitable, accountable, effective, and safe police service. Injuries and even death will happen. A sustainable diversification of responder resource depends upon a transparent and robust rationalization of the credible worst-case scenario. What will be most important in the final analysis is objective proof it was not the result of negligence, and an agile framework to assure history does not repeat itself. In other words, a safety management system for public safety, a Risk Managed Demand.
Footnotes
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
Correction (November 2025):
In the published version of the article, the third-order heading “How Does the Assessment Translate into Action” on page 11 has been changed to the second-order heading. Additionally, the term “Ratliffe’s” and “Ratcliff’s” have been corrected to “Ratcliffe’s” on pages 3 and 20. The article has been updated to reflect these changes.
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data Availability Statement
Full upon request.
