Abstract
Focusing on the Dutch tools SyRI and CAS, this paper describes predictive policing against the background of the broader development toward a pre-crime society, the accompanying culture of control and the new penal logic it gives rise to. It will explain the risks associated with the risk assessments predictive policing tools provide and end with the recommendation to use predictive policing not only for police deployment, but also to target problem-oriented responses to crime to the right persons and places.
Introduction
This paper describes the phenomenon of predictive policing, focusing on two Dutch predictive policing tools that recently generated a lot of public attention, namely: SyRI and CAS. The Dutch government stopped using SyRI, a predictive policing tool for predicting fraudsters, because the District Court of The Hague ruled it violates the right to privacy as contained in article 8 of the European Convention on Human Rights (Rb. Den Haag, 05-02-2020, ECLI: NL: RBDHA:2020:865, available at http://www.rechtspraak.nl). The use of CAS, a predictive policing tool for predicting crime locations, just increased as it has been implemented nationwide since 2019 (Halfjaarbericht politie 2019, bijlage 5: 3), making the Netherlands the first country in the world to deploy predictive policing on a national scale.
It will be argued predictive policing is part of the broader development toward a pre-crime society (Zedner, 2007), the accompanying culture of control (Garland, 2001) and the new penal logic (Feeley and Simon, 1992) it gives rise to and that the ethical issues surrounding this phenomenon can be better understood against this background. Predictive policing aims to prevent crime by providing risk assessments, but these risk assessments have risks of their own. Since the risk analyses often lack transparency and explainability, it is not possible to weigh the crime risks to be prevented and the risks of crime prevention properly, which may lead to disproportionate intrusions with the right to privacy and violation of the related rights to equal treatment in equal cases and to protection against discrimination, stereotyping and stigmatization.
Proponents claim that predictive policing tools for predicting offenders and crime locations actually restore instead of erode the rights of persons thought to be likely to commit a crime or living in a crime-prone neighborhood, because they provide neutral, quantitative evidence that a person is indeed likely to commit a crime or that an area is indeed a high-crime area. But the evidence predictive policing tools for predicting offenders or crime locations provide, is not entirely neutral, because it is based on assumptions that do not need to be true. Moreover, it is important to realize that numbers do not explain reasons. In the longrun, the causes of crime are an important key to crime prevention. Therefore, this paper will come to the conclusion that predictive policing should also be used to direct interventions to locations and persons where they are most needed if we really want to reduce crime.
Predictive policing: Definition, scope and effects
According to Perry
Recently, the Dutch SyRI (short for:
Nor the indicators, neither the risk model used by SyRI are known. Therefore, it cannot be described how it works. To indicate how a predictive policing tool for predicting offenders can work, another tool, namely: the Custom Notification program, is briefly discussed here. The Custom Notification program is used on a citywide basis by the Chicago Police Department since July 2013 and has been developed by the Chicago Police Department itself in collaboration with the Illinois Institute of Technology. It identifies potential victims and perpetrators associated with the continuum of gun violence in the city of Chicago. Based on empirical data it creates a Strategic Subjects List (SSL): a rank-order list of individuals that have an increased likelihood of engagement in violent criminal activity. These data include data on demographics, arrest history and social network variables. They are analyzed by means of a prediction model that uses co-arrests to previous homicide victims to predict the likelihood that a person becomes a homicide perpetrator or victim (Saunders et al., 2016: 354–357). For each individual on the list a letter is created which incorporates factors known about him or her. This letter is delivered and explained by police officers. The Custom Notifications program serves as notice that law enforcement action will be targeted specifically to the individual and continuation to participate in gun violence will have cognizable penalties (Chicago Police Department 2015).
Besides the above-mentioned predictive policing tool for predicting offenders SyRI, there is also a predictive policing tool for predicting crime locations in use in the Netherlands, namely: CAS (short for:
For the rest it is not clear how CAS works exactly, but it is known it makes use of a near-repeat concept (Drenth and Van Steden, 2017: 6). The near-repeat concept is based on the empirical phenomenon that there is an increased risk of crime occurring within a certain geographical and time window once a crime has taken place (Rummens et al., 2017: 264–266). The best predictor of victimization is prior victimization. ‘Repeat victimization’, as it is called, tends to occur quickly after the initial crime (Johnson and Bowers, 2004b). Victims are at an elevated risk of crime in the months directly following an event (Johnson et al., 2007). In case of burglary, not only the burgled home is at risk of being burgled soon again, but also other, similar properties in the neighborhood (Johnson and Bowers, 2004b). There is evidence that specifically burglary is geographically concentrated, as it clusters in space and time more than one would expect if the perceived ‘patterns’ of crime would simply result from the attractiveness of places to offenders (Johnson et al., 2007, 2009). This can be explained as follows. Once a perpetrator has committed a crime, it is easier to repeat that crime than to identify a new location and/or criminal act. In case of burglary, especially in wealthy housing areas where people have the means to do so, counter-measures are often taken against revictimization. That makes neighboring properties easier to invade than the original burglarized house (Johnson et al., 2007: 204). Properties within 400 meters of a burgled house are at a significantly elevated risk of burglary for up to two months after the initial event (Bowers et al., 2004). The location of clusters of burglary is not predictable over periods of 3 or more months, however (Johnson and Bowers, 2004a). There are a number of plausible explanations for this: the characteristics of an area can change over time (e.g. because of interventions from those responsible for crime reduction) and the perpetrator’s memory for the features of a particular house and those nearby may decay (Johnson et al., 2007: 215).
An example of another predictive policing tool for predicting crime locations that makes use of the near-repeat concept is
Prior victimization has a greater predictive power than any other variable (Johnson and Bowers, 2004b). A study of burglary patterns in two different areas in five separate countries reveals that ‘near repeat’ has also good predictive power (Johnson et al., 2007: 215). But the effects of Predpol and CAS on crime reduction (i.e. the outcome of the police response employed) are disappointing. Mohler
Little is known about the effectiveness of predictive policing tools for predicting offenders. There are, to my knowledge, no data available about SyRI. In a trial Saunders, Hunt & Hollywood found that individuals on the SSL produced by the Custom Notification program were not more (or less) likely to become a victim of a shooting than the comparison group, but that they were more likely to be arrested for one. A possible explanation is that the list was used as an intelligence-gathering source: when a shooting happened, the police looked at the SSL for possible suspects (Saunders et al., 2016: 365–367).
The broader development: The shift from harm to risk
Crime forecasting is not a new phenomenon; it has been around for decades. Scientists have used statistical and geospatial analyses to determine crime risk levels ever since the sociologist Shaw and the criminologist McKay studied the persistence of juvenile crime in specific neighborhoods of Chicago, and later also in 20 other American cities, and found juvenile delinquency to be highly correlated with, among other things, changes in population, inadequate housing, poverty, tuberculosis and mental disorders (Shaw and McKay, 1942). From the mid-1980s onward risk factor prevention became a dominant paradigm in crime control (Mehozay and Fisher, 2019: 524). In the 1990s algorithms for crime forecasting were computerized. In recent years, these algorithms have become more sophisticated and, due to an increase in computer power and storage, bigger data sets can be analyzed (Mehozay and Fisher, 2019: 524; Perry et al., 2013: 3–4). Meanwhile, the amount of data available has grown exponentially and still does: it doubles in volume every 2 years. Moreover, the investigatory utility of the data available improves because they are networked: law enforcement agencies and private companies connect their databases and, thereby, aggregate their data (Ferguson, 2015: 354, 360). These developments have led to predictive policing as defined and described in the last section.
Predictive policing does not stand on its own but is part of a broader development and the ethical issues surrounding this phenomenon can be better understood against this background. According to the criminologist Zedner we are on the cusp of a shift from a post-crime society, within which the dominant ordering practices arise post hoc, to a pre-crime society, within which ordering practices are pre-emptive. Where post hoc ordering practices respond to wrongs done, pre-emptive ordering practices shift ‘
The shift from post hoc ordering practices to pre-emptive ordering practices goes hand in hand with a shift from a perception of crime as harm to a perception of crime as risk (Zedner, 2007: 262). It is a more or less established fact that contemporary society can be characterized as a ‘risk society’ (Borgers and Van Sliedregt, 2009: 172). The concept of the risk society originally derives from the sociologist Beck (Beck, 1986). In broad terms, it entails that society is focused on the control and management of risks. According to the criminologist Garland this has led to a ‘
The view of crime as a risk to be calculated has also led to a ‘
From a criminological point of view, the Reoffender Institutionalization Measure can be seen as part of the managerial movement, within which risk assessments are conducted on an individual basis and based on clinical judgment by professionals. Predictive policing is part of a new phase in this movement, the actuarial phase, which can be described as an evolution toward evidence-based practices and mathematical tools to assess risk. The actuarial phase, again, represents a new penology as it focuses less on things like responsibility, guilt, intervention and rehabilitation, but instead focuses on setting out techniques for the identification, classification and management of groups according to risk levels. Proponents of these techniques argue that they introduce a new level of accuracy and may even eliminate forms of bias that were inherent in previous methods of risk assessment. The motivation for accuracy and bias-free analysis has led to more and more sophisticated analytical tools that incorporate big data and machine learning algorithms. The downside of this development is that these algorithms are often a ‘black box’ and it is impossible to explain how the risk score was established (Mehozay and Fisher, 2019: 524, 531–533), as is the case for the two Dutch predictive policing tools discussed in the last section: SyRI and CAS.
This is problematic because it goes against an important moral value, namely: the value of transparency, which is among the key requirements for Trustworthy Artificial Intelligence (Ethics Guidelines for Trustworthy AI, Chapter II). The value of transparency is usually not seen as an end in itself, but as an important prerequisite for the realization of other values, such as the value of privacy. As AI and other computer-related technology is often ‘
The lack of transparency and hence explainability of the risk calculations provided by predictive policing tools is even more problematic because of the nature of the decisions that can be based on them, such as the deployment of investigatory powers. According to the 19th century philosopher John Stuart Mill ‘
The desirability of the above-mentioned shift from the harm principle to the precautionary principle is under discussion. The influential American legal scholar Cass Sunstein, who focuses specifically on the precautionary principle as a moral ground for the criminalization of certain behaviors, claims that it ‘
The risk associated with predictive policing
In the SyRI case, the District Court of The Hague compared the objectives of the legislation regulating the use of) SyRI, i.e. to prevent and combat fraud in the interest of economic welfare, with the intrusion into private life that it makes. According to the court, the legislation does not meet the fair balance required by the ECHR in order to be able to speak of a sufficiently justified intrusion into private life, because it is insufficiently transparent and verifiable with regard to the use of SyRI. The legislation was therefore declared unlawful and non-binding (Rb. Den Haag, 05-02-2020, ECLI: NL: RBDHA: 2020:865, available at http://www.rechtspraak.nl). This verdict can be explained as follows.
In European countries the right to privacy is protected by article 8 of the European Convention on Human Rights. It reads as follows: ‘
The latter also goes for other predictive policing tools for predicting offenders. An evaluation of the Chicago Custom Notification program discussed before shows that there was ‘
Similar concerns rise with regard to predictive policing tools for predicting crime locations. When for example CAS or PredPol directs police officers toward a place where crime is likely to occur at a time it is likely to occur, they cannot reasonably assume that all persons there present are involved in a crime. Based on the circumstances, the police officers will have to determine which persons, if any, warrant further investigation. However, the fact that a predictive policing tool for predicting crime locations has identified the place as a place where crime is likely to occur, may influence their view on the situation. Imagine, for example, that a predictive policing tool for predicting crime locations has directed police officers to a certain area where property crime is likely to occur. There present they see a person carrying a duffel bag. That activity is by itself not obviously suspicious but in light of the fact that the predictive policing tool has identified the area as a place where property crime is likely to occur, the police officers find it suspicious and stop the person in order to search his bag for burglar’s tools or stolen property (Joh, 2014: 55–59). Then it turns out that the person, who has no criminal record, was on his way from his home to the laundromat and the duffel bag contains laundry. Had he lived in another area, police officers would probably not have thought they had reason to interfere with his right to privacy.
Some legal scholars belong to the proponents of predictive policing that claim that predictive policing tools for predicting offenders and crime locations actually restore instead of erode the right to privacy of persons thought to be likely to commit a crime or people living in high-crime neighborhoods (see e.g. Koss, 2015: 305). They argue as follows. Reasonable suspicion is, at its core, ‘
Other legal scholars contest this claim. The above-mentioned evidence predictive policing tools for predicting offenders or crime locations provide, is not entirely neutral. That is because the development of these tools necessarily involves human discretion (Joh, 2014: 58). Not all predictive policing tools for predicting offenders or crime locations are the same. They make use of different algorithms, requiring the input of different (amounts of) data and resulting in different statistical predictions of the likeliness that that a certain person will commit a crime or crime will occur at a certain place. Moreover, the developers of a particular predictive policing tool have selected the appropriate algorithm with a certain model or theory of crime prediction in mind (Bennett Moses and Chan, 2016: 6). As mentioned before, the models used in predictive policing have not been specifically developed for crime prediction, but originate from seismology and epidemiology (Johnson and Bowers, 2004b; Mohler et al., 2016: 1400). But several theories of crime prediction exist. Well-known theories for predicting offenders are the strain theories, which assume people can be pressured into crime. Agnew recently made an attempt to develop a more general strain theory. According to Agnew the most important strains are if people are treated negatively by others, lose something that is valuable to them or cannot achieve their goals. These strains evoke negative emotions which in turn encourage delinquent behavior (Agnew, 2006: 19; Kolthoff, 2016: 85–86). Examples of theories for predicting crime locations are the opportunity theory and the routine activity theory (Johnson et al., 2007: 203). The opportunity theory reasons that crime rates will be the highest in locations that contain the best opportunities for crime. The routine activity theory assumes that crime will not take place ‘
Assumptions underlying the algorithms at work in predictive policing
Not only is the selection of the appropriate algorithm, but also the design of the algorithms themselves based on assumptions. This section will discuss four common assumptions underlying the algorithms at work in predictive policing tools for predicting offenders or crime locations. It will also explain how they affect the accuracy of the predictions these tools offer.
The first general assumption underlying the algorithms at work in predictive policing tools of any type is that the data inputted accurately reflect reality. But this does not need to be the case, especially with regard to crime data. Whether or not something constitutes a crime and how that crime is classified or categorized is a matter of discretion and may differ for different police officers (Bennet Moses and Chan, 2016: 5). This problem is exacerbated because predictive policing tools do not only make use of data collected by police departments in their normal course of business, but also of data that come from other sources (Perry et al., 2013: 13). Certain classifications or categories may have different meanings in different organizations. Moreover, crime data are necessarily limited to reports by victims and police observations. On the one hand, there is a lot of unreported and unseen crime, especially in the area of domestic violence. Because there are no data on these hidden crimes, they will not be inputted into predictive policing tools and they will continue to ignore them (Bennet Moses and Chan 2016: 4–5). On the other hand, there are crimes that have a greater chance of being seen than others, because they are committed by an individual that is considered crime-prone or in a neighborhood that is considered high-crime. When data on these crimes are inputted into a predictive policing tool for predicting crimes or offenders, its predictions may simply reinforce stereotypes that certain neighborhoods or individuals need heavier police attention (Joh, 2014: 58). This way, prior police contacts become a kind of digital scarlet letters (Ferguson, 2015: 401). That is problematic because predictive policing results in ‘
The second general assumption underlying the algorithms at work in predictive policing tools of any type is that history repeats itself. The data analyzed in the context of predictive policing always consist of historical data. The algorithmic procedures used to analyze them look for patterns (Perry et al., 2013: 17). They assume that factors relevant in the past will continue to be relevant in the future (Chan and Bennett Moses, 2016: 32). As was established before, there is evidence that specifically burglary is geographically concentrated, as it clusters in space and time more than one would expect if the perceived patterns of crime would simply result from the attractiveness of places to offenders (Johnson et al., 2007, 2009). But that does not apply (to the same extent) to other crimes (Bennett Moses and Chan, 2016: 5). Moreover, the location of clusters of burglary is not predictable over periods of 3 or more months (Johnson and Bowers, 2004a), because the characteristics of an area change over time and the perpetrator’s memory for the features of a particular house and those nearby may decay (Johnson et al., 2007: 215). Predictive policing itself may spur the recording of crime in a neighborhood, but that may only mean that criminals change their work area and crime rates increase in adjacent areas (Bennett Moses and Chan, 2016: 5). The same goes for predictive policing tools for predicting offenders. If police would, for example, assume that tattoos correlate with crime and, therefore, focus surveillance on people with tattoos, people planning to commit crimes may decide against getting tattoos (Chan and Bennett Moses, 2016: 33). It should be added that advanced, machine-learning algorithms are able to predict such changes as well though (see e.g. Berk and Bleich, 2013: 541). But it is probably the reason why the risk indicators and risk model behind SyRI have never been made public.
The third general assumption underlying the algorithms at work in predictive policing tools of any type is that they focus on a relevant set of data and that the data omitted are irrelevant (Bennet Moses and Chan, 2016: 5). But the fact that certain data are omitted, does not mean that they are irrelevant. They might simply not be available, expensive or difficult to procure. Or it was not realized that they would be relevant when the predictive policing tool in question was being developed. Clearly, the omission of relevant data affects the accuracy of predictive policing tools (Bennet Moses and Chan, 2016: 6).
The three assumptions mentioned above show that we should be aware that the statistical predictions predictive policing tools for predicting crime locations or offenders provide ‘
Conclusion
Focusing specifically on the Dutch predictive policing tools SyRI (for predicting
Proponents claim that predictive policing tools for predicting offenders and crime locations actually restore instead of erode the rights of persons thought to be likely to commit a crime or living in a crime-prone neighborhood, because they provide neutral, quantitative evidence that a person is indeed likely to commit a crime or that an area is indeed a high-crime area. But the evidence predictive policing tools for predicting offenders or crime locations provide, is not entirely neutral. They make use of different algorithms, requiring the input of different (amounts of) data and resulting in different statistical predictions of the likeliness that that a certain person will commit a crime or crime will occur at a certain place. Also, the developers of a particular predictive policing tool have selected the appropriate algorithm with a certain theory or model of crime prediction in mind. Not only is the selection of the appropriate algorithm, but also the design of the algorithms themselves based on assumptions. Moreover, it is important to realize that numbers do not explain reasons. In the long run, the causes of crime are an important key to crime prevention. Problem-oriented responses, such as mentor programs, youth sports programs and neighborhood meetings, can be a more effective way to prevent crime than police deployment. Therefore, predictive policing should also be used to direct these interventions to locations and persons where they are most needed if we really want to reduce crime.
This conclusion gives rise to other questions, such as whether crime can be seen as a risk in itself, which is de underlying idea of predictive policing. Or are the causes of crime the actual risks that need to be prevented? Answering these questions is beyond the scope of this paper, however. Predictive policing and its underlying ideas are in need of further discussion and analysis not only in the field of legal philosophy, but also in the respective fields of police studies and criminology.
Footnotes
Acknowledgements
I would like to thank my brother Rense Strikwerda, data analyst, and two anonymous reviewers for their useful comments.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
