Prediction of Recidivism and Detection of Risk Factors Under Different Time Windows Using Machine Learning Techniques

Abstract

Following a comprehensive analysis of the initial three generations of prisoner risk assessment tools, the field has observed a notable prominence in the integration of fourth-generation tools and machine learning techniques. However, limited efforts have been made to address the explainability of data-driven prediction models and their connection with treatment recommendations. Our primary objective was to develop predictive models for assessing the likelihood of recidivism among prisoners released from their index incarceration within 1-year, 2-year, and 5-year timeframes. We aimed to enhance interpretability using SHapley Additive exPlanations (SHAP). We collected data from 20,457 in-prison records from February 10, 2005, to August 25, 2021, sourced from a Southwestern China prison’s data management system. Recidivism records were officially determined through data mining from an official website and combined identification data from neighboring prisons. We employed five machine learning algorithms, considering sociodemographic, physical health, psychological assessments, criminological characteristics, crime history, social support, and in-prison behaviors as factors. For interpretability, SHAP was applied to reveal feature contributions. Findings indicated that young prisoners accused of larceny, previous convictions, lower fines, and limited family support faced higher reoffending risk. Conversely, middle-aged and senior prisoners with no prior convictions, lower monthly supermarket expenses, and positive psychological test results had lower reoffending risk. We also explored interactions between significant predictive features, such as prisoner age at incarceration initiation and primary accusation, and the duration of current incarceration and cumulative prior incarcerations. Notably, our models consistently exhibited high performance, as shown by AUC on the test dataset across time windows. Interpretability results provided insights into evolving risk factors over time, valuable for intervention with high-risk individuals. These insights, with additional validation, could offer dynamic prisoner information for stakeholders. Moreover, interpretability results can be seamlessly integrated into prison and court management systems as a valuable risk assessment tool.

Keywords

recidivism different time windows machine learning SHapley additive exPlanations risk prediction

Get full access to this article

View all access options for this article.

References

Abby

(2018, February 3). Your future doctor may not be human. This is the rise of AI in medicine. Futurism. https://futurism.com/ai-medicine-doctor

Ahuja

A. S.

(2019). The impact of artificial intelligence in medicine on the future role of the physician. PeerJ, 7, e7702. https://doi.org/10.7717/peerj.7702

Altman

D. G.

Bland

J. M.

(1994). Diagnostic tests. 1: Sensitivity and specificity. BMJ, 308(6943), 1552. https://doi.org/10.1136/bmj.308.6943.1552

Andrews

D. A.

Bonta

Wormith

J. S.

(2006). The recent past and near future of risk and/or need assessment. Crime & Delinquency, 52(1), 7–27. https://doi.org/10.1177/0011128705281756

Angwin

Larson

Mattu

Kirchner

(2016, May 23). Machine Bias. ProPublica. https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing

Beaudry

Alaei

Fazel

(2022). Predicting violent reoffending in individuals released from prison in a lower-middle-income country: A validation of OxRec in Tajikistan. Frontiers in Psychiatry, 13, 805141. https://doi.org/10.3389/fpsyt.2022.805141

Biddle

J. B.

(2022). On predicting recidivism: Epistemic risk, tradeoffs, and values in machine learning. Canadian Journal of Philosophy, 52(3), 321–341. https://doi.org/10.1017/can.2020.27

Bonta

Andrews

D. A.

(2016). The psychology of criminal conduct, Taylor & Francis.

Borden

H. G.

(1928). Factors for predicting parole success. Journal of the American Institute of Criminal Law and Criminology, 19(3), 328. https://doi.org/10.2307/1134622

10.

Bureau of Justice Assistance . (2015). What is risk assessment. Bureau of Justice Assistance. https://bja.ojp.gov/program/psrac/basics/what-is-risk-assessment

11.

Chen

(2018). Rational considerations on gradually increasing parole rates in compliance with the law. Chinese Justice, 2018(9), 88–92.

12.

Chouldechova

(2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2), 153–163. https://doi.org/10.1089/big.2016.0047

13.

Corbett-Davies

Pierson

Feller

Goel

(2016, October 17). A computer program used for bail and sentencing decisions was labeled biased against blacks. It’s actually not that clear. Washington Post. https://www.washingtonpost.com/news/monkey-cage/wp/2016/10/17/can-an-algorithm-be-racist-our-analysis-is-more-cautious-than-propublicas/

14.

Dotan

Milli

(2019). Value-laden disciplinary shifts in machine learning. (arXiv:1912.01172). arXiv. https://arxiv.org/abs/1912.01172

15.

Dressel

Farid

(2018). The accuracy, fairness, and limits of predicting recidivism. Science Advances, 4(1), eaao5580. https://doi.org/10.1126/sciadv.aao5580

16.

Durnescu

(2018). The five stages of prisoner reentry: Toward a process theory. International Journal of Offender Therapy and Comparative Criminology, 62(8), 2195–2215. https://doi.org/10.1177/0306624X17706889

17.

Eisenberg

M. J.

van Horn

J. E.

Dekker

J. M.

Assink

van der Put

C. E.

Hendriks

Stams

G. J. J. M.

(2019). Static and dynamic predictors of general and violent criminal offense recidivism in the forensic outpatient population: A meta-analysis. Criminal Justice and Behavior, 46(5), 732–750. https://doi.org/10.1177/0093854819826109

18.

Fazel

Chang

Fanshawe

Långström

Lichtenstein

Larsson

Mallett

(2016). Prediction of violent reoffending on release from prison: Derivation and external validation of a scalable tool. The Lancet Psychiatry, 3(6), 535–543. https://doi.org/10.1016/S2215-0366(16)00103-6

19.

Fazel

Wolf

(2015). A systematic review of criminal recidivism rates worldwide: Current difficulties and recommendations for best practice. PLoS One, 10(6), Article e0130390. https://doi.org/10.1371/journal.pone.0130390

20.

Fernandez

de Lasala

(2021). Risk assessment in prison, Publications Office of the European Union.

21.

Gendreau

Little

Goggin

(1996). A meta-analysis of the predictors of adult offender recidivism: What works!*. Criminology, 34(4), 575–608. https://doi.org/10.1111/j.1745-9125.1996.tb01220.x

22.

Ghasemi

Anvari

Atapour

Stephen wormith

Stockdale

K. C.

Spiteri

R. J.

(2021). The application of machine learning to a general risk–need assessment instrument in the prediction of criminal recidivism. Criminal Justice and Behavior, 48(4), 518–538. https://doi.org/10.1177/0093854820969753

23.

Gillespie

(2014). The relevance of algorithms. In Gillespie

Boczkowski

P. J.

Foot

K. A.

, (Eds.), (pp. 167–194). The MIT Press. https://doi.org/10.7551/mitpress/9780262525374.003.0009

24.

Grieger

Hosser

(2014). Which risk factors are really predictive? An analysis of Andrews and bonta’s “central eight” risk factors for recidivism in German youth correctional facility inmates. Criminal Justice and Behavior, 41(5), 613–634. https://doi.org/10.1177/0093854813511432

25.

Grove

W. M.

Zald

D. H.

Lebow

B. S.

Snitz

B. E.

Nelson

(2000). Clinical versus mechanical prediction: A meta-analysis. Psychological Assessment, 12(1), 19–30. https://doi.org/10.1037/1040-3590.12.1.19

26.

Håkansson

Berglund

(2012). Risk factors for criminal recidivism – A prospective follow-up study in prisoners with substance abuse. BMC Psychiatry, 12(1), 111. https://doi.org/10.1186/1471-244X-12-111

27.

Halligan

Altman

D. G.

Mallett

(2015). Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: A discussion and proposal for an alternative approach. European Radiology, 25(4), 932–939. https://doi.org/10.1007/s00330-014-3487-0

28.

Hanson

R. K.

Harris

A. J. R.

Letourneau

Helmus

L. M.

Thornton

(2018). Reductions in risk based on time offense-free in the community: Once a sexual offender, not always a sexual offender. Psychology, Public Policy, and Law, 24(1), 48–63. https://doi.org/10.1037/law0000135

29.

Hare

(2003). Psychopathy Checklist-Revised. Psychological Assessment.

30.

Illinois Sentencing Policy Advisory Council . (2018). Illinois results first: The high cost of recidivism 2018 report. Illinois Sentencing Policy Advisory Council. https://spac.illinois.gov/publications/cost-benefit-analysis/high-cost-of-recidivism-2018

31.

Katsikas

S. K.

(2009). Chapter 35—risk management. In Vacca

J. R.

(Ed.), Computer and information security handbook (pp. 605–625). Morgan Kaufmann. https://doi.org/10.1016/B978-0-12-374354-1.00035-2

32.

(2021). Unravelling the mystery of parole in China. British Journal of Criminology, 62(4), 896–913. https://doi.org/10.1093/bjc/azab081

33.

Maltz

(1984). Recidivism. Academic Press. https://www.academia.edu/10061829/Recidivism

34.

Meehl

P. E.

(1954). Clinical versus statistical prediction: A theoretical analysis and a review of the evidence (pp. 10–149). University of Minnesota Press. https://doi.org/10.1037/11281-000

35.

Mulder

Brand

Bullens

van Marle

(2011). Risk factors for overall recidivism and severity of recidivism in serious juvenile offenders. International Journal of Offender Therapy and Comparative Criminology, 55(1), 118–135. https://doi.org/10.1177/0306624X09356683

36.

Newton

May

Eames

Ahmad

(2019). Economic and social costs of reoffending. In: Ministry of Justice Analytical Series (p. 51), Ministry of Justice.

37.

Ozkan

(2017). Predicting recidivism through machine learning. The University of Texas at Dallas. https://hdl.handle.net/10735.1/5405

38.

Ozkan

Clipper

S. J.

Piquero

A. R.

Baglivio

Wolff

(2020). Predicting sexual recidivism. Sexual Abuse: A Journal of Research and Treatment, 32(4), 375–399. https://doi.org/10.1177/1079063219852944

39.

Park

S. H.

Goo

J. M.

C.-H.

(2004). Receiver operating characteristic (ROC) curve: Practical review for radiologists. Korean Journal of Radiology, 5(1), 11–18. https://doi.org/10.3348/kjr.2004.5.1.11

40.

Pelham

W. E.

Petras

Pardini

D. A.

(2020). Can machine learning improve screening for targeted delinquency prevention programs? Prevention Science: The Official Journal of the Society for Prevention Research, 21(2), 158–170. https://doi.org/10.1007/s11121-019-01040-2

41.

Pepe

M. S.

Kerr

K. F.

Longton

Wang

(2013). Testing for improvement in prediction model performance. Statistics in Medicine, 32(9), 1467–1482. https://doi.org/10.1002/sim.5727

42.

PROSPERCSIS . (2018, December 14). Developing countries should invest in prisoners, not prisons. Prosper. https://csisprosper.com/2018/12/14/developing-countries-should-invest-in-prisoners-not-prisons/

43.

Rayaprolu

(2019, March 22). 25+ impressive big data statistics for 2023. Techjury. https://techjury.net/blog/big-data-statistics/

44.

Ritter

(2013). Predicting recidivism risk: New tool in Philadelphia shows great promise. National Institute of Justice Journal, 271, 4–13. https://nij.ojp.gov/topics/articles/predicting-recidivism-risk-new-tool-philadelphia-shows-great-promise

45.

Roach

M. A.

Schanzenbach

M. M.

(2015). The effect of prison sentence length on recidivism: Evidence from random judicial assignment. Northwestern Law & Econ Research Paper, (16–8). https://doi.org/10.2139/ssrn.2701549

46.

Schoeman

(2010). Recidivism: A conceptual and operational conundrum. Acta Criminologica: African Journal of Criminology & Victimology, 2010(sed-1), 80–94. https://doi.org/10.10520/EJC28571

47.

Seiter

R. P.

Kadela

K. R.

(2003). Prisoner reentry: What works, what does not, and what is promising. Crime & Delinquency, 49(3), 360–388. https://doi.org/10.1177/0011128703049003002

48.

Singh

Grann

Fazel

(2011). A comparative study of violence risk assessment tools: A systematic review and metaregression analysis of 68 studies involving 25,980 participants. Clinical Psychology Review, 31(3), 499–513. https://doi.org/10.1016/j.cpr.2010.11.009

49.

Thornton

Hanson

R. K.

Kelley

S. M.

Mundt

J. C.

(2021). Estimating lifetime and residual risk for individuals who remain sexual offense free in the community: Practical applications. Sexual Abuse: A Journal of Research and Treatment, 33(1), 3–33. https://doi.org/10.1177/1079063219871573

50.

Tollenaar

van der Heijden

P. G. M.

(2013). Which method predicts recidivism best? A comparison of statistical, machine learning and data mining predictive models. Journal of the Royal Statistical Society - Series A: Statistics in Society, 176(2), 565–584. https://doi.org/10.1111/j.1467-985X.2012.01056.x

51.

Travaini

G. V.

Pacchioni

Bellumore

Bosia

De Micco

(2022). Machine learning and criminal justice: A systematic review of advanced methodology for recidivism risk prediction. International Journal of Environmental Research and Public Health, 19(17), 10594. Article 17. https://doi.org/10.3390/ijerph191710594

52.

Vedelago

Amlung

Morris

Petker

Balodis

McLachlan

Mamak

Moulden

Chaimowitz

MacKillop

(2019). Technological advances in the assessment of impulse control in offenders: A systematic review. Behavioral Sciences & the Law, 37(4), 435–451. https://doi.org/10.1002/bsl.2420

53.

Wang

Han

Patel

Rudin

(2023). In pursuit of interpretable, fair and accurate machine learning for criminal recidivism prediction. Journal of Quantitative Criminology, 39(2), 519–581. https://doi.org/10.1007/s10940-022-09545-w

54.

Wartna

B. S. J.

Nijssen

L. T. J.

(2006). National studies on recidivism: An inventory of large-scale recidivism research in 33 European countries (Fact Sheets 2006-11), WODC. https://repository.wodc.nl/handle/20.500.12832/309

55.

Wong

S. C. P.

Gordon

(2006). The validity and reliability of the violence risk scale: A treatment-friendly violence risk assessment tool. Psychology, Public Policy, and Law, 12(3), 279–309. https://doi.org/10.1037/1076-8971.12.3.279

56.

Wormith

J. S.

(2017). Automated offender risk assessment: The next generation or a black hole? Criminology & Public Policy, 16(1), 281–303. Article 1. https://doi.org/10.1111/1745-9133.12277

57.

Zhang

Yuan

Chen

Liu

(2022). Temporal dynamics of clinical risk predictors for hospital-acquired acute kidney injury under different forecast time windows. Knowledge-Based Systems, 245, 108655. https://doi.org/10.1016/j.knosys.2022.108655

58.

Yukhnenko

Sridhar

Fazel

(2019). A systematic review of criminal recidivism rates worldwide: 3-year update. Wellcome Open Research, 4, 28. https://doi.org/10.12688/wellcomeopenres.14970.3

59.

Zeng

Ustun

Rudin

(2017). Interpretable classification models for recidivism prediction. Journal of the Royal Statistical Society - Series A: Statistics in Society, 180(3), 689–722. https://doi.org/10.1111/rssa.12227

60.

Zhang

(2017). On the integrated interaction between imprisonment execution and community correction. Journal of Beijing Police College, 3, 83–89. https://doi.org/10.16478/j.cnki.jbjpc.20170601.002

61.

Zhou

Q. M.

Zhe

Brooke

R. J.

Hudson

M. M.

Yuan

(2021). A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve. Diagnostic and Prognostic Research, 5(1), 13. https://doi.org/10.1186/s41512-021-00102-w

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

3.10 MB