Principled Machine Learning Using the Super Learner: An Application to Predicting Prison Violence

Abstract

A rapidly growing number of algorithms are available to researchers who apply statistical or machine learning methods to answer social science research questions. The unique advantages and limitations of each algorithm are relatively well known, but it is not possible to know in advance which algorithm is best suited for the particular research question and the data set at hand. Typically, researchers end up choosing, in a largely arbitrary fashion, one or a handful of algorithms. In this article, we present the Super Learner—a powerful new approach to statistical learning that leverages a variety of data-adaptive methods, such as random forests and spline regression, and systematically chooses the one, or a weighted combination of many, that produces the best forecasts. We illustrate the use of the Super Learner by predicting violence among inmates from the 2005 Census of State and Federal Adult Correctional Facilities. Over the past 40 years, mass incarceration has drastically weakened prisons’ capacities to ensure inmate safety, yet we know little about the characteristics of prisons related to inmate victimization. We discuss the value of the Super Learner in social science research and the implications of our findings for understanding prison violence.

Keywords

machine learning ensemble methods super learner prisons violence

Get full access to this article

View all access options for this article.

References

Beijersbergen

Karin A.

Dirkzwager

Anja J. E.

van der Laan

Peter H.

Nieuwbeerta

Paul

. 2014. “A Social Building? Prison Architecture and Staff–Prisoner Relationships.” Crime & Delinquency 62:843–74.

Berk

Richard A.

2008. Statistical Learning From a Regression Perspective. New York: Springer.

Berk

Richard A.

Bleich

Justin

. 2013. “Statistical Procedures for Forecasting Criminal Behavior.” Criminology & Public Policy 12:513–44.

Berk

Richard A.

Kriegler

Brian

Baek

Jong-Ho

. 2006. “Forecasting Dangerous Inmate Misconduct: An Application of Ensemble Statistical Procedures.” Journal of Quantitative Criminology 22:131–45.

Bierie

David M.

2012. “Is Tougher Better? The Impact of Physical Prison Conditions on Inmate Violence.” International Journal of Offender Therapy and Comparative Criminology 56:338–55.

Bottoms

Anthony E.

1999. “Interpersonal Violence and Social Order in Prisons.” Crime and Justice 26:205–81.

Breiman

Leo

. 1996. “Stacked regressions.” Machine Learning 24: 49–64.

Breiman

Leo

. 2001. “Random Forests.” Machine Learning 45:5–32.

Bushway

Shawn D.

2013. “Is there Any Logic to Using Logit.” Criminology & Public Policy 12:563–67.

10.

Byrne

J. M.

Hummer

Don

. 2008. “Examining the Impact of Institutional Culture On Prison Violence and Disorder: An Evidence-based Review.” Pp. 40–90 in The Culture of Prison Violence, edited by Byrne

J. M.

Hummer

Don

Taxman

F. S.

. Boston, MA: Pearson.

11.

Camp

Scott D.

Gaes

Gerald G.

Langan

Neal P.

Saylor

William G.

. 2003. “The Influence of Prisons on Inmate Misconduct: A Multilevel Investigation.” Justice Quarterly 20:501–33.

12.

Chipman

Hugh A.

George

Edward I.

McCulloch

Robert E.

. 2010. “BART: Bayesian Additive Regression Trees.” The Annals of Applied Statistics 4:266–98.

13.

Clear

Todd R.

1994. Harm in American Penology: Offenders, Victims, and Their Communities. Albany: SUNY Press.

14.

Clear

Todd R.

Frost

Natasha A.

. 2013. The Punishment Imperative: The Rise and Failure of Mass Incarceration in America. New York: NYU Press.

15.

Crewe

Ben

. 2013. “The Sociology of Imprisonment.” Pp. 123–51 in Handbook on Prisons, edited by Jewkes

Yvonne

. New York: Routledge.

16.

Finn

Peter

. 1996. “No-frills Prisons and Jails: A Movement in Flux.” Federal Probation 60:35–44.

17.

Franklin

Travis W.

Franklin

Cortney A.

Pratt

Travis C.

. 2006. “Examining the Empirical Relationship Between Prison Crowding and Inmate Misconduct: A Meta-analysis of Conflicting Research Results.” Journal of Criminal Justice 34:401–12.

18.

French

Sheila A.

Gendreau

Paul

. 2006. “Reducing Prison Misconducts: What Works!” Criminal Justice and Behavior 33:185–218.

19.

Freund

Yoav

Schapire

Robert E.

. 1997. “A Decision-theoretic Generalization of On-line Learning and an Application to Boosting.” Journal of Computer and System Sciences 55:119–39.

20.

Friedman

Jerome H.

1991. “Multivariate Adaptive Regression Splines.” The Annals of Statistics 19:1–67.

21.

Gaes

Gerald G.

1994. “Prison Crowding Research Reexamined.” The Prison Journal 74:329–63.

22.

Gaes

Gerald G.

Flanagan

Timothy J.

Motiuk

Laurence L.

Stewart

Lynn

. 1999. “Adult Correctional Treatment.” Crime and Justice 26:361–426.

23.

Gendreau

Paul

Goggin

Claire E.

Law

Moira A.

. 1997. “Predicting Prison Misconducts.” Criminal Justice and Behavior 24:414–31.

24.

Glymour

M. Maria

Osypuk

Theresa L.

Rehkopf

David H.

. 2013. “Invited Commentary: Off-roading With Social Epidemiology–Exploration, Causation, Translation.” American Journal of Epidemiology 178:858–63.

25.

Gonçalves

Leonel C.

Gonçalves

Rui A.

Martins

Carla

Dirkzwager

Anja J. E.

. 2014. “Predicting Infractions and Health Care Utilization in Prison a Meta-analysis.” Criminal Justice and Behavior 41:921–41.

26.

Gottschalk

Marie

. 2014. “Democracy and the Carceral State in America.” The ANNALS of the American Academy of Political and Social Science 651:288–95.

27.

Hamilton

Zachary

Neuilly

Melanie-Angela

Lee

Stephen

Barnoski

Robert

. 2014. “Isolating Modeling Effects in Offender Risk Assessment.” Journal of Experimental Criminology 11:299–318.

28.

Haney

Craig

. 2006. “The Wages of Prison Overcrowding: Harmful Psychological Consequences and Dysfunctional Correctional Reactions.” Washington University Journal of Law & Policy 22:265–93.

29.

Harding

Richard

. 2001. “Private Prisons.” Crime and Justice 28:265–346.

30.

Hastie

Trevor

Tibshirani

Robert

. 1986. “Generalized Additive Models.” Statistical Science 1:297–310.

31.

Hastie

Trevor

Tibshirani

Robert

Friedman

Jerome

. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York: Springer.

32.

Kinlock

Timothy W.

O’Grady

Kevin E.

Hanlon

Thomas E.

. 2003. “The Effects of Drug Treatment on Institutional Behavior.” The Prison Journal 83:257–76.

33.

Lahm

Karen F.

2008. “Inmate-on-inmate Assault: A Multilevel Examination of Prison Violence.” Criminal Justice and Behavior 35:120–37.

34.

Landenberger

Nana A.

Lipsey

Mark W.

. 2005. “The Positive Effects of Cognitive–Behavioral Programs for Offenders: A Meta-analysis of Factors Associated With Effective Treatment.” Journal of Experimental Criminology 1:451–76.

35.

Logan

Charles H.

1992. “Well Kept: Comparing Quality of Confinement in Private and Public Prisons.” Journal of Criminal Law and Criminology 83:577–613.

36.

MacKenzie

Doris Layton

. 2006. What Works in Corrections: Reducing the Criminal Activities of Offenders and Delinquents. New York: Cambridge University Press.

37.

McCorkle

Richard C.

Miethe

Terance D.

Drass

Kriss A.

. 1995. “The Roots of Prison Violence: A Test of the Deprivation, Management, and ‘Not-so-total’ Institution Models.” Crime & Delinquency 41:317–31.

38.

McFarland

Daniel A.

Lewis

Kevin

Goldberg

Amir

. 2016. “Sociology in the Era of Big Data: The Ascent of Forensic Social Science.” The American Sociologist 47:12–35.

39.

Morris

Robert G.

Carriaga

Michael L.

Diamond

Brie

Piquero

Nicole Leeper

Piquero

Alex R.

. 2012. “Does Prison Strain Lead to Prison Misbehavior? An Application of General Strain Theory to Inmate Misconduct.” Journal of Criminal Justice 40:194–201.

40.

Neugebauer

Romain

Fireman

Bruce

Roy

Jason A.

Raebel

Marsha A.

Nichols

Gregory A.

O’Connor

Patrick J.

. 2013. “Super Learning to Hedge Against Incorrect Inference from Arbitrary Parametric Assumptions in Marginal Structural Modeling.” Journal of Clinical Epidemiology 66: S99–109.

41.

Neugebauer

Romain

Schmittdiel

Julie A.

van der Laan

Mark J.

. 2014. “Targeted Learning in Real-world Comparative Effectiveness Research with Time-varying Interventions.” Statistics in Medicine 33:2480–520.

42.

Ngo

Fawn T.

Govindu

Ramakrishna

Agarwal

Anurag

. 2015. “Assessing the Predictive Utility of Logistic Regression, Classification and Regression Tree, Chi-squared Automatic Interaction Detection, and Neural Network Models in Predicting Inmate Misconduct.” American Journal of Criminal Justice 40:47–74.

43.

Pearson

Frank S.

Lipton

Douglas S.

. 1999. “A Meta-analytic Review of the Effectiveness of Corrections-based Treatments for Drug Abuse.” The Prison Journal 79:384–410.

44.

Pirracchio

Romain

Petersen

Maya L.

Laan

Mark van der

. 2015. “Improving Propensity Score Estimators’ Robustness to Model Misspecification Using Super Learner.” American Journal of Epidemiology 181:108–19.

45.

Rose

Sherri

. 2013. “Mortality Risk Score Prediction in an Elderly Population Using Machine Learning.” American Journal of Epidemiology 177:443–52.

46.

Sampson

Robert J.

Byron Groves

. 1989. “Community Structure and Crime: Testing Social-disorganization Theory.” American Journal of Sociology 94:774–802.

47.

Selman

Donna

Leighton

Paul

. 2010. Punishment for Sale: Private Prisons, Big Business, and the Incarceration Binge. Plymouth, UK: Rowman & Littlefield.

48.

Shaw

Clifford R.

McKay

Henry D.

. 1942. Juvenile Delinquency and Urban Areas. Chicago, IL: University of Chicago Press.

49.

Simon

Ellen

. 1992. “Who’s Minding the Rights of Inmates When Justice Goes to the Lowest Bidder.” Human Rights 19:22.

50.

Steiner

Benjamin

. 2009. “Assessing Static and Dynamic Influences on Inmate Violence Levels.” Crime & Delinquency 55:134–61.

51.

Steiner

Benjamin

Daniel Butler

Ellison

Jared M.

. 2014. “Causes and Correlates of Prison Inmate Misconduct: A Systematic Review of the Evidence.” Journal of Criminal Justice 42:462–70.

52.

Steiner

Benjamin

Wooldredge

John

. 2014. “Comparing Self-report to Official Measures of Inmate Misconduct.” Justice Quarterly 31:1074–101.

53.

Stuckler

David

Basu

Sanjay

. 2013. The Body Economic: Why Austerity Kills. New York: Basic Books.

54.

Sykes

Gresham M.

2007. The Society of Captives: A Study of a Maximum Security Prison. Princeton, NJ: Princeton University Press.

55.

Taxman

Faye S.

Kitsantas

Panagiota

. 2009. “Availability and Capacity of Substance Abuse Programs in Correctional Settings: A Classification and Regression Tree Analysis.” Drug and Alcohol Dependence 103:S43–53.

56.

Tibshirani

Robert

. 1996. “Regression Shrinkage and Selection via the Lasso.” Journal of the Royal Statistical Society. Series B (Methodological) 58:267–88.

57.

van der Laan

Mark J.

Dudoit

Sandrine

. 2003. “Unified Cross-validation Methodology for Selection Among Estimators and a General Cross-validated Adaptive Epsilon-net Estimator: Finite Sample Oracle Inequalities and Examples.” UC Berkeley Division of Biostatistics Working Paper Series Paper 130. Retrieved from http://biostats.bepress.com/ucbbiostat/paper130

58.

van der Laan

Mark J.

Polley

Eric C.

Hubbard

Alan E.

. 2007. “Super Learner.” Statistical Applications in Genetics and Molecular Biology 6:1–21.

59.

van der Laan

Mark J.

Rose

Sherri

. 2011. Targeted Learning: Causal Inference for Observational and Experimental Data. New York: Springer.

60.

Walters

Glenn D.

1998. “Time Series and Correlational Analyses of Inmate-initiated Assaultive Incidents in a Large Correctional System.” International Journal of Offender Therapy and Comparative Criminology 42:124–32.

61.

Wolff

Nancy

Shi

Jing

Siegel

Jane

. 2009. “Understanding Physical Victimization Inside Prisons: Factors That Predict Risk.” Justice Quarterly 26:445–75.

62.

Wooldredge

John

Griffin

Timothy

Pratt

Travis

. 2001. “Considering Hierarchical Models for Research on Inmate Behavior: Predicting Misconduct With Multilevel Data.” Justice Quarterly 18:203–31.

63.

Wolpert

D. H.

1992. “Stacked generalization.” Neural Networks 5:241–259.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.24 MB

0.28 MB

0.24 MB