Sage Journals: Discover world-class research

Abstract

Background

Correctional facilities can act as amplifiers of infectious disease outbreaks. Small community outbreaks can cause larger prison outbreaks, which can in turn exacerbate the community outbreaks. However, strategies for epidemic control in communities and correctional facilities are generally not closely coordinated. We sought to evaluate different strategies for coordinated control.

Methods

We developed a stochastic simulation model of an epidemic spreading across a network of communities and correctional facilities. We parameterized it for the initial phases of the COVID-19 epidemic for 1) California communities and prisons based on community data from covidestim, prison data from the California Department of Corrections and Rehabilitation, and mobility data from SafeGraph, and 2) a small, illustrative network of communities and prisons. For each community or prison, control measures were defined by the intensity of 2 activities: 1) screening to detect and isolate cases and 2) nonpharmaceutical interventions (e.g., masking and social distancing) to reduce transmission. We compared the performance of different control strategies including heuristic and reinforcement learning (RL) strategies using a reward function, which accounted for both the benefit of averted infections and nonlinear cost of the control measures. Finally, we performed analyses to interpret the optimal strategy and examine its robustness.

Results

The RL control strategy robustly outperformed other strategies including heuristic approaches such as those that were largely used during the COVID-19 epidemic. The RL strategy prioritized different characteristics of communities versus prisons when allocating control resources and exhibited geo-temporal patterns consistent with mitigating prison amplification dynamics.

Conclusion

RL is a promising method to find efficient policies for controlling epidemic spread on networks of communities and correctional facilities, providing insights that can help guide policy.

Highlights

For modelers, we developed a stochastic simulation model of an epidemic spreading across a network of communities and correctional facilities, and we parameterized it for the initial phases of the COVID-19 epidemic for California communities and prisons in addition to an illustrative network.

We compared different control strategies using a reward function that accounted for both the benefit of averted infections and cost of the control measures; we found that reinforcement learning robustly outperformed the other strategies including heuristic approaches such as those that were largely used during the COVID-19 epidemic.

For policy makers, our work suggests that they should consider investing in the further development of such methods and using them for the control of future epidemics.

We offer qualitative insights into different factors that might inform resource allocation to communities versus prisons during future epidemics.

Keywords

epidemics COVID-19 simulation modeling reinforcement learning correctional facilities

Get full access to this article

View all access options for this article.

References

Saloner

Parish

Ward

DiLaura

Dolovich

COVID-19 cases and deaths in federal and state prisons. JAMA. 2020;324(6):602–3.

Marquez

Ward

Parish

Saloner

Dolovich

COVID-19 incidence and mortality in federal and state prisons compared with the US population, April 5, 2020, to April 3, 2021. JAMA. 2021;326(18):1865–7.

Puglisi

Malloy

GSP

Harvey

Brandeau

Wang

EA.

Estimation of COVID-19 basic reproduction ratio in a large urban jail in the United States. Ann Epidemiol. 2021;53:103–5.

Weyant

Meyer

Bromberg

Beyrer

Altice

Goldhaber-Fiebert

JD.

Decarceration and COVID-19 infections in U.S. Immigration and Customs Enforcement detention facilities: a simulation modeling study. Lancet Reg Health Am. 2025;42:100971.

Cords

Martinez

Warren

, et al. Incidence and prevalence of tuberculosis in incarcerated populations: a systematic review and meta-analysis. Lancet Public Health. 2021;6(5):e300–8.

Beaudry

Zhong

Whiting

Javid

Frater

Fazel

Managing outbreaks of highly contagious diseases in prisons: a systematic review. BMJ Global Health. 2020;5(11):e003201.

Reinhart

Chen

DL.

Incarceration and its disseminations: COVID-19 pandemic lessons from Chicago’s Cook County Jail. Health Affairs. 2020;39(8):1412–8.

Reinhart

Chen

DL.

Carceral-community epidemiology, structural racism, and COVID-19 disparities. Proc Natl Acad Sci U S A. 2021;118(21):e2026577118.

Reinhart

Chen

DL.

Association of jail decarceration and anticontagion policies with COVID-19 case growth rates in US counties. JAMA Network Open. 2021;4(9):e2123405.

10.

Malloy

GSP

Puglisi

Bucklen

Harvey

Wang

Brandeau

. Predicting COVID-19 outbreaks in correctional facilities using machine learning. MDM Policy Pract. 2024;9(1):23814683231222469.

11.

Weyant

Lee

Andrews

Alarid-Escudero

Goldhaber-Fiebert

JD.

Dynamics of respiratory infectious diseases in incarcerated and free-living populations: a simulation modeling study. Med Decis Making. 2023;43(1):42–52.

12.

Keeling

Rohani

Modeling Infectious Diseases in Humans and Animals. Princeton (NJ): Princeton University Press; 2008. Available from: https://www.jstor.org/stable/j.ctvcm4gk0. [Accessed 20 October, 2021].

13.

California Department of Corrections and Rehabilitation. Prison Statistics. Available from: https://www.cdcr.ca.gov/research/. [Accessed 1 April, 2023].

14.

US Census Bureau. County population totals and components of change: 2020–2024 [Internet]. Census.gov. March 2025. Available from: https://www.census.gov/data/tables/time-series/demo/popest/2020s-counties-total.html. [Accessed 13 March, 2025].

15.

SafeGraph. Places data curated for accurate geospatial analytics. Available from: https://safegraph.com/. [Accessed 14 March, 2025].

16.

Kang

Gao

Liang

Rao

Kruse

Multiscale dynamic human mobility flow dataset in the U.S. during the COVID-19 epidemic. Sci Data. 2020;7(1):390.

17.

The Farama Foundation. Gymnasium documentation. Available from: https://gymnasium.farama.org/index.html. [Accessed 19 April, 2025].

18.

Schulman

Wolski

Dhariwal

Radford

Klimov

Proximal policy optimization algorithms. 2017. Available from: https://arxiv.org/abs/1707.06347v2. [Accessed 19 April, 2025].

19.

Stable-Baselines3. Stable-baselines3 docs - reliable reinforcement learning implementations. Available from: https://stable-baselines3.readthedocs.io/en/master/index.html#. [Accessed 19 April, 2025].

20.

Optuna. A hyperparameter optimization framework. Available from: https://optuna.org/. [Accessed 19 April, 2025].

21.

Chen

Guestrin

XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016, pp. 785–94. Available from: http://arxiv.org/abs/1603.02754. [Accessed 19 April, 2025].

22.

Friedman

Hastie

Tibshirani

Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33:1–22.

23.

Wright

Ziegler

Ranger: a fast implementation of random forests for high dimensional data in C++ and R. J Stat Softw. 2017;77:1–17.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.78 MB

Reinforcement Learning-Based Control of Epidemics on Networks of Communities and Correctional Facilities

Abstract

Background

Methods

Results

Conclusion

Highlights

Keywords

Get full access to this article

References

Supplementary Material