Understanding and Mitigating the Impacts of Differentially Private Census Data on State-Level Redistricting

Abstract

Data from the Decennial Census are published only after applying a disclosure avoidance system (DAS). Data users were shaken by the adoption of differential privacy in the 2020 DAS, a radical departure from past methods. The goal of this article is to better understand how the perturbations from the 2020 DAS combine with sharp legal thresholds to impact redistricting. We consider two redistricting settings in which a data user might be concerned about the impacts of privacy-preserving noise: drawing equal population districts and litigating voting rights cases. What discrepancies arise if the user does nothing to account for disclosure avoidance? How can the discrepancies be understood and accounted for? We study these questions by comparing the official 2010 Redistricting Data to the 2010 Demonstration Data—created using the 2020 DAS—in an analysis of millions of algorithmically generated state legislative redistricting plans. We find that thresholding can amplify the impact of the noise from disclosure avoidance. Large discrepancies do occur, but in ways that are well-captured by simple models and appear to be possible to account for. We demonstrate the utility of these models by proposing an approach to mitigate discrepancies when balancing district populations. At least for state legislatures, Alabama’s claim that differential privacy “inhibits a State’s right to draw fair lines” lacks support.

Keywords

redistricting, privacy, ensemble analysis, Voting Rights Act
