Sage Journals: Discover world-class research

Abstract

Researchers studying income inequality, economic segregation, and other subjects must often rely on grouped data—that is, data in which thousands or millions of observations have been reduced to counts of units by specified income brackets. The distribution of households within the brackets is unknown, and highest incomes are often included in an open-ended top bracket, such as “$200,000 and above.” Common approaches to this estimation problem include calculating midpoint estimators with an assumed Pareto distribution in the top bracket and fitting a flexible multiple-parameter distribution to the data. The authors describe a new method, mean-constrained integration over brackets (MCIB), that is far more accurate than those methods using only the bracket counts and the overall mean of the data. On the basis of an analysis of 297 metropolitan areas, MCIB produces estimates of the standard deviation, Gini coefficient, and Theil index that are correlated at 0.997, 0.998, and 0.991, respectively, with the parameters calculated from the underlying individual record data. Similar levels of accuracy are obtained for percentiles of the distribution and the shares of income by quintiles of the distribution. The technique can easily be extended to other distributional parameters and inequality statistics.

Keywords

income distribution inequality grouped data estimation techniques

Get full access to this article

View all access options for this article.

References

Abdi

Hervé

. 2010. “Coefficient of Variation.”Encyclopedia of Research Design 1:169–71.

Allison

Paul D.

1978. “Measures of Inequality.”American Sociological Review 43(6):865–80.

Atkinson

Anthony B.

1970. “On the Measurement of Inequality.”Journal of Economic Theory 2(3):244–63.

Atkinson

Anthony B.

Piketty

Thomas

Saez

Emmanuel

. 2011. “Top Incomes in the Long Run of History.”Journal of Economic Literature 49(1):3–71.

Bronfenbrenner

Martin

. 1971. Income Distribution Theory. New York: Aldine.

Champernowne

D. G.

1974. “A Comparison of Measures of Inequality of Income Distribution.”Economic Journal 84(336):787–816.

Cloutier

Norman R.

1988. “Pareto Extrapolation Using Grouped Income Data.”Journal of Regional Science 28(3):415–19.

Conceição

Pedro

Ferreira

Pedro

. 2000. “The Young Person’s Guide to the Theil Index: Suggesting Intuitive Interpretations and Exploring Analytical Applications.” Retrieved March6, 2017 (https://papers.ssrn.com/sol3/papers.cfm?abstract_id= 228703).

Corcoran

Sean

Evans

William N.

2010. “Income Inequality, the Median Voter, and the Support for Public Education.”Cambridge, MA: National Bureau of Economic Research. Retrieved February18, 2017 (http://www.nber.org/papers/w16097).

10.

Cowell

F. A.

1977. Measuring Inequality. Oxford, UK: Philip Allan.

11.

Crimi

Nicole

Eddy

William

. 2014. “Top-coding and Public Use Microdata Samples from the U.S. Census Bureau.”Journal of Privacy and Confidentiality 6(2). Retrieved (http://repository.cmu.edu/jpc/vol6/iss2/2).

12.

Gini

Corrado

. 1912. “Variabilità e Mutabilità.” Reprinted in Memorie Di Metodologia Statistica, edited by Pizetti

Salvemini

Rome, Italy: Libreria Eredi Virgilio Veschi.

13.

Gini

Corrado

. 1921. “Measurement of Inequality of Incomes.”Economic Journal 31(121):124–26.

14.

Heitjan

Daniel F.

1989. “Inference from Grouped Continuous Data: A Review.”Statistical Science 4(2):164–79.

15.

Henson

Mary F.

1967. “Trends in the Income of Families and Persons in the United States, 1947–1964.” Washington, DC: U.S. Department of Commerce, Bureau of the Census.

16.

Andrew D.

Reardon

Sean F.

2012. “Estimating Achievement Gaps from Test Scores Reported in Ordinal ‘Proficiency’ Categories.”Journal of Educational and Behavioral Statistics 37(4):489–517.

17.

Jargowsky

Paul A.

1996. “Take the Money and Run: Economic Segregation in US Metropolitan Areas.”American Sociological Review 61(6):984–98.

18.

Kinney

Satkartar K.

Karr

Alan

. 2017. “Public-use vs. Restricted-use: An Analysis Using the American Community Survey.”Rochester, NY: Social Science Research Network. Retrieved February21, 2017 (https://papers.ssrn.com/abstract=2909935).

19.

Liebenberg

Maurice

Kaitz

Hyman

. 1951. “An Income Size Distribution from Income Tax and Survey Data, 1944.” Pp. 378–462 in Studies in Income and Wealth, edited by Conference on Research in Income and Wealth. Cambridge, MA: National Bureau of Economic Research. Retrieved February18, 2017 (http://www.nber.org/chapters/c5728.pdf).

20.

Lorenz

M. O.

1905. “Methods of Measuring the Concentration of Wealth.”Publications of the American Statistical Association 9(70):209–19.

21.

Mandelbrot

Benoit

. 1960. “The Pareto-Lévy Law and the Distribution of Income.”International Economic Review 1(2):79–106.

22.

McDonald

James B.

1984. “Some Generalized Functions for the Size Distribution of Income.”Econometrica 52(3):647–63.

23.

Miller

Herman Phillip

. 1966. “Income Distribution in the United States.” Washington, DC: U.S. Government Printing Office.

24.

Minoiu

Camelia

Reddy

Sanjay G.

2008. “Estimating Poverty and Inequality from Grouped Data: How Well Do Parametric Methods Perform?”Rochester, NY: Social Science Research Network. Retrieved March8, 2017 (https://papers.ssrn.com/abstract=925969).

25.

Minoiu

Camelia

Reddy

Sanjay G.

2014. “Kernel Density Estimation on Grouped Data: The Case of Poverty Assessment.”Journal of Economic Inequality 12(2):163–89.

26.

Piketty

Thomas

. 2014. Capital in the Twenty-first Century. Cambridge, MA: Belknap.

27.

Piketty

Thomas

. 2015. The Economics of Inequality. 3rd ed.Cambridge, MA: Belknap.

28.

Piketty

Thomas

Saez

Emmanuel

. 2003. “Income Inequality in the United States, 1913–1998.”Quarterly Journal of Economics 118(1):1–41.

29.

Quandt

Richard E.

1966. “Old and New Methods of Estimation and the Pareto Distribution.”Metrika 10(1):55–82.

30.

Ruggles

Steven

Genadek

Katie

Goeken

Ronald

Grover

Josiah

Sobek

Matthew

. 2015. “Integrated Public Use Microdata Series” [Data set]. Minneapolis: University of Minnesota.

31.

Shannon

C. E.

1948. “A Mathematical Theory of Communication.”Bell System Technical Journal 27(3):379–423, 623–56.

32.

Shorrocks

A. F.

1980. “The Class of Additively Decomposable Inequality Measures.”Econometrica 48(3):613–25.

33.

Soltow

Lee

. 1981. “The Distribution of Property Values in England and Wales in 1798.”Economic History Review 34(1):60–70.

34.

Theil

Henri

. 1967. Economics and Information Theory. Amsterdam, the Netherlands: North-Holland.

35.

U.S. Census Bureau. n.d. “American Community Survey and Puerto Rico Community Survey: 2011 Subject Definitions.” Retrieved (https://www2.census.gov/programs-surveys/acs/tech_docs/subject_definitions/2011_ACSSubjectDefinitions.pdf).

36.

von Hippel

Paul T.

Scarpino

Samuel V.

Holas

Igor

. 2016. “Robust Estimation of Inequality from Binned Incomes.” Pp. 212–51 in Sociological Methodology, Vol. 46, edited by Alwin

Duane F.

Thousand Oaks, CA: Sage.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

6.74 MB

Estimating Income Statistics from Grouped Data: Mean-constrained Integration over Brackets

Abstract

Keywords

Get full access to this article

References

Supplementary Material