
Abstract

In this article, I introduce the ipfraking package, which implements weight-calibration procedures known as iterative proportional fitting, or raking, of complex survey weights. The package can handle a large number of control variables and trim the weights in various ways. It also provides diagnostic tools for the weights it creates. I provide examples of its use and a suggested workflow for creating raked replicate weights.
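The core idea of raking can be illustrated with a minimal sketch. This is not the ipfraking implementation or its interface. The function name `rake`, its arguments, the toy data, and the control totals below are all assumptions made for illustration. The sketch repeatedly rescales the weights so that the weighted totals match the known control totals for one variable at a time, and stops when a full pass changes no weight by more than a small tolerance.

```python
import numpy as np

def rake(weights, categories, targets, tol=1e-8, max_iter=200):
    """Hypothetical illustration of iterative proportional fitting.

    weights    : base weights, one per sampled unit
    categories : dict mapping variable name -> array of category codes per unit
    targets    : dict mapping variable name -> {category code: control total}
    """
    w = np.asarray(weights, dtype=float).copy()
    for _ in range(max_iter):
        max_change = 0.0
        # Adjust one control variable at a time.
        for name, codes in categories.items():
            for code, total in targets[name].items():
                mask = codes == code
                current = w[mask].sum()
                if current <= 0:
                    raise ValueError(f"no weight in category {code} of {name}")
                factor = total / current
                w[mask] *= factor
                max_change = max(max_change, abs(factor - 1.0))
        # Stop once a full pass leaves every margin essentially unchanged.
        if max_change < tol:
            break
    return w

# Toy data (assumed): six units, two control variables.
sex = np.array([0, 0, 0, 1, 1, 1])
age = np.array([0, 1, 0, 1, 0, 1])
base = np.full(6, 10.0)

raked = rake(
    base,
    {"sex": sex, "age": age},
    {"sex": {0: 48.0, 1: 52.0}, "age": {0: 40.0, 1: 60.0}},
)
print(raked[sex == 0].sum(), raked[age == 1].sum())
```

The sketch omits weight trimming, diagnostics, and replicate weights, which the abstract lists among the package's features.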

Keywords

st0323, ipfraking, mat2do, xls2row, survey, calibration, weights, raking, iterative proportional fitting

Calibrating Survey Data using Iterative Proportional Fitting (Raking)
