In this article, I introduce the ipfraking package, which implements weight-calibration procedures known as iterative proportional fitting, or raking, of complex survey weights. The package can handle a large number of control variables and trim the weights in various ways. It also provides diagnostic tools for the weights it creates. I provide examples of its use and a suggested workflow for creating raked replicate weights.
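To make the idea of raking concrete, the following is a minimal sketch of iterative proportional fitting in Python. It is not the ipfraking implementation or its syntax: the function name `rake`, its arguments, and the toy data are hypothetical and chosen only for illustration. The sketch assumes every category has at least one respondent and that the control totals of the different variables sum to the same population size. It does no weight trimming and creates no replicate weights.

```python
def rake(weights, categories, targets, tol=1e-8, max_iter=100):
    """Rake unit weights to marginal control totals (illustrative sketch).

    weights    : starting weights, one per respondent
    categories : dict, variable name -> list of category labels per respondent
    targets    : dict, variable name -> {category label: control total}
    """
    w = list(weights)
    for _ in range(max_iter):
        max_rel_change = 0.0
        for var, labels in categories.items():
            # Current weighted total in each category of this variable.
            totals = {}
            for wi, lab in zip(w, labels):
                totals[lab] = totals.get(lab, 0.0) + wi
            # Rescale so that this margin matches its control totals.
            for i, lab in enumerate(labels):
                factor = targets[var][lab] / totals[lab]
                max_rel_change = max(max_rel_change, abs(factor - 1.0))
                w[i] *= factor
        # Stop once a full pass over all variables leaves the weights
        # essentially unchanged.
        if max_rel_change < tol:
            break
    return w


# Hypothetical toy data: six respondents with equal starting weights.
sex = ["M", "M", "M", "F", "F", "F"]
region = ["N", "N", "S", "S", "N", "S"]
base = [10.0] * 6
controls = {"sex": {"M": 45.0, "F": 55.0}, "region": {"N": 60.0, "S": 40.0}}

raked = rake(base, {"sex": sex, "region": region}, controls)
print([round(x, 3) for x in raked])
```

Each pass adjusts the weights to one margin at a time, which generally disturbs the margins adjusted earlier; the passes are repeated until all margins agree with their control totals to within the tolerance.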