We develop robust methods for analyzing clustered data where estimation of marginal regression parameters is of interest. Inverse cluster size reweighting in the objective function to be minimized is incorporated to handle the issue of informative cluster size. Performance of the resulting estimators is studied by simulation. Large sample inference and variance estimation is carried out. The methodology is illustrated using a periodontal disease dataset.
BeckJDKochGCRozierRGTudorGE (1990) Prevalence and risk indicators for periodontal attachment loss in a population of older community-dwelling blacks and whites. Journal of Periodontology, 61, 521–28.
2.
BlazerDGGeorgeLK (2004) Established populations for epidemiologic studies of the elderly, 1996–1997: Piedmont health survey of the elderly, fourth in-person survey Durham, Warren, Vance, Granville, and Franklin Counties, North Carolina [Computer file]. ICPSR02744-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], doi:10.3886/ICPSR02744.
3.
DattaSSattenGA (2005) Rank-sum tests for clustered data. Journal of American Statistical Association, 100, 908–15.
4.
DattaSSattenGA (2008) A signed-rank test for clustered data. Biometrics, 64, 501–07.
5.
DattaSNevalainenJOjaH (2012) A general class of signed rank tests for clustered data when the cluster size is potentially informative. Journal of Nonparametric Statistics, 24, 797–808.
6.
GanskySANeuhausJM (2009) Missing data and informative cluster sizes. In LesaffreEFeineJLerouxBDeclerckD (eds), Statistical and methodological aspects of oral health research (pp. 241–58). Chichester: John Wiley & Sons.
JureckovaJ (1971) Nonparametric estimate of regression coeffcients. The Annals of Mathematical Statistics, 42, 1328–38.
12.
KoulHLSieversGMcKeanJW (1987) An estimator of the scale parameter for the rank analysis of linear models under general score functions. Scandinavian Journal of Statistics, 14, 131–41.
13.
McKeanJHettmanspergerT (1978) A robust analysis of the general linear model based on one step R-estimates. Biometrika, 65, 571–79.
14.
NeuhausJMMcCullochCE (2011) Estimation of covariate effects in generalized linear mixed models with informative cluster sizes. Biometrika, 98, 147–62.
15.
NevalainenJDattaSOjaH (2013) Inference on the marginal distribution of clustered data with informative cluster size. Statistical Papers. DOI 10.1007/s00362–013–0504–3.
16.
R Core Team (2012) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at http://www.Rproject.org/.
17.
SinghRS (1979) Mean squared errors of a density and its derivatives. Biometrika, 66, 177–80.
18.
WangMKongMDattaS (2011) Inference for marginal linear models with clustered longitudinal data with potentially informative cluster sizes. Statistical Methods in Medical Research, 20, 347–67.
19.
WangY-GZhaoY (2008) Weighted rank regression for clustered data analysis. Biometrics, 64, 34–45.
20.
WangY-GZhuM (2006) Rank-based regression for analysis of repeated measures. Biometrika, 93, 459–64.
21.
WilliamsonJMDattaSSattenGA (2003) Marginal analyses of clustered data when cluster size is informative. Biometrics, 59, 36–42.