Sage Journals: Discover world-class research

Abstract

We develop robust methods for analyzing clustered data where estimation of marginal regression parameters is of interest. Inverse cluster size reweighting in the objective function to be minimized is incorporated to handle the issue of informative cluster size. Performance of the resulting estimators is studied by simulation. Large sample inference and variance estimation is carried out. The methodology is illustrated using a periodontal disease dataset.

Keywords

Informative cluster size random cluster size R estimator dental data

Get full access to this article

View all access options for this article.

References

Beck

Koch

Rozier

Tudor

(1990) Prevalence and risk indicators for periodontal attachment loss in a population of older community-dwelling blacks and whites. Journal of Periodontology, 61, 521–28.

Blazer

George

(2004) Established populations for epidemiologic studies of the elderly, 1996–1997: Piedmont health survey of the elderly, fourth in-person survey Durham, Warren, Vance, Granville, and Franklin Counties, North Carolina [Computer file]. ICPSR02744-v1. Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], doi:10.3886/ICPSR02744.

Datta

Satten

(2005) Rank-sum tests for clustered data. Journal of American Statistical Association, 100, 908–15.

Datta

Satten

(2008) A signed-rank test for clustered data. Biometrics, 64, 501–07.

Datta

Nevalainen

Oja

(2012) A general class of signed rank tests for clustered data when the cluster size is potentially informative. Journal of Nonparametric Statistics, 24, 797–808.

Gansky

Neuhaus

(2009) Missing data and informative cluster sizes. In Lesaffre

Feine

Leroux

Declerck

(eds), Statistical and methodological aspects of oral health research (pp. 241–58). Chichester: John Wiley & Sons.

Hettmansperger

McKean

(2011) Robust nonparametric statistical methods, 2nd ed. New York: Chapman & Hall.

Hoffman

Sen

Weinberg

(2001) Within-cluster resampling. Biometrika, 88, 1121–34.

Jaeckel

(1972) Estimating regression coefficients by minimizing the dispersion of the residuals. The Annals of Mathematical Statistics, 43, 1449–58.

10.

Jung

Ying

(2003) Rank-based regression with repeated measurements data. Biometrika, 90, 732–740.

11.

Jureckova

(1971) Nonparametric estimate of regression coeffcients. The Annals of Mathematical Statistics, 42, 1328–38.

12.

Koul

Sievers

McKean

(1987) An estimator of the scale parameter for the rank analysis of linear models under general score functions. Scandinavian Journal of Statistics, 14, 131–41.

13.

McKean

Hettmansperger

(1978) A robust analysis of the general linear model based on one step R-estimates. Biometrika, 65, 571–79.

14.

Neuhaus

McCulloch

(2011) Estimation of covariate effects in generalized linear mixed models with informative cluster sizes. Biometrika, 98, 147–62.

15.

Nevalainen

Datta

Oja

(2013) Inference on the marginal distribution of clustered data with informative cluster size. Statistical Papers. DOI 10.1007/s00362–013–0504–3.

16.

R Core Team (2012) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at http://www.Rproject.org/.

17.

Singh

(1979) Mean squared errors of a density and its derivatives. Biometrika, 66, 177–80.

18.

Wang

Kong

Datta

(2011) Inference for marginal linear models with clustered longitudinal data with potentially informative cluster sizes. Statistical Methods in Medical Research, 20, 347–67.

19.

Wang

Y-G

Zhao

(2008) Weighted rank regression for clustered data analysis. Biometrics, 64, 34–45.

20.

Wang

Y-G

Zhu

(2006) Rank-based regression for analysis of repeated measures. Biometrika, 93, 459–64.

21.

Williamson

Datta

Satten

(2003) Marginal analyses of clustered data when cluster size is informative. Biometrics, 59, 36–42.

Robust estimation of marginal regression parameters in clustered data

Abstract

Keywords

Get full access to this article

References