Sage Journals: Discover world-class research

Abstract

The non-central X2 distribution can be used to calculate power for tests detecting departure from a null hypothesis. Required sample size can also be calculated because it is proportional to the non-centrality parameter for the distribution. We demonstrate how these calculations can be carried out in Stata using the example of calculating power and sample size for case–control studies of gene–gene and gene–environment interactions. Do-files are available for these calculations.

Keywords

st0032 gene-environment interaction gene-gene interaction power sample size study design non-central x2

References

Albert

P. S.

, Ratnasinghe

, Tangrea

, and Wacholder

2001. Limitations of the case-only design for identifying gene-environment interactions. American Journal of Epidemiology 154: 687–693.

Andrieu

, and Goldstein

A.M.

2000. A case-combined design using both population based- and related-controls: a potential alternative for increasing power in gene–environment interaction detection. Genetic Epidemiology 19: 235–236.

Andrieu

, Goldstein

A. M.

, Thomas

D. C.

, and Langholz

2001. Counter-matching in studies of gene–environment interaction: efficiency and feasibility. American Journal of Epidemiology 153: 265–274.

Begg

C. B.

, and Berwick

1997. A note on the estimation of relative risks of rare genetic susceptibility markers. Cancer Epidemiology Biomarkers Prevention 6: 99–103.

Brennan

2002. Gene-environment interaction and aetiology of cancer: what does it mean and how can we measure it? Carcinogenesis 23: 381–387.

Brown

B. W.

, Lovato

, and Russell

1999. Asymptotic power calculations: description, examples, computer code. Statistics in Medicine 18: 3137–3151.

Gauderman

W. J.

2002. Sample size requirements for association studies of gene–gene interaction. American Journal of Epidemiology 155: 478–484.

Longmate

J. A.

2001. Complexity and power in case–control association studies. American Journal of Human Genetics 68: 1229–1237.

Piegorsch

W. W.

, Weinberg

C. R.

, and Taylor

J.A.

1994. Non-hierarchical logistic models and case-only designs for assessing susceptibility in population-based case–control studies. Statistics in Medicine 13: 153–162.

10.

Self

S. G.

, Mauritsen

R. H.

, and Ohara

1992. Power calculations for likelihood ratio tests in generalized linear models. Biometrics 48: 31–39.

11.

Siegmund

K. D.

, and Langholz

2001. Stratified case sampling and the use of family controls. Genetic Epidemiology 20: 316–327.

12.

Sturmer

, and Brenner

2002. Flexible matching strategies to increase power and efficiency to detect and estimate gene–environment interactions in case–control studies. American Journal of Epidemiology 155: 593–602.

13.

Weinberg

C. R.

, and Umbach

D.M.

2000. Choosing a retrospective design to assess joint genetic and environmental contributions to risk. American Journal of Epidemiology 152: 197–203.

14.

Wilks

S. S.

1938. The large sample distribution of the likelihood ratio for testing composite hypotheses. Annals of Mathematical Statistics 9: 60–62.

15.

Witte

J. S.

, Gauderman

W. J.

, and Thomas

D.C.

1999. Asymptotic bias and efficiency in case–control studies of candidate genes and gene–environment interactions: basic family designs. American Journal of Epidemiology 149: 693–705.

Sample Size Calculations for Main Effects and Interactions in Case–control Studies using Stata's nchi2 and npnchi2 Functions

Abstract

Keywords

References