Bayesian modeling for genetic association in case-control studies: accounting for unknown population substructure

Abstract

A two-stage parametric Bayesian method is proposed to examine the association between a candidate gene and the occurrence of a disease after accounting for population substructure. This procedure, implemented via a Markov chain Monte Carlo numerical integration technique, first estimates the posterior probability of different unknown population substructures and then integrates this information into a disease-gene association model through the technique of Bayesian model averaging. The model relaxes certain assumptions of previous analyses and provides a unified computational framework to obtain an estimate of the log odds ratio parameter corresponding to the genetic factor after allowing for the allele frequencies to vary across subpopulations. The uncertainty in estimating the population substructure is taken into account while providing credible intervals for parameters in the disease-gene association model. Simulations on unmatched case-control studies that mimic an admixed Argentinean population are performed to demonstrate the statistical properties of our model. The method is also applied to a real data set coming from a genetic association study on obesity.

Keywords

Bayesian model averaging gene-disease association linkage equilibrium Markov chain Monte Carlo obesity

Get full access to this article

View all access options for this article.

References

Committee on DNA Forensic Science : An Update (1996) The evaluation of forensic DNA evidence. National Academy Press, Washington DC .

Freedman ML , Reich D , Penney KL , McDonald GJ , Mignault AA , Patterson N , Gabriel SB , Topol EJ , Smoller JW , Pato CN , Pato MT , Petryshen TL , Kolonel LN , Lander ES , Sklar P , Henderson B , Hirschhorn JN and Altshuler D (2004) Assessing the impact of population stratification on genetic association studies . Nature Genetics 36, 388-393 .

Falush D , Stephens M and Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies . Genetics 164, 1567-1587 .

Hayakawa T , Nagai Y , Kahara T , Yamashita H , Takamura T , Abe T , Nomura G and Kobayashi K (2000) Gln27Glu and Arg16Gly polymorphisms of the beta2-adrenergic receptor gene are not associated with obesity in Japanese men . Metabolism 49, 1215-1218 .

Hoggart CJ , Parra EJ , Shriver MD , Bonilla C , Kittles RA , Clayton DG and McKeigue PM (2003) Control of confounding of genetic associations in stratified populations . American Journal of Human Genetics 72, 1492-1504 .

Hoggart CJ , Shriver MD , Kittles RA , Clayton DG and McKeigue PM (2004) Design and analysis of admixture mapping studies . American Journal of Human Genetics 74, 965-978 .

Johnson JA , Terra SG (2002) b- Adrenergic receptor polymorphisms: cardiovascular disease associations and pharmacogenetics . Pharmaceutical Research 19, 1779-1787 .

Knowler WC , Williams RC , Pettitt DJ and Steinberg AG (1988) GM 3; 5, 13, 14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture . American Journal of Human Genetics 43, 520-526 .

Lander ES and Schork NJ (1994) Genetic dissection of complex traits . Science 265, 2037-2048 .

10.

Lin M , Aquilante C , Johnson JA and Wu R (2005) Sequencing drug response with HapMap . The Pharmacogenomics Journal 5, 149-156 .

11.

Lynch M and Walsh B (1998) Genetics and analysis of quantitative traits. Sinauer .

12.

Madigan D and Raftery AE (1994) Model selection and model uncertainty in graphical models using Occam’s Window . Journal of American Statistical Association, 89, 1535-1546

13.

Marchini J , Cardon2 LR , Phillips MS and Donnelly P (2004) The effects of human population structure on large genetic association studies . Nature Genetics 36, 512-517 .

14.

McKeigue PM (2005) Prospects for admixture mapping of complex traits . American Journal of Human Genetics 76, 1-7 .

15.

Morton NE and Collins A (1998) Tests and estimates of allelic association in complex inheritance . Proceedings of the National Academy of Sciences of the United States of America, USA 95, 11389-11393 .

16.

Patterson N , Hattangadi N , Lane B , Lohmueller KE , Hafler DA , Oksenberg JR , Hauser SL , Smith MW , O’Brien SJ , Altshuler D , Daly MJ and Reich D (2004) Methods for high-density admixture mapping of disease genes . American Journal of Human Genetics. 74, 979-1000 .

17.

Pritchard JK , Stephens M and Donnelly P (2000a) Inference of population structure using multiocus genotype data . Genetics 155, 945-959 .

18.

Pritchard JK , Stephens M , Rosenberg NA and Donnelly P (2000b) Association mapping in structured populations . American Journal of Human Genetics 67, 170-181 .

19.

Risch N and Merikangas K (1996) The future of genetic studies of complex diseases . Science 273, 1516-1517 .

20.

Sala A , Penacino G , Carnese R and Corach D (1999) Reference database of hypervariable genetic markers of Argentina: application for molecular anthropology and forensic casework . Electrophoresis 20, 1733-1739 .

21.

Sala A , Penacino G and Corach D (1998) Comparison of allele frequencies of eight Loci from Argentinean Amerindian and European populations . Human Biology 70, 937-947 .

22.

Satten GA , Flanders WD and Yang Q (2001) Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model . American Journal of Human Genetics 68, 466-477 .

23.

Spielman RS and Ewens WJ (1998) A sibship test for linkage in the presence of association: the sib transmission/disequilibrium test . American Journal of Human Genetics 62, 450-458 .

24.

Spielman RS , McGinnis RE and Ewens WJ (1993) Transmission test for linkage disequilibrium: the insulin gene region and insulin-dependent diabetes mellitus (IDDM) . American Journal of Human Genetics 52, 506-516 .

25.

Sullivan PF , Eaves LJ , Kendler KS and Neale MC (2001) Genetic case-control association studies in neuropsychiatry . Archives of General Psychiatry 58, 1015-1024 .

26.

Takami S , Wong ZYH , Stebbing M and Harrap SB (1999) Linkage analysis of glucocorticoid and b2-adrenergic receptor genes with blood pressure and body mass index . American Journal of Physiology, Heart and Circulatory Physiology 276, 1379-1384 .