Abstract
Studies have shown an association between ARID5B gene polymorphisms and childhood acute lymphoblastic leukemia. However, the association between ARID5B variants and acute lymphoblastic leukemia among the Arab population still needs to be studied. The aim of this study was to investigate the association between ARID5B variants with acute lymphoblastic leukemia in Yemeni children. A total of 14 ARID5B gene single nucleotide polymorphisms (SNPs) were genotyped in 289 Yemeni children, of whom 136 had acute lymphoblastic leukemia and 153 were controls, using the nanofluidic Dynamic Array (Fluidigm 192.24 Dynamic Array). Using logistic regression adjusted for age and gender, the risks of acute lymphoblastic leukemia were presented as odds ratios and 95% confidence intervals. We found that nine SNPs were associated with acute lymphoblastic leukemia under additive genetic models: rs7073837, rs10740055, rs7089424, rs10821936, rs4506592, rs10994982, rs7896246, rs10821938, and rs7923074. Furthermore, the recessive models revealed that six SNPs were risk factors for acute lymphoblastic leukemia: rs10740055, rs7089424, rs10994982, rs7896246, rs10821938, and rs7923074. The gender-specific impact of these SNPs under the recessive genetic model revealed that SNPs rs10740055, rs10994982, and rs6479779 in females, and rs10821938 and rs7923074 in males were significantly associated with acute lymphoblastic leukemia risk. Under the dominant model, SNPs rs7073837, rs10821936, rs7896246, and rs6479778 in males only showed striking association with acute lymphoblastic leukemia. The additive model revealed that SNPs with significant association with acute lymphoblastic leukemia were rs10821936 (both males and females); rs7073837, rs10740055, rs10994982, and rs4948487 (females only); and rs7089424, rs7896246, rs10821938, and rs7923074 (males only). In addition, the ARID5B haplotype block (CGAACACAA) showed a higher risk for acute lymphoblastic leukemia. The haplotype (CCCGACTGC) was associated with protection against acute lymphoblastic leukemia. In conclusion, our study has shown that ARID5B variants are associated with acute lymphoblastic leukemia in Yemeni children with several gender biases of ARID5B single nucleotide polymorphisms reported.
Introduction
Acute lymphoblastic leukemia (ALL) is the most common pediatric cancer among patients younger than 15 years, representing 26% of all tumors and 78% of leukemias in these patients. 1 At least 54,000 new ALL cases in Asia are estimated to occur each year. 2 In Yemen, the estimated incidence rate of leukemia is 10% of all cancers, with a mortality rate of 12.3%. 3 The available data on the epidemiology of childhood leukemia in Yemen are limited, but it has been reported that the highest frequency of leukemia occurs in children under the age of 4 years. 4
Leukemic clones have been detected early in pre-natal stages in both individuals who develop and do not develop the disease.5,6 Additional genetic lesions are required to activate the leukemogenesis process.7,8 Genome wide association (GWA) studies on populations of European ancestry have revealed that ARID5B single nucleotide polymorphisms (SNPs) are associated with childhood ALL.9–11 Several follow-up studies have confirmed the association of ALL risk with ARID5B genetic variations.12–20
ARID5B belongs to the family of AT-rich interaction domain transcription factors 21 that plays a critical role in embryogenesis and cell growth regulation. 22 Childhood ALL shows substantial variations in disease incidence across geographic areas, with higher prevalence in European than that reported in Asian populations. 23 Different populations are well distinguished by race/ethnicity-related genetic polymorphisms, which may lead to these incidence disparities, in addition to environmental risk factors.24,25 It is important to study the effects of these genetic variations in different ethnicities to understand the differences in susceptibility to ALL and infer the causes and mechanism of ALL progression.
To the best of our knowledge, the impact of ARID5B variants on childhood ALL in Yemen has not been studied. Therefore, the goal of this study was to determine the possible association of ARID5B gene SNPs and haplotypes with ALL in Yemeni children of Arab Asian ancestry.
Materials and methods
Ethical approval
Approval for this case–control study was obtained from the Medical Ethics Committee of the University Malaya Medical Centre (RF No.: 989.39). Permission was obtained from the hospitals authorities in Sana’a, Yemen before commencing the study. Written informed consent was obtained from all parent or guardian respondents.
Subjects and collection of data
Leukemic children aged between 2 and 15 years who had been diagnosed with ALL and were under treatment at government hospitals in Sana’a, Yemen, were recruited to this study as the case group. The diagnosis of ALL was based on bone marrow and peripheral blood morphology and histochemistry. For the control group, healthy children with normal complete blood counts were approached to participate in this study. Venous blood samples (4 mL) were collected under sterile conditions from each child into two labeled Vacutainer® containing K2EDTA: one for hematologic analysis and the second for DNA extraction.
Hematologic analysis
Complete blood counts were performed using a Mythic™ 22 AL fully automated 22-parameter hematology analyzer (Orphee SA, Switzerland).
Genetic analysis
ARID5B intronic SNPs—rs7073837, rs10740055, rs7089424, rs10821936, rs4506592, rs10994982, rs7896246, rs10821938, rs7923074, rs6479778, rs4948487, rs6479779, rs2893881, and rs10994990—were chosen based on the findings of previous studies.9–11,15,17,19,26,27 Extraction of genomic DNA from peripheral blood leukocytes was performed using the FlexiGene DNA Kit (Qiagen, Valencia, CA, USA). ARID5B SNPs genotyping was conducted by Fluidigm 192.24 Dynamic Array and read by a BioMark HD Reader (Fluidigm Corporation, San Francisco, CA, USA) according to the manufacturer’s protocol. The call rates for ARID5B SNPs rs10740055, rs10821936, rs4506592, rs10994982, rs10821938, and rs4948487 were > 96% in both control and ALL groups. The call rates for rs7073837, rs7089424, rs7896246, rs6479778, and rs6479779 were ≥86% in the control group and ≥90% in the ALL group, while rs7923074 showed a call rate ≥86% in both groups. The concordance rate was 100% based on 40% blind duplicate samples. Controls without template were analyzed in each run of the Fluidigm 192.24 Dynamic Array to ensure no DNA contamination was present.
Statistical analysis
Hardy–Weinberg Equilibrium (HWE) analysis was achieved with DeFinetti software (http://ihg.gsf.de/cgi-bin/hw/hwa1.pl; the Institute of Human Genetics). SNP & Variation Suite v8.x software (Golden Helix, Bozeman, MT, USA) was used to analyze linkage disequilibrium (LD) and for building up haplotypes and diplotypes of related SNPs. Statistical analyses were done using Social Package of Statistical Science (SPSS) 20.0 (LEAD Technologies, Inc., Charlotte, NC, USA). Hematologic parameters were log-transformed and then transformed back and presented as geometric means. Comparisons between groups were assessed by independent sample t-test. The evaluation of the association between ALL and ARID5B SNP genetic models (recessive, dominant, and additive), haplotypes, and diplotypes was performed by logistic regression analysis controlled for age and gender as covariates. A significant difference was considered when p < 0.05.
Results
A total of 136 children with ALL and 153 non-leukemic (control) children were included in this study. The demographic information and hematologic test results of the children are provided in Table 1.
Demography and hematologic parameters.
ALL: acute lymphoblastic leukemia.
Results presented represent geometric means (95% confidence interval of means), ALL versus normal control group evaluated by independent samples t-test.
ARID5B SNPs rs7073837, rs10740055, rs7089424, rs10821936, rs4506592, rs10994982, rs7896246, rs10821938, rs7923074, rs6479778, rs4948487, and rs6479779 were consistent with HWE (p > 0.05) in the control group. Two SNPs rs2893881 and rs10994990 deviated from HWE (p = 0.02, 0.0001, respectively) so they were no longer analyzed.
The logistic regression models (controlled for age and gender) revealed that the additive genetic models of ARID5B SNPs rs7073837 rs10740055, rs7089424, rs10821936, rs4506592, rs10994982, rs7896246, rs10821938, and rs7923074 were significantly associated with ALL (odds ratio (OR) = 1.66, 1.60, 1.46, 1.69, 1.44, 1.58, 1.75, 1.48, 1.7; p = 0.008, 0.009, 0.047, 0.004, 0.04, 0.01, 0.004, 0.02, 0.005, respectively) (Table 2).
Association of ARID5B polymorphisms with acute lymphoblastic leukemia among Yemeni children.
Freq.: risk allele frequency; SNP: single nucleotide polymorphism; OR: odds ratio; CI: confidence interval.
Controlled for age and gender. In the additive model, genotype of homozygote for the non-risk allele 11, heterozygote 12, and homozygote for the risk allele 22. The recessive model was defined as 22 versus 12 + 11 and dominant model as 22 + 12 versus 11. The outliers (studentized residual is greater than 2.0 or less than −2.0) were excluded. Bold values are significant.
The dominant genetic models revealed a significant association of rs7073837 (OR = 2.44, p = 0.004), rs10740055 (OR = 1.83, p = 0.04), rs10821936 (OR = 2.00, p = 0.009), rs7896246 (OR = 2.08, p = 0.01), and rs7923074 (OR = 2.02, p = 0.02) with ALL. Similarly, rs10740055 (OR = 1.96, p = 0.02), rs7089424 (OR = 2.19, p = 0.03), rs10994982 (OR = 2.02, p = 0.017), rs7896246 (OR = 2.14, p = 0.04), rs10821938 (OR = 1.93, p = 0.03), and rs7923074 (OR = 2.12, p = 0.02) showed a significant association with ALL under recessive genetic models. ARID5B SNPs rs6479778, rs4948487, and rs6479779 showed no association (Table 2).
In relation to gender, a significant association with ALL under recessive genetic models at SNPs rs10740055 (OR = 4.89, p = 0.003), rs10994982 (OR = 4.92, p = 0.002) and rs6479779 (OR = 5.18, p = 0.009) was found in females, while rs10821938 (OR = 2.61, p = 0.02) and rs7923074 (OR = 2.51, p = 0.035) were significantly associated with ALL in males (Table 3). The additive genetic models of rs7073837 (OR = 1.90, p = 0.04), rs10740055 (OR = 2.13, p = 0.01), rs10994982 (OR = 2.08, p = 0.02), and rs4948487 (OR = 1.87, p = 0.047) in females showed significant association with ALL. Conversely, rs7089424 (OR = 1.72, p = 0.03), rs7896246 (OR = 1.88, p = 0.01), rs10821938 (OR = 1.75, p = 0.02), and rs7923074 (OR = 1.69, p = 0.03) revealed a significant association under additive genetic models in males. The additive genetic model of rs10821936 was significantly associated with ALL in both males and females (OR = 1.70, p = 0.02; OR = 2.00, p = 0.03, respectively) (Table 3).
Associations by gender for ARID5B SNPs with acute lymphoblastic leukemia among Yemeni children.
Freq.: risk allele frequency; SNP: single nucleotide polymorphism; OR: odds ratio; CI: confidence interval.
Controlled for age. In the additive model, genotype of homozygote for the non-risk allele 11, heterozygote 12, and homozygote for the risk allele 22. The recessive model was defined as 22 versus 12 + 11 and dominant model as 22 + 12 versus 11. The outliers (studentized residual is greater than 2.0 or less than −2.0) were excluded. Bold values are significant.
Nine SNPs are located in one haplotype block with significant LD (Figure 1 and Table 4). The frequencies of haplotypes and diplotypes that were <0.03 in the total sample were excluded from the additional analysis. The haplotype (CGAACACAA) was significantly associated with ALL risk (OR = 4.29, p = 0.015). In contrast, the frequencies of the haplotypes (CCCGACTGC) and (CGCACATGA) were higher in the control group than in leukemic children (OR = 0.24, p = 0.048; OR = 0.13, p = 0.076, respectively). The diplotypes revealed no significant association with acute lymphoblastic leukemia (Table 5).

Pairwise linkage disequilibrium among ARID5B SNPs in Yemeni children.
The estimated values of linkage disequilibrium analysis between the nine ARID5B SNPs in Yemeni children.
SNP: single nucleotide polymorphism.
The correlation coefficients D′ and r2 between nine SNPs were shown in the above and below diagonal, respectively.
Association of common haplotypes and diplotypes with acute lymphoblastic leukemia among Yemeni children.
OR: odds ratio; CI: confidence interval.
Controlled for age and gender. The outliers (studentized residual is greater than 2.0 or less than −2.0) were excluded. Bold values are significant.
Discussion
We evaluated the association between ARID5B SNPs and ALL in Yemeni children who are of Arab Asian descent. Several genome wide and candidate gene association studies have reported strong associations between ARID5B SNPs and ALL risk in different populations.9–12,15,17–20,27–30 Previously, GWA studies have been conducted in populations of European, Hispanic, and African American ancestry. Our findings in Yemeni population are in agreement with these studies, confirming ARID5B SNPs as general susceptibility markers for ALL. The main findings in our study are that rs7073837, rs10740055, rs7089424, rs10821936, rs4506592, rs10994982, rs7896246, rs10821938, and rs7923074 polymorphisms are associated with an increased risk of childhood ALL. The strongest associations were with rs10821936 and rs7896246 and these genetic effects seem to be additive. The effects of rs7073837, rs10740055, rs10994982, and rs4948487 were more pronounced in females than in males, while rs7089424, rs7896246, rs10821938, and rs7923074 were significantly associated with ALL in males. Noteworthy, rs10821936 was the only SNP found to be associated with ALL in both males and females.
B-cell progenitors and bone marrow cellularity are decreased in homozygous Arid5b knockout mice, confirming the role of ARID5B in B-cell lineage development.31,32 Abnormal expression of the ARID5B gene during fetal life has been found to inhibit B-lymphocyte development and contribute to leukemogenesis. 15 It has been suggested that ARID5B variations may affect the function of this gene during the maturation of B-progenitor cells, leading to an increased risk of B-ALL.11,31 ARID5B SNPs rs7089424, rs7073837, rs10821936, and rs10821938 were found to have putative roles in the regulation of gene transcription, thereby affecting either ARID5B expression or splicing, likely producing different isoforms or preventing the binding of different transcription factors.33,34
In comparison to other Asian populations, studies in Thailand and Taiwan showed no association between rs7089424 and ALL;32,35 however, we found that this SNP was associated with ALL risk in Yemeni children. Different populations are likely to have different LD patterns and may not share the same susceptibility loci. The different levels of ALL risk are related to the genetic variations in different ethnicities. 20 Furthermore, these genetic variations may interact with different ancestral backgrounds, genetic make-up, and environmental factors in multiple ways to either increase or decrease the susceptibility to ALL in different areas. Conversely, we are in line with a Korean study that revealed an association between rs7089424 and ALL in Korean children. 14
This study shows that rs10821936 and rs7896246 were the most significant signals detected for ARID5B SNPs and were in strong LD with each other (r2 = 0.83), correlating with reports of rs10821936 being highly related to ALL susceptibility in European, French-Canadian, and Chinese populations.11,15,30 In addition, Xu et al. 19 reported that the strongest association between this SNP and ALL risk is in Whites and Hispanics. A recent meta-analysis study of 13 case–control studies involving 7147 patients and 31,969 controls reported that ARID5B rs10821936 SNP is associated with a high risk for ALL. 36 Functional studies are needed to explore how this SNP contributes to the increased risk of ALL.
Due to conflicting studies relating to possible gender-specific associations with ALL, a stratified analysis of ALL between males and females was also performed. Although some previous studies have reported that the association between ARID5B SNPs and ALL is more marked in females,12,16 Healy et al. 15 contradicted those studies by reporting a male bias in their gender-specific association analysis. However, there have been other reports showing no differences between males and females in the influence of ARID5B variations on the increased risk of ALL.19,27,29 As a consequence of these diverging results, the effect of gender on ALL risk is as yet unknown.
We performed haplotype analysis and demonstrated that the haplotype (CGAACACAA), consisting of the risk alleles of eight SNPs, was associated with a fourfold increase in ALL risk. The associations of these individual ARID5B SNPs are not independent. However, the haplotype (CCCGACTGC) includes non-risk alleles which have revealed a protective effect.
Conclusion
We have conducted the first study on the association of ARID5B gene variants with ALL risk in Yemeni children. Some ARID5B SNPs are associated with increased susceptibility to ALL in females than in males and vice versa. In addition, the ARID5B haplotype containing SNP risk alleles supports the contribution of ARID5B SNPs to an increased risk of childhood ALL. Further studies with a larger sample size are warranted to confirm our results. Also, functional studies are needed to determine the causative variants of ARID5B gene and their effects on increased childhood ALL risk. Understanding how these SNP variations affect the overall structure and function of the ARID5B protein would be of important value in the diagnosis of childhood leukemia, improving risk-directed treatments and disease outcomes.
Footnotes
Acknowledgements
The authors are grateful to all children and their parents for their valuable participation. They also appreciate the kind cooperation of all medical staff and nursing in this study at Government hospitals in Sana’a city, Yemen. B.A., S.M., R.S.A., and S.M.N. contributed to study design; B.A. contributed to sample collection; B.A., R.H.A., and S.D.S. performed the lab work and statistical analyses; B.A., S.M., S.M.N., and M.F.M.R. contributed to reagents/materials/analysis tools; and B.A., S.M., S.M.N., R.S.A., and M.F.M.R. wrote the paper.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
This study was supported by grants from the University of Malaya (HIR/420001-E000046) and IPPP (PG039/2013A).
