OBJECTIVE—We performed a comprehensive genetic association study of common variation spanning the IGF2BP2 locus in order to replicate the association of the “confirmed” type 2 diabetes susceptibility variants rs4402960 and rs1470579 in the French Caucasian population and to further characterize the susceptibility variants at this novel locus.
RESEARCH DESIGN AND METHODS—We genotyped a total of 21 tagging single nucleotide polymorphisms spanning the IGF2BP2 locus in our type 2 diabetes case-control cohort comprising 3,093 French Caucasian subjects.
RESULTS—IGF2BP2 variants rs4402960 and rs1470579 were not associated with type 2 diabetes in the present study (P = 0.632 and P = 0.896, respectively). Meta-analysis of genotype data from over 34,000 subjects demonstrated that our inability to replicate rs4402960/rs1470579 was consistent with the findings from several previous genome-wide association study (GWAS) datasets that were underpowered to detect this modest association signal (odds ratio [OR] 1.14). We obtained novel evidence that rs9826022, a borderline rare variant (5% minor allele frequency) in the 3′ downstream region, was associated with type 2 diabetes (P = 0.0002; OR 1.53 [95% CI 1.22–1.91]). This result was corroborated by the meta-analysis of 10,542 genotypes from the current study and GWAS datasets using both fixed (P = 9.47 × 10−6; 1.30 [1.16–1.46]) and random effects (P = 0.001; 1.30 [1.11–1.52)] calculations.
CONCLUSIONS—We were unable to replicate the confirmed rs4402960/rs1470579 susceptibility variants but found novel evidence for a rare variant in the 3′ downstream region of IGF2BP2. Further genetic and functional studies are required to identify the etiological IGF2BP2 variants.
The insulin-like growth factor 2 mRNA binding protein 2 (IGF2BP2) gene on chromosome 3q27 is a paralog of IGF2BP1, a known regulator of IGF2 gene expression. Genome-wide association studies (GWASs) carried out by the Finland-U.S. Investigation of NIDDM Genetics (FUSION) (1), the Wellcome Trust Case Control Consortium (WTCCC) (2), and the Diabetes Genetics Initiative (DGI) (3) groups each found modest evidence that single nucleotide polymorphisms (SNPs) in the IGF2BP2 region are associated with type 2 diabetes. The subsequent meta-analysis of primary and replication datasets from these GWASs corroborated these findings and identified two strongly correlated IGF2BP2 variants, rs1470579 and rs4402960, as “confirmed” type 2 diabetes susceptibility variants (1–3). By contrast, the French/Canadian GWAS (4) typed 10 SNPs across the IGF2BP2 locus, including rs1470579, in 1,363 subjects, but found no nominal (P < 0.05) association signals at IGF2BP2. In an attempt to replicate the IGF2BP2 association findings in the French Caucasian population in a larger study and to further characterize the susceptibility variants at this novel locus, we performed an association study of HapMap Phase II tag SNPs spanning the IGF2BP2 locus in 3,093 French Caucasian subjects.
RESEARCH DESIGN AND METHODS
All subjects were of French Caucasian ancestry. Individuals identified by Sladek et al. (4) as lying outside the HapMap CEU ancestry cluster were excluded from the study. Type 2 diabetic case subjects were known diabetic patients. Normoglycemic control subjects were selected to have a fasting blood glucose concentration <7.0 mmol/l (5). Case subjects were composed of 1) 372 probands from diabetic families (6), recruited in Lille; and 2) 1,083 patients with a family history of type 2 diabetes, recruited at the Corbeil-Essonne Hospital. Control subjects were composed of 1) 353 normoglycemic parents from type 2 diabetic families; 2) 543 subjects from the SUVIMAX (Supplementation en Vitamines et Minéraux Antioxidant) prospective population-based cohort study (7); and 3) 742 subjects selected from the DESIR (Data from an Epidemiologic Study on the Insulin Resistance Syndrome) cohort, a large prospective study of insulin resistance in French subjects (8). Informed consent was obtained from all subjects, and the study was approved by local ethics committees.
Statistical power.
The case-control cohort comprised 1,455 type 2 diabetic subjects (age 60 ± 12 years, BMI 29.0 ± 6.0 kg/m2, sex [male:female] 56:44%) and 1,638 normoglycemic subjects (aged 54 ± 13 years, BMI 24.1 ± 3.3 kg/m2, sex 43:57%). At α = 0.05, this sample size provided 76% power (9) to detect the type 2 diabetes susceptibility variants rs1470579 and rs4402960, assuming an allele frequency of 0.30, a disease prevalence of 0.1, a heterozygote relative risk of 1.14 (1–3), a multiplicative model, and a 100% genotype call rate.
IGF2BP2 tag SNP selection.
The genomic target region for tag SNP selection was extended 10 kb upstream and downstream of the NCBI36 IGF2BP2 locus (chr. 3:186,844,221.0.187,025,521). A total of 19 HapMap Phase II multimarker tagging SNPs (HapMap Data Release 21a/Jan07) with r2 and minor allele frequency thresholds of 0.8 and 0.05, respectively, were identified for genotyping. In addition, the two GWAS-identified susceptibility variants rs4402960 and rs1470579 (1–3) were added to the genotyped SNP set, making a total of 21 genotyped SNPs.
SNP genotyping.
Genotyping was performed with the Sequenom MassARRAY iPLEX system (10). SNP genotype frequencies were tested for accordance with Hardy-Weinberg equilibrium using χ2 analysis. Regarding quality control, all 21 genotyped SNPs exhibited a call rate >90% and a Hardy-Weinberg P > 0.05, with well-defined genotype clusters. There was no evidence (at α = 0.01) of differential call rates across case and control subjects for any SNP (supplementary Table 2 [available in an online appendix at http://dx.doi.org/10.2337/db07-1789]).
Statistical analyses.
To test for association of IGF2BP2 SNPs with type 2 diabetes, χ2 analysis of allele and genotype counts was performed. Pairwise SNP linkage disequilibrium (LD) values were calculated from the genotype data of the control cohort with Haploview (11). Quantitative metabolic phenotypes, BMI, waist-to-hip ratio, fasting serum levels of triacylglycerol, total and HDL cholesterol, glucose, insulin, apolipoprotein A-I (ApoA1), and apolipoprotein B (ApoB), measured in 1,539 normoglycemic subjects from the control cohort, were log transformed and adjusted for age, sex, and BMI, as appropriate. SNPs were tested for association with adjusted quantitative traits using SPSS 14.0 with the ANOVA test under a codominant model. Quantitative trait association P values are presented uncorrected for multiple testing. Combined analysis of association datasets was carried out with the Mantel-Haenszel (fixed effects) meta-analysis method. Interstudy heterogeneity was assessed with Cochran's Q statistic and the I2 metric (12,13). All calculations were performed using R (version 2.5.1) statistical software and the Meta (version 0.8–2) package (14). Association analysis of SNPs captured by multimarker tags was carried out with the PLINK software package (15). Haplotype association was performed with the WHAP (version 2.09) software package (16).
RESULTS AND DISCUSSION
A total of 21 HapMap Phase II multimarker tag SNPs (r2 ≥ 0.8; minor allele frequency ≥0.05) spanning the IGF2BP2 locus, including the susceptibility variants rs4402960 and rs1470579, were tested for association with type 2 diabetes in 3,093 French subjects. The allele and genotype counts for all SNPs are presented in online supplementary Tables 1 and 2, respectively. Figure 1 shows that SNPs rs4402960 and rs1470579 exhibited very strong LD (r2 = 0.95) in agreement with the GWAS (1–3) and HapMap data (17). However, the allele frequencies of SNPs rs4402960 and rs1470579 were not significantly different in the case and control groups (rs4402960, P = 0.632; rs1470579, P = 0.896), indicating that these variants were not associated with type 2 diabetes in the present study (Table 1). None of the three SNPs captured by multimarker tags (rs4575929, P = 0.159; rs4686692, P = 0.566; and rs16860216, P = 0.972;) were associated with type 2 diabetes (online supplementary Table 3).
Our inability to replicate the confirmed rs4402960/rs1470579 association result can be attributed to a lack of power to detect this modest signal. An examination of the published association evidence for these variants (Fig. 2A and B and online supplementary Tables 4 and 5) illustrates this point and demonstrates that our results are not inconsistent with those of previous studies. Of the nine published rs4402960 datasets, the three statistically well-powered studies (those with ≥90% power) all obtained an association for this variant, while the six underpowered studies showed either no association or a weak association with type 2 diabetes. Overall, the combined data shows a 3% difference in allele frequency between the case and control groups in over 34,000 subjects, which equates to very strong evidence of association (P = 1.9 × 10−14; OR 1.13 [95% CI 1.10–1.17]). Similarly for the rs1470579 variant, the underpowered datasets were either nonsignificant or weakly associated with type 2 diabetes. The combined data shows a 2% allele frequency difference in over 22,000 case-control subjects and a clearly significant association with type 2 diabetes (P = 2.6 × 10−9; 1.13 [1.09–1.18]). All of this serves as a reminder that the meta-analysis of individually underpowered studies has an invaluable role to play in the identification and confirmation of susceptibility variants of small effect.
The between-study heterogeneity metric I2 (12,13) was calculated for these two variants (Fig. 2D). Heterogeneity was moderate for rs4402960 (I2 = 21%) in agreement with a recent study (18). For rs1470579, the meta-analyzed signal is clearly driven by the DGI Replication set “S” result (P = 3.73 × 10−8). In accordance with this standout result and the smaller number of studies available for this SNP, heterogeneity was higher (I2 = 58%); the random effects OR gave a mere P = 0.001 compared with the Mantel-Haenszel P = 2.56 × 10−9; and Cochran's Q statistic was also statistically significant (P = 0.037).
We obtained novel evidence that rs9826022 in the 3′ downstream region (P = 0.0002; OR 1.53 [95% CI 1.22–1.91]) was associated with type 2 diabetes (Table 1). This result survived Bonferroni correction for the number of SNPs tested (adjusted P = 0.003), and we sought confirmation in the publicly available GWAS data. The WTCCC (http://www.wtccc.org.uk/) and DGI (http://www.broad.mit.edu/diabetes/) GWAS did not directly type the rs9826022 variant but instead typed rs9878208, an rs9826022 proxy (HapMap r2 = 1). Meta-analysis of this data (Fig. 2C and online supplementary Table 6) provided support for the association, although the random effects evidence (P = 0.001) was weaker than that produced by the Mantel-Haenszel analysis (P = 9.47 × 10−6). The heterogeneity between these three studies was moderate (I2 = 44%). The disparity between the fixed and random effects may indicate that rs9826022 is not the “causative” variant but merely in partial LD with the true susceptibility variant; or it may simply reflect the “winner's curse” result of the present study and the inherently larger variance of the genetic effect of rare variants in moderately sized studies. The rs9826022 result will clearly require confirmation in further large, independent studies before a definitive assessment of the contribution of this rare variant to type 2 diabetes susceptibility can be made.
The only other nominal association signal, rs9864104 (P = 0.012; OR 1.19 [95% CI 1.04–1.37]), was modest and disappeared upon multiple test correction. Since rs9826022 and rs9864104 were in low-moderate LD (r2 = 0.26), we carried out haplotype analysis of these variants. Two SNP haplotypes containing the rare allele of rs9826022 showed a virtually identical frequency and P value as the single-point rs9864104 analysis (online supplementary Table 7), indicating that the haplotype analysis did not add anything to the single-point analysis and that the weak rs9864104 signal was caused by this variant being in partial LD with rs9826022. There were no significant differences in SNP allele frequencies between men and women, and no association with type 2 diabetes was uncovered by stratifying for sex (data not shown).
SNPs rs4402960, rs1470579, rs9826022, and rs9864104 were also tested for association with a number of metabolic quantitative phenotypes (online supplementary Table 8). SNPs rs4402960 and rs1470579 presented weak associations with ApoA1 (P = 0.019 and 0.028, respectively). SNP rs9864104 was associated with ApoA1 (P = 0.008) and ApoB levels (P = 0.002), although there was no linear trend between the three genotype groups. The association of IGF2BP2 variation with apolipoprotein levels may be consistent with the role of the insulin-like growth factor system in regulating lipid metabolism. However, in the absence of replication, we emphasize that these quantitative trait associations are of nominal significance and require confirmation in further large studies.
In conclusion, we have carried out a comprehensive association study of common variation spanning the IGF2BP2 locus and type 2 diabetes in French Caucasians. We were unable to replicate the confirmed susceptibility variants rs4402960 and rs1470579 but found novel evidence for a rare variant in the 3′ downstream region of IGF2BP2. Further genetic and functional studies are required to identify the etiological variants at the IGF2BP2 locus and to determine the cellular and physiological mechanisms by which they act to modulate type 2 diabetes susceptibility.
SNP . | Chr. Position (bp) NCBI36 . | Gene region* . | Allele . | Allele count . | . | P . | OR (95% CI) . | |
---|---|---|---|---|---|---|---|---|
. | . | . | . | Type 2 diabetic case subjects (%) . | Normoglycemic control subjects (%) . | . | . | |
rs9826022 | 186,839,954 | 3′ downstream | A | 2,532 (93.0) | 2,949 (95.3) | 0.0002 | 1.53 (1.22–1.91) | |
(+4267 bp) | G | 190 (7.0) | 145 (4.7) | |||||
rs9864104 | 186,840,225 | 3′ downstream | C | 2,260 (81.7) | 2,628 (84.2) | 0.012 | 1.19 (1.04–1.37) | |
(+3996 bp) | T | 506 (18.3) | 494 (15.8) | |||||
rs4402960 | 186,994,381 | Intron 6 | G | 1,786 (67.5) | 2,111 (68.1) | 0.632 | 1.03 (0.92–1.15) | |
T | 858 (32.5) | 987 (31.9) | ||||||
rs1470579 | 187,011,773 | Intron 5 | A | 1,753 (67.3) | 2,120 (67.4) | 0.896 | 1.01 (0.90–1.13) | |
C | 853 (32.7) | 1,024 (32.6) |
SNP . | Chr. Position (bp) NCBI36 . | Gene region* . | Allele . | Allele count . | . | P . | OR (95% CI) . | |
---|---|---|---|---|---|---|---|---|
. | . | . | . | Type 2 diabetic case subjects (%) . | Normoglycemic control subjects (%) . | . | . | |
rs9826022 | 186,839,954 | 3′ downstream | A | 2,532 (93.0) | 2,949 (95.3) | 0.0002 | 1.53 (1.22–1.91) | |
(+4267 bp) | G | 190 (7.0) | 145 (4.7) | |||||
rs9864104 | 186,840,225 | 3′ downstream | C | 2,260 (81.7) | 2,628 (84.2) | 0.012 | 1.19 (1.04–1.37) | |
(+3996 bp) | T | 506 (18.3) | 494 (15.8) | |||||
rs4402960 | 186,994,381 | Intron 6 | G | 1,786 (67.5) | 2,111 (68.1) | 0.632 | 1.03 (0.92–1.15) | |
T | 858 (32.5) | 987 (31.9) | ||||||
rs1470579 | 187,011,773 | Intron 5 | A | 1,753 (67.3) | 2,120 (67.4) | 0.896 | 1.01 (0.90–1.13) | |
C | 853 (32.7) | 1,024 (32.6) |
Data are n (%) unless otherwise indicated.
Relative to the NCBI36 coordinates of the IGF2BP2 genomic locus (chr. 3:186,844,221.0.187,025,521). χ2 P values are shown.
Published ahead of print at http://diabetes.diabetesjournals.org on 22 April 2008.
Readers may use this article as long as the work is properly cited, the use is educational and not for profit,and the work is not altered. See http://creativecommons.org/licenses/by-nc-nd/3.0/ for details.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Article Information
This work was supported by a Wellcome grant to F.G. (WT081510MA).
We thank the anonymous referees and editorial staff for critically appraising and improving draft versions of the manuscript.