Two recent publications reported association of common polymorphisms in the P2 promoter of hepatocyte nuclear factor 4α (HNF4α) (the MODY1 gene) with risk for type 2 diabetes. We attempted to reproduce this putative association by genotyping 11 single nucleotide polymorphism (SNPs) spanning the HNF4α coding region and the P2 promoter in >3,400 patients and control subjects from Sweden, Finland, and Canada. One SNP that was consistently associated in the two previous reports (rs1884613, in the P2 promoter region) also trended in the same direction in our sample, albeit with a lower estimated odds ratio (OR) of 1.11 (P = 0.05, one-tailed). We genotyped this SNP (rs1884613) in an additional 4,400 subjects from North America and Poland. In this sample, the association was not confirmed and trended in the opposite direction (OR 0.88). Meta-analysis of our combined sample of 7,883 people (three times larger than the two initial reports combined) yielded an OR of 0.97 (P = 0.27). Finally, we provide an updated analysis of haplotype structure in the region to guide any further investigation of common variation in HNF4α. Although our combined results fail to replicate the previously reported association of common variants in HNF4α with risk for type 2 diabetes, we cannot exclude an effect smaller than that originally proposed, heterogeneity among samples, variation in as-yet-unmeasured genotypic or environmental modifiers, or true association secondary to linkage disequilibrium (LD) with as-yet-undiscovered variant(s) in the region.

Type 2 diabetes is extremely common, affecting 5–10% of people in the industrialized world. Twin studies document the role of inherited genetic factors, whereas secular trends demonstrate the powerful role of environmental and behavioral influences. In the search for genetic factors, both monogenic and polygenic forms of the disease are recognized, with only a few percentage of cases attributable to monogenic forms and the rest presumed to be attributable to a more complex cause.

Maturity-onset diabetes of the young (MODY) is an autosomal dominant form of type 2 diabetes, characterized by early onset and a defect in the function of the pancreatic β-cells. Six genes are known to cause MODY (16), and mutations in these genes are thought to act by causing abnormal regulation of transcription in β-cells, creating a defect in metabolic signaling of insulin secretion and/or β-cell mass (7). Another single-gene defect that causes type 2 diabetes is maternally inherited diabetes and deafness, caused by mutations in the mitochondrial DNA (8). Although the molecular identification of the MODY genes has illuminated the pathophysiology of diabetes, inherited variation in these genes has not previously been demonstrated to contribute in a causal manner to the common, late-onset form of type 2 diabetes.

Attempts to identify genes that are involved in the common form of type 2 diabetes have relied on genome-wide linkage studies and candidate gene association studies. Although genetic association studies for complex traits have been challenged by irreproducibility (9), several common genetic variants now have been widely replicated as influencing risk for type 2 diabetes, including the Pro12Ala polymorphism in the peroxisome proliferator–activated receptor-γ, the E23K polymorphism in the Kir6.2 gene (both reviewed in 10,11), and variation in the region of Calpain10 (12,13). For each of these genes, multiple large studies independently observed association under a consistent model of genotype-phenotype correlation, meta-analysis of all published studies gave a significant positive result, and subsequent large studies replicated the model proposed by the initial meta-analysis. Experience with these and other complex disease genes has led to a model in which common variants may have modest effects, such that very large study populations and substantial statistical significance (much greater than the typically used thresholds of P < 0.05 or P < 0.01) are required to achieve reproducible results (11,1418).

It is not yet clear how often the same genes that cause monogenic forms of common diseases (e.g., MODY) also harbor common variants of more modest effects that contribute to late-onset forms of disease. Recently, two groups independently obtained evidence suggesting that common variants near the P2 promoter of hepatocyte nuclear factor-4α (HNF4α), the MODY1 gene, are associated with risk for type 2 diabetes (19,20). The gene for HNF4α spans 29 kb on chromosome 20q13.1–13.2 (21). The P2 promoter, located ∼46 kb upstream of the P1 promoter and the coding start site, controls the predominantly expressed splice form in the β-cell (22,23). A number of linkage studies have observed suggestive linkage peaks in the region of HNF4α, further highlighting this region for study (2432).

In the study by Love-Gregory et al. (20), the strongest associations were for a single nucleotide polymorphism (SNP) near the P2 promoter (rs1884614; OR 1.45; nominal P = 0.008) and another in a 3′ intron (rs3818247; OR 1.49; nominal P = 0.003). Ten SNPs in the Silander et al. study (19) were nominally associated (below threshold P = 0.05), the strongest of which was rs1884613, also near the P2 promoter (OR 1.34; P = 0.01). Each study included ∼1,200 case patients and control subjects and genotyped 16 and 13 SNPs, respectively.

The similarity in results between the Love-Gregory et al. (20) and Silander et al. (19) studies and the high previous probability of a gene involved in MODY diabetes and in a region of putative linkage all support the hypothesis that variation in HNF4α might influence risk for the common form of type 2 diabetes. Nevertheless, given the history of nonreplication in genetic association studies (9,33), all such hypotheses need to be explored for consistency in additional samples. We studied SNPs in HNF4α in an additional 7,883 patients (approximately three times the combined study sample of Love-Gregory et al. [20] and Silander et al. [19]) to extend knowledge of genotype-phenotype correlation at this important locus.

The characteristics of four of our patient samples have been described elsewhere (11). They include 1,189 Scandinavian siblings who were discordant for type 2 diabetes; two Scandinavian case-control samples that contained 942 and 1,028 subjects, respectively; and 254 subjects from the Saguenay Lac-St. Jean region in Quebec. The case-control samples were individually matched for age, BMI, and geographic region. The type 2 diabetic patients met World Health Organization 1998 criteria for type 2 diabetes. Severe impaired glucose tolerance was defined as 10.0 mmol/l >120 min, blood glucose ≥8.5 mmol/l. An oral glucose tolerance test was performed when fasting plasma glucose was <11 mmol/l. Age of onset is available for 641 individuals in this study, with a mean ± SD of 53.0 ± 11.7 years.

This study also includes analysis of two additional case-control samples from Genomics Collaborative, Inc. (GCI), which have previously been described (34).The first sample contains 2,452 individuals of U.S. white ancestry, and the second includes 2,018 individuals from Poland. To qualify to be enrolled in the GCI diabetes study, participants were required to meet American Diabetes Association criteria for definition of type 2 diabetes and to be treated by the enrolling physician for diabetes at the time of study entry. Control subjects are healthy individuals who have no known history of chronic diseases and are required to have had a normal fasting plasma glucose <7 mmol/l within the last year. Case patients and control subjects were matched one-to-one on the basis of sex, age (±5 years), and self-reported ethnicity. For this study, both case patients and control subjects were required to report two parents and four grandparents with the same self-reported ethnicity. In addition, we matched on country of birth for the participant, both parents, and all four grandparents as much as possible in the case-control matching. The phenotypic characteristics of all samples are described in Table 1. Plasma glucose (fasting and for nondiabetic individuals during an oral glucose tolerance test) was measured by a glucose oxidase method with a Beckman Glucose analyzer (Beckman Instruments, Fullerton, CA).

Genotyping.

Genotyping was performed as previously described by primer extension of multiplex products with detection by matrix-assisted laser desorption ionization time of flight mass spectroscopy using a Sequenom platform (35,36). The average genotype completeness for working assays was 96%. The genotyping consensus error was determined to be 0.25%, using both duplicate genotypes (2,761 comparisons) and errors in Mendelian inheritance.

Statistical analysis.

To determine the allelic association of each particular SNP with type 2 diabetes, we used simple χ2 analysis in the case-control samples and the Discordant Allele Test in the sibling pairs (37). Results for the subsamples were combined using Mantel-Haenszel meta-analysis of the odds ratios (ORs) (33). Power calculations were performed using the program of Purcell et al. (38), available at http://statgen.iop.kcl.ac.uk/gpc. Homogeneity among studies was tested using a Pearson χ2 goodness-of-fit as previously described (33).

Haplotype structure.

To evaluate the haplotype structure of the HNF4α region, we genotyped 119 SNPs from dbSNP in a multigenerational panel of 12 Centre d’Etude du Polymorphisme Humain (CEPH) pedigrees that contained 96 chromosomes. These SNPs span 108 kb, from ∼17 kb upstream of the P2 promoter to ∼16 kb downstream of the end of the HNF4α 3′ UTR. SNPs were initially selected on the basis of an evenly spaced grid across the region, with additional SNPs added on the basis of the extent of linkage disequilibrium (LD). Nineteen (16%) of the SNPs attempted were technical failures (failing either Hardy Weinberg equilibrium or to attain a 75% genotyping percentage), and 42 (42%) of the remaining 100 SNPs either were monomorphic in this population or had a minor allele frequency <5%, totaling a final set of 58 working, high-frequency SNPs. The average spacing between these 58 SNPs is 1.9 kb, with the largest interval being 12.2 kb (in a region of strong LD). Haplotype blocks were determined as described previously (11).

SNP evaluation.

To determine how selected SNPs for our study and the Love-Gregory et al. (20) and Silander et al. (19) studies captured variation in the HNF4α region, we evaluated all genotyped SNPs for their correlation to one another in the CEPH samples described above. Specifically, we recorded the maximal pairwise r2 value of each tag SNP to the complete set of other variants typed in the region. On the hypothesis that any of these variants could be a putative causal variant or proxy thereof, we report the fraction of all markers (>5% frequency) that had an r2 > 0.2, r2 > 0.5, and r2 > 0.8 to at least one of the tag SNPs typed in the patient sample. We note that this is a nonconservative estimate of power, because we have not typed all SNPs in the region but only the one per 1.9 kb found in dbSNP. Because the total number of SNPs >5% frequency is only approximately one per 500 bp on average across the human genome (39,40), our survey likely has captured approximately one-fourth of all such sites; at a density of one SNP per 1.9 kb, most (but not all) of the remaining sites will typically show a high r2 value to one of the sites already observed (P.d.B., M.J.D., and D.A., unpublished observations).

We began by genotyping 11 SNPs in 3,413 type 2 diabetic patients and matched control subjects from Scandinavia and Canada previously described (11,18), (Table 1). The markers were selected before the publication of Love-Gregory et al. (20) and Silander et al. (19) but were found to include each of the SNPs significantly associated in both of the previous studies or a proxy thereof with r2 > 0.9 (online appendix [available at http://diabetes.diabetesjournals.org]) in an initial panel of 96 CEPH chromosomes. In addition to the SNPs that were relevant to the two previous papers, we genotyped the common missense polymorphism T130I and seven other SNPs from the HNF4α region. The single SNP association results are presented in Table 2. For previously associated SNPs, we report one-tailed P values (with the P value corresponding to an OR in the same direction as the previous report); for SNPs that were not previously published, two-tailed P values are reported.

Ten of the 11 genotyped SNPs had a nominal P > 0.1, including the missense SNP previously associated in another report (41). However, SNP rs1884613 near the P2 promoter (the strongest result in the Silander et al. study [19] and the second best in the Love-Gregory et al. [20] analysis) showed nominally significant evidence for replication, with a one-sided P = 0.05 uncorrected for the multiple tests performed. The OR in our samples was considerably lower than that in the two previous reports (1.11 vs. 1.34 and 1.38). The previous studies used family-based samples (some of which showed some evidence for linkage in this region), and such genetic loading could potentially account for the higher OR. In our family-based subsample, many of which were multiplex, the observed OR was 1.09, similar to our Scandinavian unrelated case-control sample (OR 1.11).

Because these results were consistent with the previous results but not definitive, we genotyped the most consistently associated SNP, rs1884613, in an additional sample of 4,470 patients (type 2 diabetic patients and control subjects) from North America and Poland. These patient samples have been previously published (34,42) and replicate both the previously associated peroxisome proliferator–activated receptor-γ P12A SNP (42) and the Kir6.2 E23K variant (34). Association testing of the single SNP in this additional sample did not support the previous model but trended in the opposite direction from previously described studies (OR 0.88). When we performed a meta-analysis of the 7,883 patients who were studied in this report, the estimated OR for rs1884613 was 0.97 (P = 0.27; Table 3).

Failure to replicate a previous result could possibly be due to lack of power for replication or heterogeneity between the original patient sample and the one used for replication. Power calculations (38) indicate that our combined sample of 7,883 individuals provides 98% power (at a significance level of 0.05) to detect an OR of 1.34 as proposed for rs1884613 by Silander et al. (19). Of course, the OR in an initial report is often overestimated as a result of the so-called “winner’s curse”; under the scenario that the true effect (if one exists) is smaller than in the original reports, our power is less than these estimates suggest (11,18,42).

When we considered the similar results obtained in both the Silander et al. (19) and our Scandinavian samples, we wondered whether heterogeneity across samples (as a result of ethnicity or geography) could be responsible for the lack of replication in the North American/Polish samples. A statistical test for heterogeneity in the OR of rs1884613 across all of our sub-samples demonstrated borderline evidence for heterogeneity among these seven samples (P = 0.06). On the basis of this test, these data are not inconsistent with having been randomly sampled from the same underlying distribution, despite that the Scandinavian/Canadian samples trended in the opposite direction of the North American/Polish samples (11,18,42) (Fig. 1).

In addition to the models discussed above (the “winner’s curse” and ethnic heterogeneity), variable results could arise if one or more variants are involved in the common form of type 2 diabetes and the previous results had been due to LD to untyped causal SNP(s). To evaluate this possibility will require a comprehensive association study of all common variation in the region. When we began the current investigation, the public SNP databases contained a low density of SNPs relative to the extent of LD in the region, and we lacked the resources to resequence completely the gene locus; thus, coverage of markers was far from complete. Recently, the density of markers in dbSNP has risen dramatically in the region, and we have typed additional markers to augment previous LD maps.

The LD patterns of 108 kb surrounding the HNF4α gene were determined by genotyping 119 SNPs from the public database in a panel composed of 12 CEPH multigenerational pedigrees totaling 90 Utah white individuals. Fifty-eight SNPs that were genotyped successfully were in Hardy-Weinberg equilibrium and had a minimum minor allele frequency of 5% (Fig. 1). The average spacing of these working high-frequency SNPs was less than one per 2 kb. LD breakdown is observed in several places in this region, such that the coding region lies in three blocks of strong LD (11,18), and the P2 promoter falls in a distinct upstream block of strong LD.

We used this denser SNP map to determine how well the marker sets used in the current study and in the studies by Love-Gregory et al. (20) and Silander et al. (19) captured the variation described by the updated map of common genetic variation in the region of HNF4α. This analysis is based on the hypothesis that the markers tested represent proxies for putative disease-causing alleles in the locus. Of course, this is an overestimation of the true power, because there are variants that we did not type because they have not been discovered yet, some of which will display patterns not captured by the markers that we were able to type. For comparison, we note that the density of SNPs that we have typed (one per 1.9 kb) likely represents approximately one-fourth of all of the common (>5%) SNPs in a typical region of the genome (approximately one per 500 bp) (39,40).

All three groups’ SNP sets performed very similarly (Table 4); this is not surprising, as the three groups used very similar sets of SNPs on the basis of what was available in dbSNP 1 year or more ago. We found that none provided particularly complete coverage of the current LD map. Specifically, the average correlation coefficient (r2) of the genotyped tag SNPs to the remaining markers in the LD map averaged only 0.51–0.54, and only 28–35% of typed SNPs were captured with an r2 > 0.8. This means that the studies of HNF4α performed to date (including the current study) are as yet far from complete in their evaluation of all of the currently known common variants in the region for association with diabetes risk. For groups that are interested in doing a more comprehensive study, a set of tag SNPs selected using this updated haplotype map is given in the online appendix. Finally, we note that the extent of LD is considerably less at this locus than in the genome as a whole (http://www.hapmap.org/) (36).

Given 25,000 or more genes in the human genome, the a priori probability that any given gene carries mutations that influence a trait of interest is very low. Where a gene is already known to play a functional role in the biological process of interest or, better still, is causative in a monogenic form of the disease (as in HNF4α), the a priori expectation of a causal role clearly rises. By any measure, the two recently published association results relating common variants in HNF4α to risk for type 2 diabetes are particularly intriguing. Severe mutations in the gene clearly cause a monogenic form of type 2 diabetes, and at least one SNP showed similar and nominally significant results that were consistent across two studies. However, both studies were only modestly significant, and so only if the previous probability for such variants in HNF4α were very high would the chance of a true positive be greater than that of a false positive (43). Also weighing in the equation is the lack of a demonstrated functional consequence of the putative associated alleles, although this may be a common situation for complex genetic diseases.

To try to demonstrate the reproducibility of the genotype-phenotype correlation, we genotyped 11 tag SNPs in 3,413 individuals and tested the single site that was most consistently associated across studies in 4,470 additional patients. Initially, we observed a modestly significant replication of association (one-tailed P = 0.05, not corrected for the multiple hypotheses examined) for the rs1884613 SNP, consistent in direction with previously published studies, albeit with a much weaker estimated effect size (OR 1.11, as compared with ∼1.3–1.4 in Love-Gregory et al. [20] and Silander et al. [19]). The larger North American and Polish study trended in the opposite direction, however, such that meta-analysis of all of our results fails to confirm that the variation at the HNF4α P2 promoter is associated with increased risk for type 2 diabetes.

There are several possible explanations for our inability to replicate consistently the findings of Love-Gregory et al. (20) and Silander et al. (19). First, it is possible that variation at this locus does have an effect on disease risk and that our results in the U.S./Poland sample are false negative. Given 4,400 people, it is unlikely that an effect as large as in the original reports (OR 1.34) would be missed. However, it is well understood that the true effect of an association is often more modest than estimated in the original studies, as a result of the so-called “winner’s curse” (33). We estimate that ∼8,000 case patients would be necessary to have 80% power to detect an OR of 1.15, and so all of the published results are consistent with a somewhat smaller effect.

Second, it is possible that the initial associations could have been statistical fluctuations, as the corrected P values in the original report were ∼0.01. Although it might seem unlikely that two studies should get the same result with a P value of ∼0.01, we note that a large number of such studies are being performed, and there was no a priori hypothesis that this SNP and gene would be the one to show up in these particular two studies.

A third potential explanation is that the previously reported association signals are real but that there is heterogeneity among populations, and the variant in question is not associated with risk in the U.S. and Poland samples as a result of unmeasured variation in genetic, environmental, or behavioral modifiers. For example, the Scandinavian and Canadian samples are matched for BMI, whereas the GCI samples are not. Although both study designs are valid, it is possible that the individuals in these samples could have a different mix of predisposing genetic risk factors. Moreover, although we failed to demonstrate significant evidence of heterogeneity, it is possible that with increased data, the trend toward heterogeneity (P = 0.06) might be documented as significant. For distinguishing between these hypotheses, additional large positive studies should be obtained or proof of significant heterogeneity documented.

There is general importance to the eventual outcome of genetic association testing in the genes implicated in monogenic forms of type 2 diabetes such as MODY1, the other MODY genes, and the mtDNA. During the past decade, many genes that cause Mendelian forms of common diseases have been identified. Examples include MODY, maternally inherited diabetes and deafness, >20 inherited forms of blood pressure regulation (44), early-onset breast cancer (BRCA1, BRCA2, ATM), Alzheimer’s (APP, PS1, PS2), and others. It is as yet unclear how often the genes that cause the Mendelian form of these diseases also contribute to the common form of disease. The extent to which the genes for common and rare forms of disease turn out to be overlapping will inform understanding of the diversity of the biological and evolutionary paths to a shared clinical endpoint.

FIG. 1.

LD plot across the HNF4α region. The horizontal black line shows the 108-kb region of chromosome 20q13.1–2 analyzed in our CEPH sample. The HNF4α gene is shown in purple, with the P2 promoter and alternative first exon indicated by the purple bar labeled FLJ39654. The 58 successful SNPs are listed below the black line. The bottom part of the figure contains a LD plot based on the measure D′. Each diamond represents the pairwise magnitude of LD, with red indicating strong LD (D′ > 0.8) and a logarithm of odds score >2.0. There are five blocks of strong LD, with the P2 promoter contained in the leftmost block and the HNF4α gene found in three blocks on the right side of the figure. The haplotypes in each of these blocks are shown above the LD plot, with the thickness of the blue line indicating their frequency in CEPH. The presence of a red line below the haplotypes indicates that these haplotypes represent >95% of the chromosomes. (Figure prepared using LocusView, unpublished software by T. Petryshen, A. Kirby, and M. Ainscow.)

FIG. 1.

LD plot across the HNF4α region. The horizontal black line shows the 108-kb region of chromosome 20q13.1–2 analyzed in our CEPH sample. The HNF4α gene is shown in purple, with the P2 promoter and alternative first exon indicated by the purple bar labeled FLJ39654. The 58 successful SNPs are listed below the black line. The bottom part of the figure contains a LD plot based on the measure D′. Each diamond represents the pairwise magnitude of LD, with red indicating strong LD (D′ > 0.8) and a logarithm of odds score >2.0. There are five blocks of strong LD, with the P2 promoter contained in the leftmost block and the HNF4α gene found in three blocks on the right side of the figure. The haplotypes in each of these blocks are shown above the LD plot, with the thickness of the blue line indicating their frequency in CEPH. The presence of a red line below the haplotypes indicates that these haplotypes represent >95% of the chromosomes. (Figure prepared using LocusView, unpublished software by T. Petryshen, A. Kirby, and M. Ainscow.)

Close modal
TABLE 1

Clinical characteristics of patient samples

SampleTypeSex (M/F)Age (years)BMI (kg/m2)Fasting plasma glucose (mmol/l)2-h OGTT PG (mmol/l) or HbA1c (%)
Discordant sibs Diabetes/severe IGT sib 280/329 65 ± 10 29 ± 5 9.3 ± 3.3 14.3 ± 5.6 
 NGT sib 275/305 62 ± 10 26 ± 3 5.4 ± 0.4 6.0 ± 1.1 
Scandinavia Diabetes/severe IGT 252/219 60 ± 10 28 ± 5 9.8 ± 3.4 15.0 ± 5.3 
 NGT 254/217 60 ± 10 27 ± 4 6.2 ± 1.8 6.8 ± 2.8 
Canada Diabetes 70/57 53 ± 8 29 ± 5 6.4 ± 1.8 12.8 ± 2.1 
 NGT 70/57 52 ± 8 29 ± 4 5.1 ± 0.6 6.1 ± 1.1 
Sweden Diabetes/severe IGT 267/247 66 ± 12 28 ± 4 8.5 ± 2.5 6.5 ± 1.5 
 NGT 267/247 66 ± 12 28 ± 4 4.8 ± 0.7 ND 
GCI U.S. Diabetes 644/582 63 ± 11 33 ± 7 9.8 ± 3.0 8.0 ± 3.1 
 NGT 644/582 61 ± 10 27 ± 5 5.1 ± 0.9 ND 
GCI Poland Diabetes 422/587 62 ± 10 30 ± 5 8.9 ± 4.0 7.9 ± 1.3 
 NGT 422/587 59 ± 7 26 ± 4 4.8 ± 1.2 ND 
SampleTypeSex (M/F)Age (years)BMI (kg/m2)Fasting plasma glucose (mmol/l)2-h OGTT PG (mmol/l) or HbA1c (%)
Discordant sibs Diabetes/severe IGT sib 280/329 65 ± 10 29 ± 5 9.3 ± 3.3 14.3 ± 5.6 
 NGT sib 275/305 62 ± 10 26 ± 3 5.4 ± 0.4 6.0 ± 1.1 
Scandinavia Diabetes/severe IGT 252/219 60 ± 10 28 ± 5 9.8 ± 3.4 15.0 ± 5.3 
 NGT 254/217 60 ± 10 27 ± 4 6.2 ± 1.8 6.8 ± 2.8 
Canada Diabetes 70/57 53 ± 8 29 ± 5 6.4 ± 1.8 12.8 ± 2.1 
 NGT 70/57 52 ± 8 29 ± 4 5.1 ± 0.6 6.1 ± 1.1 
Sweden Diabetes/severe IGT 267/247 66 ± 12 28 ± 4 8.5 ± 2.5 6.5 ± 1.5 
 NGT 267/247 66 ± 12 28 ± 4 4.8 ± 0.7 ND 
GCI U.S. Diabetes 644/582 63 ± 11 33 ± 7 9.8 ± 3.0 8.0 ± 3.1 
 NGT 644/582 61 ± 10 27 ± 5 5.1 ± 0.9 ND 
GCI Poland Diabetes 422/587 62 ± 10 30 ± 5 8.9 ± 4.0 7.9 ± 1.3 
 NGT 422/587 59 ± 7 26 ± 4 4.8 ± 1.2 ND 

Data are presented as means ± SD. Plasma glucose (PG) was measured at baseline (fasting) and 2 h after an oral glucose tolerance test (OGTT). IGT, impaired glucose tolerance; NGT, normal glucose tolerance; ND, not determined.

TABLE 2

Association of HNF4α tag SNPs with type 2 diabetes in 3,413 patients

SNPMinor alleleORP95% CI
rs4812828* 0.99 0.42 0.88–1.11 
rs1884613* 1.11 0.05 0.97–1.27 
rs2144908* 1.09 0.12 0.94–1.25 
rs4364072 0.98 0.71 0.87–1.10 
rs3092370 0.97 0.62 0.88–1.08 
rs1800963* 1.01 0.92 0.90–1.13 
rs3212184 1.02 0.74 0.91–1.14 
rs1885088* 1.05 0.22 0.92–1.20 
rs1800961 (T130I) 0.92 0.58 0.68–1.23 
rs1028583* 0.92 0.15 0.82–1.03 
rs3818247* 0.98 0.35 0.87–1.10 
SNPMinor alleleORP95% CI
rs4812828* 0.99 0.42 0.88–1.11 
rs1884613* 1.11 0.05 0.97–1.27 
rs2144908* 1.09 0.12 0.94–1.25 
rs4364072 0.98 0.71 0.87–1.10 
rs3092370 0.97 0.62 0.88–1.08 
rs1800963* 1.01 0.92 0.90–1.13 
rs3212184 1.02 0.74 0.91–1.14 
rs1885088* 1.05 0.22 0.92–1.20 
rs1800961 (T130I) 0.92 0.58 0.68–1.23 
rs1028583* 0.92 0.15 0.82–1.03 
rs3818247* 0.98 0.35 0.87–1.10 
*

SNP was also genotyped by both the Love-Gregory et al. (20) and Silander et al. (19) studies.

SNP was found to have a significant association to type 2 diabetes in at least one previous study. One-tailed P values are given for the SNPs with the dagger symbol (); all others are two tailed.

TABLE 3

Association of SNP rs1884613 with type 2 diabetes in individual samples

SampleFrequency (case/control)ORP95% CI
Discordant sibs 0.19/0.18 1.09 0.89 0.75–1.59 
Canada 0.12/0.17 0.67 0.25 0.41–1.12 
Scandinavia 0.20/0.19 1.11 0.73 0.89–1.39 
Sweden 0.19/0.15 1.25 0.12 0.99–1.58 
GCI U.S. 0.15/0.17 0.85 0.07 0.73–0.99 
GCI Poland 0.17/0.18 0.93 0.66 0.79–1.09 
Combined  0.97 0.59 0.90–1.06 
SampleFrequency (case/control)ORP95% CI
Discordant sibs 0.19/0.18 1.09 0.89 0.75–1.59 
Canada 0.12/0.17 0.67 0.25 0.41–1.12 
Scandinavia 0.20/0.19 1.11 0.73 0.89–1.39 
Sweden 0.19/0.15 1.25 0.12 0.99–1.58 
GCI U.S. 0.15/0.17 0.85 0.07 0.73–0.99 
GCI Poland 0.17/0.18 0.93 0.66 0.79–1.09 
Combined  0.97 0.59 0.90–1.06 

Results from the Scandinavian and Canadian subsamples shown in Table 2 are listed individually here for rs1884613. The U.S. and Polish samples are from GCI. Frequencies are given for the minor (G) allele. Results from all samples are combined by Mantel-Haenszel meta-analysis. All P values are two tailed.

TABLE 4

Comparison of LD between tag SNPs and known SNPs in the HNF4α region (n = 58)

GroupNo. of SNPsPercent of comparisons with
Mean r2
r2 > 0.2r2 > 0.5r2 > 0.8
Winckler et al. 11 71 52 31 52 
Love-Gregory et al. 13* 71 59 35 55 
Silander et al. 12 71 48 28 51 
GroupNo. of SNPsPercent of comparisons with
Mean r2
r2 > 0.2r2 > 0.5r2 > 0.8
Winckler et al. 11 71 52 31 52 
Love-Gregory et al. 13* 71 59 35 55 
Silander et al. 12 71 48 28 51 

r2 results show the fraction of 58 SNPs in our reference CEPH panel that met that particular correlation threshold (based on our LD map).

*

Total does not include the resequencing SNPs R1, R2–3, or rs3212210 from Love-Gregory et al. (20) because that SNP was severely out of Hardy-Weinberg equilibrium and had four Mendel errors in our CEPH panel;

total does not include rs3212183 because that assay was a technical failure in our CEPH panel. We note that the Love-Gregory et al. and Silander et al. (19) tag SNP performances might improve slightly if these SNPs were included.

L.G. and D.A. jointly supervised the project.

J.N.H. is a member of an advisory board for and has received consulting fees from Correlagen. D.A. is a member of an advisory board for and has received consulting fees from Genomics Collaborative, Inc.

Additional information for this article can be found in an online appendix at http://diabetes.diabetesjournals.org.

J.N.H. is a recipient of a Burroughs Wellcome Career Award in Biomedical Sciences. D.A. is a Charles E. Culpeper Scholar of the Rockefeller Brothers Fund and a Burroughs Wellcome Fund Clinical Scholar in Translational Research, the latter of which supported this work. L.G., T.T., and the Botnia Study are principally supported by the Sigrid Juselius Foundation, the Academy of Finland, the Finnish Diabetes Research Foundation, the Folkhalsan Research Foundation, EC (BM4-CT95-0662, GIFT), the Swedish Medical Research Council, the JDF Wallenberg Foundation, and the Novo Nordisk Foundation.

We thank the Botnia research team for clinical contributions and the members of the Altshuler, Hirschhorn, and Groop laboratories for helpful discussions.

1.
Horikawa Y, Iwasaki N, Hara M, Furuta H, Hinokio Y, Cockburn BN, Lindner T, Yamagata K, Ogata M, Tomonaga O, Kuroki H, Kasahara T, Iwamoto Y, Bell GI: Mutation in hepatocyte nuclear factor-1 beta gene (TCF2) associated with MODY.
Nat Genet
17
:
384
–385,
1997
2.
Stoffers DA, Ferrer J, Clarke WL, Habener JF: Early-onset type-II diabetes mellitus (MODY4) linked to IPF1.
Nat Genet
17
:
138
–139,
1997
3.
Yamagata K, Furuta H, Oda N, Kaisaki PJ, Menzel S, Cox NJ, Fajans SS, Signorini S, Stoffel M, Bell GI: Mutations in the hepatocyte nuclear factor-4α gene in maturity-onset diabetes of the young (MODY1).
Nature
384
:
458
–460,
1996
4.
Yamagata K, Oda N, Kaisaki PJ, Menzel S, Furuta H, Vaxillaire M, Southam L, Cox RD, Lathrop GM, Boriraj VV, Chen X, Cox NJ, Oda Y, Yano H, Le Beau MM, Yamada S, Nishigori H, Takeda J, Fajans SS, Hattersley AT, Iwasaki N, Hansen T, Pedersen O, Polonsky KS, Bell GI: Mutations in the hepatocyte nuclear factor-1α gene in maturity-onset diabetes of the young (MODY3).
Nature
384
:
455
–458,
1996
5.
Vionnet N, Stoffel M, Takeda J, Yasuda K, Bell GI, Zouali H, Lesage S, Velho G, Iris F, Passa P, et al.: Nonsense mutation in the glucokinase gene causes early-onset non-insulin-dependent diabetes mellitus.
Nature
356
:
721
–722,
1992
6.
Malecki MT, Jhala US, Antonellis A, Fields L, Doria A, Orban T, Saad M, Warram JH, Montminy M, Krolewski AS: Mutations in NEUROD1 are associated with the development of type 2 diabetes mellitus.
Nat Genet
23
:
323
–328,
1999
7.
Fajans SS, Bell GI, Polonsky KS: Molecular mechanisms and clinical pathophysiology of maturity-onset diabetes of the young.
N Engl J Med
345
:
971
–980,
2001
8.
van den Ouweland JM, Lemkes HH, Trembath RC, Ross R, Velho G, Cohen D, Froguel P, Maassen JA: Maternally inherited diabetes and deafness is a distinct subtype of diabetes and associates with a single point mutation in the mitochondrial tRNA(Leu(UUR)) gene.
Diabetes
43
:
746
–751,
1994
9.
Hirschhorn JN, Lohmueller K, Byrne E, Hirschhorn K: A comprehensive review of genetic association studies.
Genet Med
4
:
45
–61,
2002
10.
Florez JC, Hirschhorn J, Altshuler D: The inherited basis of diabetes mellitus: implications for the genetic analysis of complex traits.
Annu Rev Genomics Hum Genet
4
:
257
–291,
2003
11.
Florez JC, Burtt N, de Bakker PI, Almgren P, Tuomi T, Holmkvist J, Gaudet D, Hudson TJ, Schaffner SF, Daly MJ, Hirschhorn JN, Groop L, Altshuler D: Haplotype structure and genotype-phenotype correlations of the sulfonylurea receptor and the islet ATP-sensitive potassium channel gene region.
Diabetes
53
:
1360
–1368,
2004
12.
Weedon MN, Schwarz PE, Horikawa Y, Iwasaki N, Illig T, Holle R, Rathmann W, Selisko T, Schulze J, Owen KR, Evans J, Del Bosque-Plata L, Hitman G, Walker M, Levy JC, Sampson M, Bell GI, McCarthy MI, Hattersley AT, Frayling TM: Meta-analysis and a large association study confirm a role for calpain-10 variation in type 2 diabetes susceptibility.
Am J Hum Genet
73
:
1208
–1212,
2003
13.
Song Y, Niu T, Manson JE, Kwiatkowski DJ, Liu S: Are variants in the CAPN10 gene related to risk of type 2 diabetes? A quantitative assessment of population and family-based association studies.
Am J Hum Genet
74
:
208
–222,
2004
14.
Sklar P, Schwab SG, Williams NM, Daly M, Schaffner S, Maier W, Albus M, Trixler M, Eichhammer P, Lerer B, Hallmayer J, Norton N, Williams H, Zammit S, Cardno AG, Jones S, McCarthy G, Milanova V, Kirov G, O’Donovan MC, Lander ES, Owen MJ, Wildenauer DB: Association analysis of NOTCH4 loci in schizophrenia using family and population-based controls.
Nat Genet
28
:
126
–128,
2001
15.
Rioux JD, Daly MJ, Silverberg MS, Lindblad K, Steinhart H, Cohen Z, Delmonte T, Kocher K, Miller K, Guschwan S, Kulbokas EJ, O’Leary S, Winchester E, Dewar K, Green T, Stone V, Chow C, Cohen A, Langelier D, Lapointe G, Gaudet D, Faith J, Branco N, Bull SB, McLeod RS, Griffiths AM, Bitton A, Greenberg GR, Lander ES, Siminovitch KA, Hudson TJ: Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease.
Nat Genet
29
:
223
–228,
2001
16.
Laitinen T, Daly MJ, Rioux JD, Kauppi P, Laprise C, Petays T, Green T, Cargill M, Haahtela T, Lander ES, Laitinen LA, Hudson TJ, Kere J: A susceptibility locus for asthma-related traits on chromosome 7 revealed by genome-wide scan in a founder population.
Nat Genet
28
:
87
–91,
2001
17.
Martin ER, Lai EH, Gilbert JR, Rogala AR, Afshari AJ, Riley J, Finch KL, Stevens JF, Livak KJ, Slotterbeck BD, Slifer SH, Warren LL, Conneally PM, Schmechel DE, Purvis I, Pericak-Vance MA, Roses AD, Vance JM: SNPing away at complex diseases: analysis of single-nucleotide polymorphisms around APOE in Alzheimer disease.
Am J Hum Genet
67
:
383
–394,
2000
18.
Altshuler D, Hirschhorn JN, Klannemark M, Lindgren CM, Vohl MC, Nemesh J, Lane CR, Schaffner SF, Bolk S, Brewer C, Tuomi T, Gaudet D, Hudson TJ, Daly M, Groop L, Lander ES: The common PPARγ Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes.
Nat Genet
26
:
76
–80,
2000
19.
Silander K, Mohlke KL, Scott LJ, Peck EC, Hollstein P, Skol AD, Jackson AU, Deloukas P, Hunt S, Stavrides G, Chines PS, Erdos MR, Narisu N, Conneely KN, Li C, Fingerlin TE, Dhanjal SK, Valle TT, Bergman RN, Tuomilehto J, Watanabe RM, Boehnke M, Collins FS: Genetic variation near the hepatocyte nuclear factor-4 alpha gene predicts susceptibility to type 2 diabetes.
Diabetes
53
:
1141
–1149,
2004
20.
Love-Gregory LD, Wasson J, Ma J, Jin CH, Glaser B, Suarez BK, Permutt MA: A common polymorphism in the upstream promoter region of the hepatocyte nuclear factor-4 alpha gene on chromosome 20q is associated with type 2 diabetes and appears to contribute to the evidence for linkage in an Ashkenazi Jewish population.
Diabetes
53
:
1134
–1140,
2004
21.
Argyrokastritis A, Kamakari S, Kapsetaki M, Kritis A, Talianidis I, Moschonas NK: Human hepatocyte nuclear factor-4 (hHNF-4) gene maps to 20q12–q13.1 between PLCG1 and D20S17.
Hum Genet
99
:
233
–236,
1997
22.
Boj SF, Parrizas M, Maestro MA, Ferrer J: A transcription factor regulatory circuit in differentiated pancreatic cells.
Proc Natl Acad Sci U S A
98
:
14481
–14486,
2001
23.
Thomas H, Jaschkowitz K, Bulman M, Frayling TM, Mitchell SM, Roosen S, Lingott-Frieg A, Tack CJ, Ellard S, Ryffel GU, Hattersley AT: A distant upstream promoter of the HNF-4α gene connects the transcription factors involved in maturity-onset diabetes of the young.
Hum Mol Genet
10
:
2089
–2097,
2001
24.
Silander K, Scott LJ, Valle TT, Mohlke KL, Stringham HM, Wiles KR, Duren WL, Doheny KF, Pugh EW, Chines P, Narisu N, White PP, Fingerlin TE, Jackson AU, Li C, Ghosh S, Magnuson VL, Colby K, Erdos MR, Hill JE, Hollstein P, Humphreys KM, Kasad RA, Lambert J, Lazaridis KN, Lin G, Morales-Mena A, Patzkowski K, Pfahl C, Porter R, Rha D, Segal L, Suh YD, Tovar J, Unni A, Welch C, Douglas JA, Epstein MP, Hauser ER, Hagopian W, Buchanan TA, Watanabe RM, Bergman RN, Tuomilehto J, Collins FS, Boehnke M: A large set of Finnish affected sibling pair families with type 2 diabetes suggests susceptibility loci on chromosomes 6, 11, and 14.
Diabetes
53
:
821
–829,
2004
25.
Mori Y, Otabe S, Dina C, Yasuda K, Populaire C, Lecoeur C, Vatin V, Durand E, Hara K, Okada T, Tobe K, Boutin P, Kadowaki T, Froguel P: Genome-wide search for type 2 diabetes in Japanese affected sib-pairs confirms susceptibility genes on 3q, 15q, and 20q and identifies two new candidate Loci on 7p and 11p.
Diabetes
51
:
1247
–1255,
2002
26.
Permutt MA, Wasson JC, Suarez BK, Lin J, Thomas J, Meyer J, Lewitzky S, Rennich JS, Parker A, DuPrat L, Maruti S, Chayen S, Glaser B: A genome scan for type 2 diabetes susceptibility loci in a genetically isolated population.
Diabetes
50
:
681
–685,
2001
27.
Ghosh S, Watanabe RM, Hauser ER, Valle T, Magnuson VL, Erdos MR, Langefeld CD, Balow J Jr, Ally DS, Kohtamaki K, Chines P, Birznieks G, Kaleta HS, Musick A, Te C, Tannenbaum J, Eldridge W, Shapiro S, Martin C, Witt A, So A, Chang J, Shurtleff B, Porter R, Boehnke M: Type 2 diabetes: evidence for linkage on chromosome 20 in 716 Finnish affected sib pairs.
Proc Natl Acad Sci U S A
96
:
2198
–2203,
1999
28.
Zouali H, Hani EH, Philippi A, Vionnet N, Beckmann JS, Demenais F, Froguel P: A susceptibility locus for early-onset non-insulin dependent (type 2) diabetes mellitus maps to chromosome 20q, proximal to the phosphoenolpyruvate carboxykinase gene.
Hum Mol Genet
6
:
1401
–1408,
1997
29.
Ji L, Malecki M, Warram JH, Yang Y, Rich SS, Krolewski AS: New susceptibility locus for NIDDM is localized to human chromosome 20q.
Diabetes
46
:
876
–881,
1997
30.
Bowden DW SM, Howard TD, Qadri A, Spray BJ, Rothschild CB, Akots G, Rich SS, Freedman BI: Linkage of genetic markers on human chromosomes 20 and 12 to NIDDM in Caucasian sib pairs with a history of diabetic nephropathy.
Diabetes
46
:
882
–886,
1997
31.
Luo TH, Zhao Y, Li G, Yuan WT, Zhao JJ, Chen JL, Huang W, Luo M: A genome-wide search for type II diabetes susceptibility genes in Chinese Hans.
Diabetologia
44
:
501
–506,
2001
32.
Klupa T, Malecki MT, Pezzolesi M, Ji L, Curtis S, Langefeld CD, Rich SS, Warram JH, Krolewski AS: Further evidence for a susceptibility locus for type 2 diabetes on chromosome 20q13.1-q13.2.
Diabetes
49
:
2212
–2216,
2000
33.
Lohmueller KE, Pearce CL, Pike M, Lander ES, Hirschhorn JN: Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease.
Nat Genet
33
:
177
–182,
2003
34.
Florez JC, Sjogren M, Burtt N, Orho-Melander M, Schayer S, Sun M, Almgren P, Tuomi T, Gaudet D, Hudson TJ, Ardlie KG, Hirschhorn JN, Altshuler D, Groop L: Association testing in 9,000 people fails to confirm the association of the insulin receptor substrate-1 G972R polymorphism with type 2 diabetes.
Diabetes
53
:
3313
–3317,
2004
35.
Tang K, Fu DJ, Julien D, Braun A, Cantor CR, Koster H: Chip-based genotyping by mass spectrometry.
Proc Natl Acad Sci U S A
96
:
10016
–10020,
1999
36.
Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D: The structure of haplotype blocks in the human genome.
Science
296
:
2225
–2229,
2002
37.
Boehnke M, Langefeld CD: Genetic association mapping based on discordant sib pairs: the discordant-alleles test.
Am J Hum Genet
62
:
950
–961,
1998
38.
Purcell S, Cherny SS, Sham PC: Genetic Power Calculator: design of linkage and association genetic mapping studies of complex traits.
Bioinformatics
19
:
149
–150,
2003
39.
Kruglyak L, Nickerson DA: Variation is the spice of life.
Nat Genet
27
:
234
–236,
2001
40.
Cargill M, Altshuler D, Ireland J, Sklar P, Ardlie K, Patil N, Shaw N, Lane CR, Lim EP, Kalyanaraman N, Nemesh J, Ziaugra L, Friedland L, Rolfe A, Warrington J, Lipshutz R, Daley GQ, Lander ES: Characterization of single-nucleotide polymorphisms in coding regions of human genes.
Nat Genet
22
:
231
–238,
1999
41.
Zhu Q, Yamagata K, Miura A, Shihara N, Horikawa Y, Takeda J, Miyagawa J, Matsuzawa Y: T130I mutation in HNF-4α gene is a loss-of-function mutation in hepatocytes and is associated with late-onset type 2 diabetes mellitus in Japanese subjects.
Diabetologia
46
:
567
–573,
2003
42.
Ardlie KG, Lunetta KL, Seielstad M: Testing for population subdivision and association in four case-control studies.
Am J Hum Genet
71
:
304
–311,
2002
43.
Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N: Assessing the probability that a positive report is false: an approach for molecular epidemiology studies.
J Natl Cancer Inst
96
:
434
–442,
2004
44.
Wilson FH, Disse-Nicodeme S, Choate KA, Ishikawa K, Nelson-Williams C, Desitter I, Gunel M, Milford DV, Lipkin GW, Achard JM, Feely MP, Dussol B, Berland Y, Unwin RJ, Mayan H, Simon DB, Farfel Z, Jeunemaitre X, Lifton RP: Human hypertension caused by mutations in WNK kinases.
Science
293
:
1107
–1112,
2001