This study focused on resolving the relationship between BMI and type 2 diabetes. The availability of multiple variants associated with BMI offers a new chance to resolve the true causal effect of BMI on type 2 diabetes; however, the properties of these associations and their validity as genetic instruments need to be considered alongside established and new methods for undertaking Mendelian randomization (MR). We explore the potential for pleiotropic genetic variants to generate bias, revise existing estimates, and illustrate value in new analysis methods. A two-sample MR approach with 96 genetic variants was used with three different analysis methods, two of which (MR-Egger and the weighted median) have been developed specifically to address problems of invalid instrumental variables. We estimate an odds ratio for type 2 diabetes per unit increase in BMI (kg/m2) of between 1.19 and 1.38, with the most stable estimate using all instruments and a weighted median approach (1.26 [95% CI 1.17, 1.34]). TCF7L2(rs7903146) was identified as a complex effect or pleiotropic instrument, and removal of this variant resulted in convergence of causal effect estimates from different causal analysis methods. This indicated the potential for pleiotropy to affect estimates and differences in performance of alternative analytical methods. In a real type 2 diabetes–focused example, this study demonstrates the potential impact of invalid instruments on causal effect estimates and the potential for new approaches to mitigate the bias caused.

Observational studies have shown BMI to be associated with risk of type 2 diabetes as well as with a range of diabetes-related metabolic traits (1,2). However, it is well known that confounding, reverse causation, and biases can generate such associations and that even with careful study design, incorrect inference is possible (3). One approach to circumventing these problems is to use genetic association results within a Mendelian randomization (MR) framework (3,4). In MR analyses, genetic variants act as proxies for an exposure in a manner independent of confounders. If in addition the variants only affect an outcome of interest through the chosen exposure, then they are said to be valid instrumental variables (IVs). This enables evaluation of the causal effect of the exposure on the outcome, escaping some of the limitations of observational epidemiology (5).

After the success of genome-wide association studies (GWAS), the number of MR analyses using large numbers of mostly uncharacterized variants associated with complex health outcomes or intermediates is rapidly increasing (6,7). In the case of BMI, there are now 97 genetic variants reliably associated and there are examples where multiple variants have been used as a composite IV to estimate the causal impact of BMI on health (8). Although using many IVs can increase the power of MR analyses, it brings with it the concern that enlarged sets of genetic variants are more likely to contain invalid IVs due to violations of the assumptions necessary for valid causal inference using traditional methods (9). In particular, horizontal pleiotropy, where a genetic variant affects the outcome via more than one biological pathway (10), is a concern. Importantly, the properties of these associations and their validity as genetic instruments need to be considered alongside established and new methods for undertaking MR.

In response to the general issue of using multiple genetic variants in MR, Bowden et al. (9) propose both MR-Egger regression, an approach developed from the original Egger regression technique for assessing small study bias in meta-analysis, and a weighted median approach (11) as alternatives to the standard MR analysis. The MR-Egger and weighted median approaches both operate using distinct, but critically weaker, versions of the IV assumptions, and therefore have the potential to deliver robust causal effect estimates. The MR-Egger method also provides a formal statistical test as to whether or not the average pleiotropic effect of the genetic variants is equal to zero (9).

With increasing evidence for multiple biological pathways underlying type 2 diabetes (12,13) and increasing numbers of genetic variants available as IVs for BMI, we set out to test the potential for bias in causal estimates from MR using these state-of-the-art approaches. We compared results from MR-Egger regression (9) and weighted median (11) approaches to a traditional inverse-variance weighted (IVW) method (which makes the strong assumption that all variants are valid IVs) (14) in an investigation of the causal relationship between BMI and type 2 diabetes. These methods all undertake two-sample MR whereby the GWAS results for a disease outcome are unified with those of an exposure of interest and together used to estimate the causal impact of that exposure on disease. We used published data in a two-sample analysis strategy taking single nucleotide polymorphism (SNP)–exposure and SNP-outcome associations from different sources (15,16).

The effect sizes for BMI-associated SNPs with associated SE from a mixed-sex cohort of European ancestry were taken from the Genetic Investigation of ANthropometric Traits (GIANT) consortium (17) along with results for type 2 diabetes from the DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) consortium (18). To avoid sample overlap, GIANT estimates were recalculated in the absence of DIAGRAM cohorts, yielding a maximum sample size at any given locus of 189,079. To aid interpretation of the effects of BMI on type 2 diabetes, effect sizes were transformed to BMI units prior to analysis, assuming one SD = 4.5 kg/m2 (17). For the corresponding SNP-outcome association, we took odds ratios (ORs) and CIs from a GWAS meta-analysis conducted by the DIAGRAM consortium. This genome-wide meta-analysis includes data from 12,171 case subjects with type 2 diabetes and 56,862 control subjects of mainly European descent imputed at up to 2.5 million autosomal SNPs (DIAGRAMv3) (18). All but one (rs4787491, INO80E) of the BMI-associated SNPs (P < 5 × 10−8) from GIANT had results listed in the DIAGRAMv3 data set so 96 SNPs with results in both data sets were taken forward for analysis.

SNP-exposure and SNP-outcome associations were combined using the three different approaches outlined above. All analyses were conducted in R 3.2.0 (19). First, an IVW method was implemented to provide a weighted average of the causal effect estimates (14). This method assumes that all genetic variants satisfy the IV assumptions (including zero pleiotropy) and uses weights that assume the gene-exposure association estimates are measured without error (the no measurement error [NOME] assumption).

Second, we performed MR-Egger regression (9), which assumes NOME but allows each variant to exhibit pleiotropy. MR-Egger estimates remain consistent only if the magnitude of the gene exposure associations across all variants are independent of their pleiotropic effects (the InSIDE assumption) (9). As recommended by Bowden et al. (9), the extent to which pleiotropy was balanced across the set of instruments as a whole was visually assessed by plotting the causal effect estimates against their precision, using a funnel plot and checking for asymmetry (Fig. 1A). The NOME assumption was assessed for MR-Egger via an adaptation of the I2 statistic () (20) and adjusted for by combining MR-Egger with the method of simulation extrapolation (SIMEX) (21). Using SIMEX, new data sets are created by simulating gene-exposure association estimates under increasing violations of NOME and recording the amount of attenuation in the estimate that occurs. The set of attenuated estimates are then used to extrapolate back to the estimate that would have been obtained if NOME had been satisfied.

Figure 1

Genetic associations with BMI and type 2 diabetes from 96 variants measured in GIANT (17) and DIAGRAM (18), respectively. TCF7L2(rs7903146) and FTO(rs1558902) are marked with an “X” and labeled. A: Funnel plot of minor allele frequency–corrected genetic associations with BMI (interpreted as instrument strength) against causal estimates based on each genetic variant individually, where the causal effect is expressed in log OR of type 2 diabetes for each unit increase in BMI. The overall causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by IVW (solid black line), MR-Egger (dashed black line), and weighted median (dotted black line) methods are also shown. Gray solid line represents x = 0, which is a causal estimate of zero. B: Scatter plot of genetic associations with type 2 diabetes against associations with BMI, with causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by IVW (solid line), MR-Egger (dashed line), and weighted median (dotted line) methods.

Figure 1

Genetic associations with BMI and type 2 diabetes from 96 variants measured in GIANT (17) and DIAGRAM (18), respectively. TCF7L2(rs7903146) and FTO(rs1558902) are marked with an “X” and labeled. A: Funnel plot of minor allele frequency–corrected genetic associations with BMI (interpreted as instrument strength) against causal estimates based on each genetic variant individually, where the causal effect is expressed in log OR of type 2 diabetes for each unit increase in BMI. The overall causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by IVW (solid black line), MR-Egger (dashed black line), and weighted median (dotted black line) methods are also shown. Gray solid line represents x = 0, which is a causal estimate of zero. B: Scatter plot of genetic associations with type 2 diabetes against associations with BMI, with causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by IVW (solid line), MR-Egger (dashed line), and weighted median (dotted line) methods.

Close modal

Finally, a weighted median estimation method was applied (11). The weighted median provides a consistent estimate of causal effect if at least 50% of the information in the analysis comes from variants that are valid IVs. For a more detailed description of the three methods applied, see the Supplementary Methods. A leave-one-out permutation analysis was conducted across all methods to assess the influence of potentially pleiotropic SNPs on the causal estimates (22). In the case of the linear models (IVW and MR-Egger), two additional analyses were conducted (23,24). First, the extent to which the causal estimate from each SNP in the set could be considered an outlier was assessed using studentized residuals. Second, Cook’s distance (25) was used as a measure of the aggregate impact of each SNP on the model.

All three approaches provide evidence of a positive causal relationship between BMI and type 2 diabetes. This is demonstrated in Fig. 1B where the slope of the lines show the causal effect estimates as predicted by the IVW, MR-Egger, and weighted median approaches. Estimates correspond to an OR for type 2 diabetes per unit increase in BMI (kg/m2) of 1.19, 1.26, and 1.38 for the IVW, weighted median, and MR-Egger analyses, respectively, and are in line with a previous MR estimate of 1.27 (95% CI 1.18, 1.36) (2) (Table 1). Assessment of the NOME assumption with respect to the MR-Egger estimate gave = 0.83, suggesting an ∼15% attenuation of the causal estimate toward zero. Bias adjustment via SIMEX gave a corrected MR-Egger causal estimate of 1.46 (95% CI 1.16, 1.84) for type 2 diabetes per unit increase in BMI (kg/m2).

Table 1

Estimates from the application of IVW, MR-Egger, and weighted median MR methodologies

MethodEstimate95% CIP value
Complete variant set (n = 96 SNPs) 
 IVW 1.20 1.09, 1.30 8.00 × 10−05 
 MR-Egger 1.39 1.14, 1.68 1.53 × 10−03 
 MR-Egger* −0.019 −0.041, 0.004 0.10 
 Weighted median 1.26 1.17, 1.34 5.26 × 10−9 
TCF7L2(rs7903146) removed from the variant set (n = 95 SNPs) 
 IVW 1.22 1.16, 1.28 1.49 × 10−11 
 MR-Egger 1.34 1.17, 1.51 9.71 × 10−06 
 MR-Egger* −0.011 −0.024, −0.024 0.13 
 Weighted median 1.26 1.19, 1.32 3.29 × 10−10 
FTO(rs1558902) removed from the variant set (n = 95 SNPs) 
 IVW 1.16 1.06, 1.27 1.31 × 10−03 
 MR-Egger 1.30 1.01, 1.65 0.04 
 MR-Egger* −0.012 −0.038, 0.014 0.34 
 Weighted median 1.21 1.13, 1.28 6.81 × 10−08 
MethodEstimate95% CIP value
Complete variant set (n = 96 SNPs) 
 IVW 1.20 1.09, 1.30 8.00 × 10−05 
 MR-Egger 1.39 1.14, 1.68 1.53 × 10−03 
 MR-Egger* −0.019 −0.041, 0.004 0.10 
 Weighted median 1.26 1.17, 1.34 5.26 × 10−9 
TCF7L2(rs7903146) removed from the variant set (n = 95 SNPs) 
 IVW 1.22 1.16, 1.28 1.49 × 10−11 
 MR-Egger 1.34 1.17, 1.51 9.71 × 10−06 
 MR-Egger* −0.011 −0.024, −0.024 0.13 
 Weighted median 1.26 1.19, 1.32 3.29 × 10−10 
FTO(rs1558902) removed from the variant set (n = 95 SNPs) 
 IVW 1.16 1.06, 1.27 1.31 × 10−03 
 MR-Egger 1.30 1.01, 1.65 0.04 
 MR-Egger* −0.012 −0.038, 0.014 0.34 
 Weighted median 1.21 1.13, 1.28 6.81 × 10−08 

Estimates represent the estimated causal effect of BMI on type 2 diabetes.

*The average pleiotropic effect of a genetic variant on type 2 diabetes risk.

Considering the individual SNP-based contributions to MR analysis, there is one clear outlier in the distribution of effects shown in Fig. 1, and that is TCF7L2(rs7903146). TCF7L2(rs7903146) shows an association with BMI that is in the opposite direction to the overall trend (and weak relative to its effect on type 2 diabetes), resulting in a large negative causal estimate from this SNP alone. The presence of at least some unbalanced pleiotropy was detected within the set of variants, as reflected by the intercept estimate of −0.019 (P = 0.10) in the MR-Egger analysis.

To illustrate the impact of TCF7L2(rs7903146) on causal estimates, we performed a sensitivity analysis in which each SNP in turn was removed from the set in a leave-one-out permutation analysis. We saw a shift in the causal estimates from the IVW (an increase) and MR-Egger (a decrease) as a result of the removal of TCF7L2(rs7903146) but no difference in the estimate from the weighted median approach (Table 1 and Fig. 2). The results of the leave-one-out permutation analysis showed that the impact of removing TCF7L2(rs7903146) from the variant set on the IVW and MR-Egger estimates was greater than that of removing almost any other variant, with the exception of FTO(rs1558902) (Fig. 2A and B). When FTO(rs1558902) was removed, causal estimates from both the IVW and MR-Egger analysis decreased (Table 1 and Fig. 2). In this instance we also observed movement in the causal effect estimate from the weighted median (Table 1 and Fig. 2C). The estimate of the intercept from MR-Egger moved closer to zero after the removal of both TCF7L2(rs7903146) and FTO(rs1558902) (Fig. 2D). TCF7L2(rs7903146) was also identified as an outlier in both IVW and MR-Egger (studentized residuals, Bonferroni-corrected P < 1 × 10−19) but FTO(rs1558902) was not (Supplementary Fig. 1A and B). Calculation of Cook’s distance showed both variants to have a disproportionate level of influence on the model compared with other variants in the set (Supplementary Fig. 2A and B).

Figure 2

Distributions of regression estimates resulting from leave-one-out permutation analysis. Solid line, estimate from main analysis (n = 96 variants); dashed line, estimate with TCF7L2(rs7903146) removed; dotted line, estimate with FTO(rs1558902) removed. A: Causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by an IVW method. B: Causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by MR-Egger. C: Causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by a weighted median method. D: Estimates of the intercept by MR-Egger.

Figure 2

Distributions of regression estimates resulting from leave-one-out permutation analysis. Solid line, estimate from main analysis (n = 96 variants); dashed line, estimate with TCF7L2(rs7903146) removed; dotted line, estimate with FTO(rs1558902) removed. A: Causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by an IVW method. B: Causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by MR-Egger. C: Causal estimates (β-coefficients) of BMI on type 2 diabetes estimated by a weighted median method. D: Estimates of the intercept by MR-Egger.

Close modal

These results suggest that TCF7L2(rs7903146) may be pleiotropic with respect to the outcome, i.e., that it influences type 2 diabetes through an alternative pathway (other than BMI). Evidence from existing literature supports this assertion as the type 2 diabetes risk-increasing allele at TCF7L2(rs7903146) has been associated with both increased fasting glucose (26) and decreased BMI (17). Under the assumption that TCF7L2(rs7903146) demonstrates horizontal pleiotropy with respect to type 2 diabetes, we would expect its inclusion in the variant set to bias the causal estimate predicted by the IVW approach, but not that predicted by MR-Egger or the weighted median. Removing TCF7L2(rs7903146) from the variant set causes a slight shift in the causal estimates from the IVW and MR-Egger approaches, bringing them more in line with one another and also with the weighted median estimate, which remained stable in this instance. Also of note is the reduction in the 95% CI of the MR-Egger estimate after removal of the TCF7L2(rs7903146). This increase in precision after removal of a likely invalid instrument from the set is another potentially favorable quality of this estimator. The relatively small changes observed across all methods as a result of removing TCF7L2(rs7903146) are in line with the relatively weak effect of the SNP as shown in Fig. 1B.

In contrast, the effect of removing FTO(rs1558902) is more noticeable. Regardless of the method used, removing this variant results in a lower causal estimate (Table 1 and Fig. 2). The substantial influence of FTO(rs1558902) was predictable given the strength of its effect relative to the other variants (Fig. 1B), although properties of this effect are not in line with other variants used to instrument BMI as reported elsewhere for physical activity (27), thyroid function (28), and depression (29). The concomitant increase in SE associated with the estimates here point toward increased uncertainty moving the estimates toward the null in the absence of FTO(rs1558902). The weighted median appears robust, even to the removal of FTO(rs1558902), as demonstrated by the relatively tight distribution of estimates returned from the leave-one-out permutation analysis (Fig. 2C). This is as expected given the tolerance of weighted median approaches to outliers.

By applying new analytical techniques to an old question (the causal relationship between BMI and type 2 diabetes), we have explored the potential for invalid instruments to bias causal estimates in MR. In this case where BMI is the exposure, the opportunity to use a large instrument list in causal analyses presents both opportunity, through variance explained, but also cost, through complications generated by instrument properties or methods used. Results here suggest that both TCF7L2 and FTO appear to have genetic variation that predicts BMI reliably, but for which associations with type 2 diabetes do not fully align with that for other variants (given BMI effects and assumed causality).

For TCF7L2, only recently suggested to be associated with BMI directly (17), this is not surprising and reinforces the important point that the validity of a specific method’s MR estimate depends on whether the genetic variants collectively satisfy its assumptions. In this case, it is possible that the negative association with BMI observed in GIANT is the product of a form of bias where the risk of type 2 diabetes is leading to effective treatment, health benefit, and BMI reduction. This is supported by the apparently causal negative relationship between type 2 diabetes and BMI seen in a reciprocal analysis where BMI is the outcome of interest (Supplementary Fig. 3), although it is likely to be more a comment on study design than biological effect.

In this example, the use of recently derived methods (9,11) designed to overcome problems caused by directional pleiotropy yields estimates that are more stable in the presence or absence of potentially invalid instruments and confirm the likely magnitude of the average effect of BMI on type 2 diabetes (i.e., from the most likely and stable estimate, an elevation of odds of disease of ∼26% for each additional unit of BMI). The comparison of results from different methods for any set of potential instruments is important when assessing the reliability of causal inferences and is important for downstream interpretation. In this case, although it is impossible to model precisely, one can estimate the hypothetical impact of an average population level change in life course BMI on type 2 diabetes. Given a population size of 64.1 million in the U.K. in mid-2013 (30) and a modeled prevalence of type 2 diabetes (including nondiagnosed cases) of 7.4% (31,32), the estimated reduction in odds for a 1 kg/m2 reduction would potentially yield a reduction in the number of cases from ∼4.7 to 3.6 million (a shift in prevalence to 5.6%).

See accompanying article, p. 2824.

Acknowledgments. Data on BMI were contributed by GIANT. The authors specially thank Adam Locke (School of Public Health, University of Michigan) for conducting the additional analyses required for this work. Data on type 2 diabetes were contributed by the DIAGRAM consortium and were downloaded from diagram-consortium.org/downloads.html.

Funding. The Medical Research Council Integrative Epidemiology Unit at the University of Bristol is supported by the Medical Research Council (MC_UU_12013/1, MC_UU_12013/2, and MC_UU_12013/3) and the University of Bristol. R.C.R. and K.H.W. are supported by Cancer Research UK (C18281/A19169). S.B. is supported by the Wellcome Trust (grant 100114). J.B. is funded by a Medical Research Council Methodology Research Fellowship (grant MR/N501906/1).

Duality of Interest. There are no potential conflicts of interest relevant to this article.

Author Contributions. L.J.C. contributed to the conception of the study, conducted the analysis, and wrote the manuscript. R.C.R., K.H.W., and G.D.S. contributed to the conception and supervision of the study. S.B. contributed to method and script development. J.B. contributed to method and script development and prepared the Supplementary Data. N.J.T. conceived and supervised the study. N.J.T. is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

1.
Prospective Studies Collaboration, Whitlock G, Lewington S, Sherliker P, et al.
Body-mass index and cause-specific mortality in 900 000 adults: collaborative analyses of 57 prospective studies
.
Lancet
2009;
373
:
1083
1096
2.
Holmes Michael
V
,
Lange Leslie
A
,
Palmer
T
, et al
.
Causal effects of body mass index on cardiometabolic traits and events: a Mendelian randomization analysis
.
Am J Hum Genet
2014
;
94
:
198
208
3.
Davey Smith
G
,
Hemani
G
.
Mendelian randomization: genetic anchors for causal inference in epidemiological studies
.
Hum Mol Genet
2014
;
23
:
R89
R98
4.
Davey Smith
G
,
Ebrahim
S
.
‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease?
Int J Epidemiol
2003
;
32
:
1
22
5.
Lawlor
DA
,
Harbord
RM
,
Sterne
JAC
,
Timpson
N
,
Davey Smith
G
.
Mendelian randomization: using genes as instruments for making causal inferences in epidemiology
.
Stat Med
2008
;
27
:
1133
1163
6.
Sleiman
PMA
,
Grant
SFA
.
Mendelian randomization in the era of genomewide association studies
.
Clin Chem
2010
;
56
:
723
728
7.
Boef
AGC
,
Dekkers
OM
,
le Cessie
S
.
Mendelian randomization studies: a review of the approaches used and the quality of reporting
.
Int J Epidemiol
2015
;44:496–511
8.
Nordestgaard
BG
,
Palmer
TM
,
Benn
M
, et al
.
The effect of elevated body mass index on ischemic heart disease risk: causal estimates from a Mendelian randomisation approach
.
PLoS Med
2012
;
9
:
e1001212
9.
Bowden
J
,
Davey Smith
G
,
Burgess
S
.
Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression
.
Int J Epidemiol
2015
;
44
:
512
525
10.
Didelez
V
,
Sheehan
N
.
Mendelian randomization as an instrumental variable approach to causal inference
.
Stat Methods Med Res
2007
;
16
:
309
330
11.
Bowden
J
,
Davey Smith
G
,
Haycock
PC
,
Burgess
S
.
Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator
.
Genet Epidemiol
2016
;
40
:
304
314
12.
Yaghootkar
H
,
Frayling
T
.
Recent progress in the use of genetics to understand links between type 2 diabetes and related metabolic traits
.
Genome Biol
2013
;
14
:
1
7
13.
Dimas
AS
,
Lagou
V
,
Barker
A
, et al
.
Investigators obotM: impact of type 2 diabetes susceptibility variants on quantitative glycemic traits reveals mechanistic heterogeneity
.
Diabetes
2014
;
63
:
2158
2171
14.
Burgess
S
,
Butterworth
A
,
Thompson
SG
.
Mendelian randomization analysis with multiple genetic variants using summarized data
.
Genet Epidemiol
2013
;
37
:
658
665
15.
Burgess
S
,
Scott
R
,
Timpson
N
,
Davey Smith
G
,
Thompson
S
.
Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors
.
Eur J Epidemiol
2015
;30:543–552
16.
Pierce
BL
,
Burgess
S
.
Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators
.
Am J Epidemiol
2013
;
178
:
1177
1184
17.
Locke
AE
,
Kahali
B
,
Berndt
SI
, et al
.
Genetic studies of body mass index yield new insights for obesity biology
.
Nature
2015
;
518
:
197
206
18.
Morris
AP
,
Voight
BF
,
Teslovich
TM
, et al
.
Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes
.
Nat Genet
2012
;
44
:
981
990
19.
Team
RC
.
R: A Language and Environment for Statistical Computing
. 3.2.0 ed.
Vienna, Austria
,
R Foundation for Statistical Computing
,
2014
20.
Higgins
JPT
,
Thompson
SG
,
Deeks
JJ
,
Altman
DG
.
Measuring inconsistency in meta-analyses
.
BMJ
2003
;
327
:
557
560
21.
Bowden J. Assessing the Suitability of Summary Data for Mendelian Randomization Analyses Using MR-Egger Regression: the Role of the I^2 Statistic. Technical Report (available on request). Bristol, U.K., University of Bristol, 2015
22.
Stone
M
.
Cross-validatory choice and assessment of statistical predictions
.
J R Stat Soc B
1974
;
36
:
111
147
23.
Venables
WN
,
Ripley
BD
.
Modern Applied Statistics with S
.
New York
,
Springer
,
2002
24.
Fox
J
,
Weisberg
S
.
An R Companion to Applied Regression
.
Los Angeles
,
Sage
,
2011
25.
Cook
RD
.
Detection of influential observation in linear regression
.
Technometrics
1977
;
19
:
15
18
26.
Manning
AK
,
Hivert
M-F
,
Scott
RA
, et al
.
A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance
.
Nat Genet
2012
;
44
:
659
669
27.
Richmond
RC
,
Davey Smith
G
,
Ness
AR
,
den Hoed
M
,
McMahon
G
,
Timpson
NJ
.
Assessing causality in the association between child adiposity and physical activity levels: a Mendelian randomization analysis
.
PLoS Med
2014
;
11
:
e1001618
28.
Taylor
PN
,
Richmond
R
,
Davies
N
, et al
.
Paradoxical relationship between body mass index and thyroid hormone levels: a study using Mendelian randomization
.
J Clin Endocrinol Metab
2016
;
101
:
730
738
29.
Walter
S
,
Kubzansky
LD
,
Koenen
KC
, et al
.
Revisiting Mendelian randomization studies of the effect of body mass index on depression
.
Am J Med Genet B Neuropsychiatr Genet
2015
;
168
:
108
115
30.
Office for National Statistics (UK). Statistical Bulletin:Annual Mid-year Population Estimates: 2013. Newport, U.K., Office For National Statistics, 2014
31.
National Cardiovascular Intelligence Network. Cardiovascular disease key facts. In Diabetes: Public Health England. London, Public Health England, 2013
32.
Gatineau M, Hancock C, Holman N, et al. Adult Obesity and Type 2 Diabetes. London, Public Health England, 2014
Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered. More information is available at http://www.diabetesjournals.org/content/license.

Supplementary data