Although there are some epigenome-wide association studies (EWAS) of insulin resistance, for most of them authors did not replicate their findings, and most are focused on populations of European ancestry, limiting the generalizability. In the Epigenetics in Pregnancy (EPIPREG; n = 294 Europeans and 162 South Asians) study, we conducted an EWAS of insulin resistance in maternal peripheral blood leukocytes, with replication in the Born in Bradford (n = 879; n = 430 Europeans and 449 South Asians), Methyl Epigenome Network Association (MENA) (n = 320), and Botnia (n = 56) cohorts. In EPIPREG, we identified six CpG sites inversely associated with insulin resistance across ancestry, of which five were replicated in independent cohorts (cg02988288, cg19693031, and cg26974062 in TXNIP; cg06690548 in SLC7A11; and cg04861640 in ZSCAN26). From methylation quantitative trait loci analysis in EPIPREG, we identified gene variants related to all five replicated cross-ancestry CpG sites, which were associated with several cardiometabolic phenotypes. Mediation analyses suggested that the gene variants regulate insulin resistance through DNA methylation. To conclude, our cross-ancestry EWAS identified five CpG sites related to lower insulin resistance.

Insulin resistance is a hallmark of type 2 diabetes and is characterized by a reduced metabolic response to insulin in skeletal muscle, liver, and adipocytes (1). Insulin resistance is usually compensated by increased insulin secretion, but type 2 diabetes develops when pancreatic β-cells are unable to compensate (2). In a normal pregnancy, insulin resistance increases to secure an adequate supply of glucose to the fetus (3,4). In the third trimester, many women reach a level of insulin resistance similar to what is observed in type 2 diabetes, resulting in gestational diabetes mellitus (GDM) if the pancreatic β-cells fail to compensate (5). Although GDM usually resolves after delivery, affected women have a 10-fold increased risk for developing type 2 diabetes (6).

Currently, genetic variants explain 18% of type 2 diabetes risk (7). A large proportion of type 2 diabetes risk may be explained by the interplay between genetic and environmental factors such as obesity, physical activity, and smoking, among others (8). The relationship between environmental or genetic factors and type 2 diabetes is potentially mediated by epigenetic factors (9). Methylation at specific CpG sites in peripheral blood leukocytes has been associated with both incident and prevalent type 2 diabetes (9,10); with glucose homeostasis variables such as fasting glucose, fasting insulin, and insulin resistance in men and nonpregnant women (1113); and with GDM in cord blood (1416).

South Asians have an increased risk of developing GDM and type 2 diabetes in comparison with Europeans (17,18). Studies suggest that South Asians have high insulin resistance combined with reduced β-cell capacity (19,20). Cross-ancestry studies can be useful to study pathogenic mechanisms that may explain the differential risk between populations or to confirm cross-ancestry associations in diverse populations with distinct confounding structures. Still, most epigenome-wide association studies (EWAS) focus on populations of European ancestry, limiting the generalizability in other ancestries.

We have not found any published EWAS in which peripheral blood leukocytes were used to discover CpG sites associated with insulin resistance across populations with different ancestries. Furthermore, of the four studies in which associations were found between insulin resistance and epigenome-wide DNA methylation in whole blood (11,12,21,22), a replication cohort to verify findings was not used in three (11,12,22). Here, we aimed to identify cross-ancestry CpG sites in peripheral blood leukocytes associated with insulin resistance in gestational week 28, with subsequent testing for replication. We also use different data sources to leverage further knowledge about potential biological mechanisms. And we performed mediation analysis to explore the direction of effects.

Study Population

The STORK Groruddalen (STORK G) study is a population-based cohort, including 823 healthy pregnant women without diabetes in gestational week 15, in the multiethnic area of Groruddalen in Oslo, Norway, 2008–2010 (23). Details about inclusion criteria are given in the Supplementary Material. The Epigenetics in Pregnancy (EPIPREG) sample include all women of European (n = 312) or South Asian (n = 168) origin participating in STORK G with available DNA collected in gestational week 28 (24).

The STORK G study, including its genetic and epigenetic data (ie, from the EPIPREG sample), is approved by the Norwegian Regional Committee for Medical Health Research Ethics South East (reference no. 2015/1035). Written informed consents were gathered from all participants.

Data Collection

Data were collected in gestational week 28. Venous blood (serum and plasma) was drawn into EDTA tubes. The samples were either aliquoted and stored in a biobank or subjected to routine laboratory analyses performed during the study period. Insulin resistance was estimated by the HOMA, using the HOMA2 calculator version 2.2.2 (https://www.dtu.ox.ac.uk/homacalculator) based on fasting levels of glucose (HemoCue, Angelholm, Sweden) and C-peptide (DELFIA, PerkinElmer Life Sciences, Wallac Oy, Turku, Finland) (25). The interassay coefficient of variability for C-peptide was 4–5% between 336 and 3,500 pmol/L. The 2-h glucose was measured with a point-of-care device (HemoCue). We calculated the HOMA for insulin resistance (HOMA-IR) with C-peptide instead of insulin because a recently published study from our group indicated that hepatic insulin clearance differed between European and South Asian women (26).

All women underwent a 75-g oral glucose tolerance test. GDM was diagnosed with the World Health Organization (WHO) 1999 criteria (i.e., fasting glucose ≥7.0 mmol/L or 2-h glucose ≥7.8 mmol/L). We also present a slightly modified version of the current WHO 2013 criteria (fasting glucose 5.1–6.9 mmol/L or 2-h glucose ≥8.5–11 mmol/L; 1-h glucose information was not available in our data). Fasting insulin was measured with noncompeting immunofluorometric assays (DELFIA, PerkinElmer Life Sciences). The interassay coefficient of variability for insulin was 7% between 63 and 580 pmol/L. Fasting plasma total cholesterol, LDL cholesterol, HDL cholesterol, and triglycerides levels were measured with a colorimetric method (Vitros 5.1 fs, Ortho Clinical Diagnostics, Neckargemünd, Germany).

Age of the women was calculated from date of birth reported at enrollment. With data collected in gestational week 28, we calculated BMI from measured height using a fixed stadiometer at enrolment, and measured body weight (Tanita-BC 418 MA, Tanita Corporation, Tokyo, Japan). Systolic and diastolic blood pressures were measured with the M6 Comfort HEM-7000-E device (Omron, Kyoto, Japan). Smoking status was assessed using an interviewer-administered questionnaire and dichotomized as follows: smokers (current and 3 months before pregnancy) versus nonsmokers (former smokers and never smokers).

DNA Methylation Assay and Genotyping

Details about blood collection and DNA isolation and preparation can be found in the Supplementary Material. We quantified DNA methylation in maternal peripheral blood leukocytes in gestational week 28 using the Infinium MethylationEPIC BeadChip (Illumina, San Diego, CA). Quality control (QC) was performed with the meffil R package (27). We removed six outliers from the methylated to unmethylated ratio comparison (>3 SD), one outlier in control probes bisulfite 1 and bisulfite 2 (>5 SD), and one due to sex mismatch (predicted sex outliers >5 SD). We removed probes with a bead count of fewer than three and detection P value < 0.01. In total, 472 of the 480 available individuals, and 864,560 probes passed the QC. To remove technical variation, all samples were randomly distributed across chips. Furthermore, we used functional normalization implemented in the meffil R package (27), which returns normalized data standardized for batch effects of slide, row, and columns, as well as 10 principal components from the QC (25). Proportions of blood cells (namely, CD8T, CD4T, NK, β-cells, monocytes, and neutrophils) were calculated using the Houseman method (27) during the QC. After further removal of probes with infinites, missing values, Y chromosome probes, probes harboring single nucleotide polymorphisms (SNPs) and cross-reactive probes (28), 810,266 probes remained. Technical validation of the EPIC bead chip showed good agreement between pyrosequencing and the EPIC array (24).

The Illumina CoreExome chip was used for genotyping (Supplementary Material). Briefly, after QC, 300 Europeans and 138 South Asians had both genotyping and epigenetic data. Genetic ancestry was assessed through ancestry principal component analysis, which corresponded with the participants’ reported country of birth or mothers’ country of birth if the mother was born outside of Europe (24). Genotype imputation in Europeans and South Asians was done separately by using the respective imputation panel from the 1000 Genomes Project (29).

Overall Study Flow

For the EWAS, we included 294 European and 162 South Asian women (n = 456) who passed the EWAS QC and did not have HOMA-IR outlying values (greater than ±3 SDs) in their respective ethnic group (Fig. 1). Of these, 287 European and 135 South Asian women had genetic data available and were included in the methylation quantitative trait loci (meQTL) analysis (Fig. 1).

Figure 1

Study flow of the EWAS of insulin resistance in EPIPREG. EUR, European; SA, South Asian.

Figure 1

Study flow of the EWAS of insulin resistance in EPIPREG. EUR, European; SA, South Asian.

Close modal

Replication Cohorts

The Born in Bradford (BiB) cohort (n = 430 Europeans and 449 South Asians) (30,31) included pregnant women with DNA methylation data analyzed in peripheral blood leukocytes in gestational weeks 26–28 and insulin resistance calculated with HOMA2. The Methyl Epigenome Network Association (MENA) (n = 320) and the Finnish Botnia cohort (n = 56) included nonpregnant women and men with DNA methylation in peripheral blood leukocytes and insulin resistance calculated with the standard insulin resistance formula. Additional details about the cohorts are given in the Supplementary Material and Supplementary Table 1.

Statistics

EWAS Analysis and Replication

Beta values were logit transformed to M values. For the main discovery analysis, we performed a cross-ancestry EWAS of insulin resistance with mixed models regression using the R packages lme4 and lmerTest (32) and the P value calculation of Satterthwaite. Insulin resistance was treated as the exposure (independent variable) and DNA methylation as the outcome (dependent variable). Blood cell composition, age, and smoking status were treated as fixed effects, and ancestry was treated as a random intercept. We treated ancestry as a random intercept to overcome potential ancestry-related differences in DNA methylation. We also performed EWAS separately on data from Europeans and South Asians to identify potential ancestry-specific CpG sites (Supplementary Material). Because only three South Asian participants were classified as smokers, a model without adjusting for smoking was also performed. For the EWAS analyses, we used a false discovery rate (FDR) threshold of 5%. We considered a CpG as replicated if it had a consistent direction of the effect with the discovery analysis and a Bonferroni corrected P value (0.05 per number of CpG sites tested for replication) in at least one replication cohort. Some of the mixed models we present had singular fit (i.e., overfitting due to the number of categories in the random intercept). Nevertheless, we do not consider this to be a major problem for the presented conclusions (Supplementary Material).

GDM Sensitivity Analysis

To assess if GDM influenced the identified associations, we performed a subanalysis in women without GDM; this was also done in BiB. Some of the mixed models we present had singular fit. As noted in the preceding paragraph, we do not consider this to be a major problem for the presented conclusions (Supplementary Material).

Gene Ontology Analysis

For gene ontology analysis, we included cross-ancestry CpG sites with a P value <1 × 10−4 and accepted a 5% FDR threshold (Supplementary Material).

Associations With Cardiometabolic Phenotypes and Sensitivity Analysis for BMI

For the association between the replicated cross-ancestry CpG sites and other cardiometabolic phenotypes, we used linear mixed models with the same covariates as in the discovery analysis. We additionally adjusted for BMI to evaluate its influence on the association between the discovered CpG sites and insulin resistance.

Cross-Ancestry meQTL Identification

To identify genetic variants associated with the cross-ancestry CpG sites, we used a cross-ancestry approach. Although genetic variants can have differential effects across populations (33), it has been suggested that genetic variants associated with complex traits in more than one ancestry tend to have consistent direction of effects (34,35) and are more likely to be causal or in linkage disequilibrium with causal variants (3335). Therefore, we expect genetic variants associated with methylation of CpG sites across ancestries to have similar direction of effects.

From the variants that passed postimputation QC in data from European (nSNP = 5,063,268) and South Asian (nSNP = 3,939,192) participants, we selected the variants common in both ancestries (nSNP = 3,506,750) for the cross-ancestry meQTL analysis. To save computational power, we performed linear regression separately in data from European and South Asian participants, using the R package GEM (36), adjusted for blood cell composition, age, and smoking. The coefficients and SEs were combined using an inverse-variance weighted fixed-effects meta-analysis with custom R scripts based on METAL (37). Cis-meQTLs were defined as positioned less than ±1 Mb from the methylation site or otherwise were classified as trans-meQTLs. We report associations at P < 1.0 × 10−8 (5 × 10−8 per five meQTL analyses).

The most significant cross-ancestry meQTLs of each CpG site were examined for associations with other cardiometabolic phenotypes and mediation analysis. If some of the most significant genetic variants were located on the same gene and/or chromosome, we evaluated if they were in strong linkage disequilibrium (R2 > 0.8) (Supplementary Material).

To evaluate the association between a genetic variant with other cardiometabolic phenotypes we used linear mixed models with ethnicity as a random intercept.

Mediation Analysis

Mediation analysis was performed to evaluate if DNA methylation was a mediator between the cross-ancestry meQTLs and insulin resistance (Supplementary Material). The mediation analysis calculates three models: 1) the average causal mediation effect, which is the effect size that goes through the mediator (indirect effect); 2) the average direct effect, which is the effect size after adjustment for the mediator, thus removing its effect (direct effect); and 3) the total effect, which is the sum of the indirect and direct effects.

Expression Analysis in MyoGlu

In the Skeletal Muscles, Myokines, and Glucose Metabolism (MyoGlu) study (3840), we used Spearman analysis to test correlations between gene expression in subcutaneous adipose and muscle tissue that mapped to the replicated CpG sites and insulin sensitivity measured with the hyperinsulinemic euglycemic clamp. More details about MyoGlu are presented in the Supplementary Material.

Information From Publicly Available Databases

The BIOS database (https://wiki.bbmri.nl/wiki/BIOS_start-) was consulted to identify expression quantitative trait methylation in peripheral blood leukocytes. The GTEx database (https://gtexportal.org/home/) was consulted to evaluate if the cross-ancestry meQTLs for each CpG site were also expression quantitative loci (eQTL) in peripheral blood leukocytes, subcutaneous adipose tissue, omental adipose tissue, liver, skeletal muscle, and pancreas. Data from the Atlas of GWAS Summary Statistics (https://atlas.ctglab.nl/) were extracted for cross-ancestry meQTLs to identify associations (P < 0.05) with cardiometabolic traits (details about study selection are provided in the Supplementary Material).

Data and Resource Availability

Because of strict regulations for genetic data and privacy protection of patients in Norway, all requests for data access are processed by the STORK G project’s steering committee. Please contact the principal investigator of STORK G ([email protected]) or the principal investigator of EPIPREG ([email protected]).

BiB data used for this submission will be made available on request to the BiB executive committee ([email protected]). The BiB data management plan (https://borninbradford.nhs.uk/research/how-to-access-data/) describes in detail the policy regarding data sharing, which is through a system of managed open access.

The clinical characteristics of the 294 European and 162 South Asian women (N = 456) included in this study are presented in Table 1.

Table 1

Characteristics of EPIPREG, the discovery sample

VariableNAllNEuropeansNSouth Asians
Age, years 456 29.9 (4.6) 294 30.6 (4.6) 162 28.6 (4.4) 
Gestational week 456 28.2 (1.2) 294 28.1 (1.2) 162 28.2 (1.2) 
Smokers* 456 96 (21.1) 294 93 (31.6) 162 3 (1.9) 
 Smokers during pregnancy 456 21 (4.6) 294 20 (6.8) 162 1 (0.6) 
 Smokers 3 months prepregnancy 456 75 (16.4) 294 73 (24.8) 162 2 (1.2) 
 Former smokers 456 92 (20.2) 294 83 (28.2) 162 9 (5.6) 
 Never smokers 456 268 (58.8) 294 118 (40.1) 162 150 (92.6) 
BMI, kg/m2 454 27.4 (4.5) 292 27.6 (4.6) 162 26.8 (4.1) 
Fasting glucose, mmol/L 454 4.8 (0.6) 291 4.7 (0.5) 162 5.0 (0.6) 
2-h glucose, mmol/L 453 6.2 (1.2) 293 6.0 (1.4) 160 6.4 (1.5) 
HbA1c, % 452 5.1 (0.3) 290 5.1 (0.3) 162 5.3 (0.3) 
C-peptide, pmol/L 456 797.3 (298.8) 294 743 (266.7) 162 894.5 (329.0) 
Fasting insulin, pmol/L 456 62.9 (36.2) 290 53.2 (27.8) 162 80.5 (42.6) 
HOMA-IR 456 1.7 (0.7) 294 1.6 (0.6) 162 1.9 (0.7) 
GDM WHO 1999 454 58 (12.8) 294 35 (11.9) 160 23 (14.4) 
GDM WHO 2013 456 136 (29.8) 294 70 (23.8) 162 66 (40.7) 
Triglycerides, mmol/L 456 2.0 (0.7) 294 2.0 (0.7) 162 2.0 (0.6) 
Cholesterol, mmol/L 456 6.3 (1.1) 294 6.4 (1.1) 162 6.0 (1.0) 
LDL cholesterol, mmol/L 451 3.5 (1.0) 290 3.7 (1.0) 161 3.3 (0.9) 
HDL cholesterol, mmol/L 456 1.9 (0.4) 294 1.9 (0.4) 162 1.9 (0.4) 
SBP, mmHg 449 104.8 (9.5) 290 106.8 (9.4) 158 101.3 (8.7) 
DBP, mmHg 449 67.5 (7.1) 290 68.2 (7.1) 158 66.2 (7.0) 
VariableNAllNEuropeansNSouth Asians
Age, years 456 29.9 (4.6) 294 30.6 (4.6) 162 28.6 (4.4) 
Gestational week 456 28.2 (1.2) 294 28.1 (1.2) 162 28.2 (1.2) 
Smokers* 456 96 (21.1) 294 93 (31.6) 162 3 (1.9) 
 Smokers during pregnancy 456 21 (4.6) 294 20 (6.8) 162 1 (0.6) 
 Smokers 3 months prepregnancy 456 75 (16.4) 294 73 (24.8) 162 2 (1.2) 
 Former smokers 456 92 (20.2) 294 83 (28.2) 162 9 (5.6) 
 Never smokers 456 268 (58.8) 294 118 (40.1) 162 150 (92.6) 
BMI, kg/m2 454 27.4 (4.5) 292 27.6 (4.6) 162 26.8 (4.1) 
Fasting glucose, mmol/L 454 4.8 (0.6) 291 4.7 (0.5) 162 5.0 (0.6) 
2-h glucose, mmol/L 453 6.2 (1.2) 293 6.0 (1.4) 160 6.4 (1.5) 
HbA1c, % 452 5.1 (0.3) 290 5.1 (0.3) 162 5.3 (0.3) 
C-peptide, pmol/L 456 797.3 (298.8) 294 743 (266.7) 162 894.5 (329.0) 
Fasting insulin, pmol/L 456 62.9 (36.2) 290 53.2 (27.8) 162 80.5 (42.6) 
HOMA-IR 456 1.7 (0.7) 294 1.6 (0.6) 162 1.9 (0.7) 
GDM WHO 1999 454 58 (12.8) 294 35 (11.9) 160 23 (14.4) 
GDM WHO 2013 456 136 (29.8) 294 70 (23.8) 162 66 (40.7) 
Triglycerides, mmol/L 456 2.0 (0.7) 294 2.0 (0.7) 162 2.0 (0.6) 
Cholesterol, mmol/L 456 6.3 (1.1) 294 6.4 (1.1) 162 6.0 (1.0) 
LDL cholesterol, mmol/L 451 3.5 (1.0) 290 3.7 (1.0) 161 3.3 (0.9) 
HDL cholesterol, mmol/L 456 1.9 (0.4) 294 1.9 (0.4) 162 1.9 (0.4) 
SBP, mmHg 449 104.8 (9.5) 290 106.8 (9.4) 158 101.3 (8.7) 
DBP, mmHg 449 67.5 (7.1) 290 68.2 (7.1) 158 66.2 (7.0) 

EPIPREG was also used for meQTL analysis and associations with cardiometabolic phenotypes. Data represent mean (SD) or n (%). DBP, diastolic blood pressure; GDM 2013 (1999), WHO 2013 (1999) criteria for GDM; SBP, systolic blood pressure.

*

Dichotomized variable for smoking: individuals were classified as smokers if they smoked during pregnancy or 3 months prepregnancy.

Discovery Analysis

We identified six cross-ancestry CpG sites inversely associated with insulin resistance (Table 2 and Fig. 2A), with lower levels of DNA methylation associated with higher insulin resistance. We did not find evidence of inflation (λ = 1.02) (Supplementary Fig. 1). Summary statistics for the 100 cross-ancestry CpG sites with P ≤1 × 10−4 are shown in Supplementary Table 2. In cg10318066, we detected a methylation outlier, but we decided to pursue this site because the direction of the effect was unchanged after omitting it from the analysis (effect = −0.045; SE, 0.011; P = 1.65 × 10−5).

Table 2

Cross-ancestry CpG sites associated with insulin resistance and the replication analysis

CpG informationBiB, N = 879 (EUR = 430, SA = 449)MENA, N = 320 (450K = 214, EPIC = 106)Botnia, N = 56
CpGGeneChrEffect**SEP valueFDREffect**SEP value***Effect**SEP value***Effect**SEP value***
cg02988288* TXNIP −0.225 0.038 7.30 × 10−9 0.006 −0.081 0.019 2.52 × 10−5 0.032 0.035 0.369 NA NA NA 
cg04861640 ZSCAN26 −0.187 0.032 1.81 × 10−8 0.006 −0.079 0.026 0.002 0.021 0.018 0.251 0.064 0.309 0.290 
cg06690548 SLC7A11 −0.154 0.027 2.46 × 10−8 0.006 −0.056 0.015 0.0002 0.02 0.014 0.157 1.583 3.481 0.650 
cg19693031 TXNIP −0.139 0.024 3.14 × 10−8 0.006 −0.028 0.012 0.025 −0.04 0.014 0.004 −3.778 1.429 0.014 
cg26974062* TXNIP −0.194 0.036 8.48 × 10−8 0.014 −0.097 0.022 1.12 × 10−5 0.037 0.032 0.252 NA NA NA 
cg10318066 N/A −0.066 0.012 1.84 × 10−7 0.025 −0.005 0.014 0.688 −0.003 0.005 0.545 −2.354 5.42 0.220 
CpG informationBiB, N = 879 (EUR = 430, SA = 449)MENA, N = 320 (450K = 214, EPIC = 106)Botnia, N = 56
CpGGeneChrEffect**SEP valueFDREffect**SEP value***Effect**SEP value***Effect**SEP value***
cg02988288* TXNIP −0.225 0.038 7.30 × 10−9 0.006 −0.081 0.019 2.52 × 10−5 0.032 0.035 0.369 NA NA NA 
cg04861640 ZSCAN26 −0.187 0.032 1.81 × 10−8 0.006 −0.079 0.026 0.002 0.021 0.018 0.251 0.064 0.309 0.290 
cg06690548 SLC7A11 −0.154 0.027 2.46 × 10−8 0.006 −0.056 0.015 0.0002 0.02 0.014 0.157 1.583 3.481 0.650 
cg19693031 TXNIP −0.139 0.024 3.14 × 10−8 0.006 −0.028 0.012 0.025 −0.04 0.014 0.004 −3.778 1.429 0.014 
cg26974062* TXNIP −0.194 0.036 8.48 × 10−8 0.014 −0.097 0.022 1.12 × 10−5 0.037 0.032 0.252 NA NA NA 
cg10318066 N/A −0.066 0.012 1.84 × 10−7 0.025 −0.005 0.014 0.688 −0.003 0.005 0.545 −2.354 5.42 0.220 

Highlighted in bold are the CpG sites that were replicated in at least one replication cohort. Chr, chromosome; NA not applicable.

*

CpG site only available in the EPIC beadchip.

**

The effect size units are methylation M values.

***

P value threshold for replication was <0.0083 (P value treshold = 0.05/6 tests [one per CpG site]).

Singular fit in EPIPREG, which may indicate overfitting.

Singular fit in BiB.

DBP, diastolic blood pressure; GDM 2013 (1999), WHO 2013 (1999) criteria for GDM; SBP, systolic blood pressure.

Figure 2

A: Manhattan plot showing the cross-ancestry epigenome-wide study of insulin resistance in European and South Asian women from EPIPREG. Red line denotes an FDR threshold of 5%, and the blue line denotes a P value cutoff <1 × 10−4. B: Log P value heat map of CpG–phenotype association for the five replicated CpG sites. The colors range from pale yellow (no association; P > 0.05) to bright red (lowest P value = 1.32 × 10−8). All laboratory measures except 2-h glucose (2hG) levels are derived from fasting samples. CHOL, total cholesterol; DBP, diastolic blood pressure; FG, fasting glucose; GDM 2013 (1999), WHO 2013 (1999) criteria for GDM; SBP, systolic blood pressure; TG, triglycerides.

Figure 2

A: Manhattan plot showing the cross-ancestry epigenome-wide study of insulin resistance in European and South Asian women from EPIPREG. Red line denotes an FDR threshold of 5%, and the blue line denotes a P value cutoff <1 × 10−4. B: Log P value heat map of CpG–phenotype association for the five replicated CpG sites. The colors range from pale yellow (no association; P > 0.05) to bright red (lowest P value = 1.32 × 10−8). All laboratory measures except 2-h glucose (2hG) levels are derived from fasting samples. CHOL, total cholesterol; DBP, diastolic blood pressure; FG, fasting glucose; GDM 2013 (1999), WHO 2013 (1999) criteria for GDM; SBP, systolic blood pressure; TG, triglycerides.

Close modal

In the ancestry-specific EWAS of insulin resistance, we identified 12 European-specific CpG sites (Supplementary Table 3), whereas no CpG sites reached the FDR threshold in the South Asian EWAS (Supplementary Table 4). The South Asian EWAS unadjusted for smoking yielded similar results in comparison with the adjusted model, and no CpG site reached the FDR threshold. The EWAS of data from European participants presented moderate inflation (λ = 1.33) (Supplementary Fig. 2), whereas no inflation seemed to be present in the EWAS of data from South Asian participants (λ = 1.03) (Supplementary Fig. 3).

Replication in Independent Cohorts and GDM Sensitivity Analysis

All the cross-ancestry CpG sites but cg10318066 were replicated in at least one of the three independent cohorts (Table 2). None of the 12 ancestry-specific CpG sites observed in Europeans were replicated in any of the independent cohorts (Supplementary Table 5).

We performed sensitivity analyses on the replicated CpG sites by excluding individuals with GDM. When GDM cases according to WHO 1999 criteria were excluded, most of the coefficients weakened as they increased by 12–18% (Supplementary Table 6). When GDM cases according to WHO 2013 criteria were excluded, all coefficients strengthened as they decreased between 2% and 15% (Supplementary Table 6). Overall, the direction of effects did not change and remained nominally significant (P < 0.05). The same analysis was conducted in the BiB cohort, wherein similar effects were observed (Supplementary Table 6).

Gene ontology Enrichment Analysis

To identify potentially relevant biological processes, we performed gene ontology enrichment analysis in the cross-ancestry (P < 1 × 10−4) CpG sites, which annotated to 80 loci. No gene ontology terms reached an FDR < 0.5 (Supplementary Table 7).

Associations With Cardiometabolic Phenotypes in EPIPREG

Methylation at the replicated CpG sites were inversely associated with C-peptide, insulin, and BMI values, and additionally with at least one phenotype (Table 3 and Fig. 2B).

Table 3

Association of the cross-ancestry CpG sties with other cardiometabolism-related traits in the EPIPREG sample

cg02988288cg04861640cg06690548cg19693031cg26974062
VariableEffectSEP valueEffectSEP valueEffectSEP valueEffectSEP valueEffectSEP value
Fasting glucose −0.087 0.044 0.045 −0.045 0.038 0.236 −0.049 0.032 0.123 −0.072 0.028 0.011 −0.111 0.041 0.007 
2-h glucose −0.042 0.018 0.020 −0.013 0.016 0.391 −0.001 0.013 0.948 −0.022 0.012 0.063 −0.053 0.017 0.002 
C-peptide −4.89 × 10−4 8.46 × 10−5 1.52 × 10−8 −4.20 × 10−4 7.26 × 10−5 1.32 × 10−8 −3.24 × 10−4 5.99 × 10−5 1.03 × 10−7 −3.01 × 10−4 5.43 × 10−5 4.97 × 10−8 −4.24 × 10−4 7.93 × 10−5 1.40 × 10−7 
Fasting insulin −0.003 0.001 3.40 × 10−6 −0.003 0.001 9.82 × 10−8 −0.003 5.00 × 10−4 2.33 × 10−7 −0.002 4.64 × 10−4 3.44 × 10−7 −0.003 0.001 8.09 × 10−6 
GDM WHO 1999 −0.083 0.076 0.280 −0.070 0.065 0.283 −0.046 0.054 0.401 −0.062 0.049 0.207 −0.129 0.070 0.066 
GDM WHO 2013 −0.117 0.055 0.036 −0.014 0.048 0.764 −0.040 0.040 0.316 −0.105 0.036 0.004 −0.140 0.052 0.007 
BMI −0.023 0.006 6.55 × 10−5 −0.017 0.005 0.001 −0.015 0.004 1.55 × 10−4 −0.013 0.004 3.36 × 10−4 −0.020 0.005 1.61 × 10−4 
Triglycerides −0.148 0.036 5.23 × 10−5 −0.009 0.032 0.781 −0.037 0.026 0.153 −0.100 0.023 2.44 × 10−5 −0.123 0.034 2.86 × 10−4 
Total cholesterol −0.009 0.023 0.696 0.002 0.020 0.905 0.005 0.016 0.741 0.008 0.015 0.574 0.005 0.021 0.816 
LDL cholesterol 0.022 0.025 0.380 0.004 0.022 0.851 0.001 0.018 0.963 0.026 0.016 0.105 0.037 0.024 0.113 
HDL cholesterol −0.048 0.059 0.412 0.016 0.050 0.749 0.078 0.042 0.061 0.011 0.038 0.783 −0.067 0.055 0.222 
SBP −0.008 0.003 0.003 −0.006 0.002 0.020 −0.005 0.002 0.011 −0.004 0.002 0.045 −0.007 0.003 0.006 
DBP −0.012 0.004 0.001 −0.012 0.003 1.08 × 10−4 −0.007 0.003 0.008 −0.003 0.002 0.143 −0.007 0.003 0.032 
cg02988288cg04861640cg06690548cg19693031cg26974062
VariableEffectSEP valueEffectSEP valueEffectSEP valueEffectSEP valueEffectSEP value
Fasting glucose −0.087 0.044 0.045 −0.045 0.038 0.236 −0.049 0.032 0.123 −0.072 0.028 0.011 −0.111 0.041 0.007 
2-h glucose −0.042 0.018 0.020 −0.013 0.016 0.391 −0.001 0.013 0.948 −0.022 0.012 0.063 −0.053 0.017 0.002 
C-peptide −4.89 × 10−4 8.46 × 10−5 1.52 × 10−8 −4.20 × 10−4 7.26 × 10−5 1.32 × 10−8 −3.24 × 10−4 5.99 × 10−5 1.03 × 10−7 −3.01 × 10−4 5.43 × 10−5 4.97 × 10−8 −4.24 × 10−4 7.93 × 10−5 1.40 × 10−7 
Fasting insulin −0.003 0.001 3.40 × 10−6 −0.003 0.001 9.82 × 10−8 −0.003 5.00 × 10−4 2.33 × 10−7 −0.002 4.64 × 10−4 3.44 × 10−7 −0.003 0.001 8.09 × 10−6 
GDM WHO 1999 −0.083 0.076 0.280 −0.070 0.065 0.283 −0.046 0.054 0.401 −0.062 0.049 0.207 −0.129 0.070 0.066 
GDM WHO 2013 −0.117 0.055 0.036 −0.014 0.048 0.764 −0.040 0.040 0.316 −0.105 0.036 0.004 −0.140 0.052 0.007 
BMI −0.023 0.006 6.55 × 10−5 −0.017 0.005 0.001 −0.015 0.004 1.55 × 10−4 −0.013 0.004 3.36 × 10−4 −0.020 0.005 1.61 × 10−4 
Triglycerides −0.148 0.036 5.23 × 10−5 −0.009 0.032 0.781 −0.037 0.026 0.153 −0.100 0.023 2.44 × 10−5 −0.123 0.034 2.86 × 10−4 
Total cholesterol −0.009 0.023 0.696 0.002 0.020 0.905 0.005 0.016 0.741 0.008 0.015 0.574 0.005 0.021 0.816 
LDL cholesterol 0.022 0.025 0.380 0.004 0.022 0.851 0.001 0.018 0.963 0.026 0.016 0.105 0.037 0.024 0.113 
HDL cholesterol −0.048 0.059 0.412 0.016 0.050 0.749 0.078 0.042 0.061 0.011 0.038 0.783 −0.067 0.055 0.222 
SBP −0.008 0.003 0.003 −0.006 0.002 0.020 −0.005 0.002 0.011 −0.004 0.002 0.045 −0.007 0.003 0.006 
DBP −0.012 0.004 0.001 −0.012 0.003 1.08 × 10−4 −0.007 0.003 0.008 −0.003 0.002 0.143 −0.007 0.003 0.032 

DBP, diastolic blood pressure; GDM 2013 (1999), WHO 2013 (1999) criteria for GDM; SBP, systolic blood pressure.

Because the five replicated CpG sites were associated with maternal BMI, we further adjusted the regression models for BMI. The effect sizes weakened by 8–21%, but the direction of the effects persisted and remained nominally significant (P < 0.05) (Supplementary Table 8).

Genetic Variants Associated With Cross-Ancestry CpG Sites in EPIPREG

From meQTL analysis in the EPIPREG sample, we obtained 478 genetic variants related to the five replicated cross-ancestry CpG sites (Supplementary Table 9). The most significant genetic variants from the cross-ancestry meQTL (Table 4) were further explored.

Table 4

Cross-ancestry CpG sites associated with insulin resistance and their associated genetic variants (meQTLs)

CpG informationCross-ancestry
meQTL results**
Most significant cross-ancestry meQTL**
CpGNChrGeneNo. of cis-meQTLNo. of trans-meQTLSNP ID (AL/MA)NChrGeneEffect (SE)*P value
cg02988288 456 TXNIP 37 rs2297976 (G/T) 422 SLC2A1 0.406 (0.04) 1.56 × 10−22 
cg04861640 456 ZSCAN26 374 rs2179174 (A/G) 422 ZSCAN26 −0.457 (0.05) 1.10 × 10−46 
cg06690548 456 SLC7A11 rs7006759 (T/C) 422 PSD3 −0.315 (0.054) 7.43 × 10−9 
cg19693031 456 TXNIP 37 rs17387775 (T/C) 422 SLC2A1 0.247 (0.03) 1.55 × 10−20 
cg26974062 456 TXNIP 28 rs34964576 (T/C) 422 SLC2A1 0.309 (0.04) 8.88 × 10−15 
cg10318066* 456 NA NA NA NA NA NA NA NA NA 
CpG informationCross-ancestry
meQTL results**
Most significant cross-ancestry meQTL**
CpGNChrGeneNo. of cis-meQTLNo. of trans-meQTLSNP ID (AL/MA)NChrGeneEffect (SE)*P value
cg02988288 456 TXNIP 37 rs2297976 (G/T) 422 SLC2A1 0.406 (0.04) 1.56 × 10−22 
cg04861640 456 ZSCAN26 374 rs2179174 (A/G) 422 ZSCAN26 −0.457 (0.05) 1.10 × 10−46 
cg06690548 456 SLC7A11 rs7006759 (T/C) 422 PSD3 −0.315 (0.054) 7.43 × 10−9 
cg19693031 456 TXNIP 37 rs17387775 (T/C) 422 SLC2A1 0.247 (0.03) 1.55 × 10−20 
cg26974062 456 TXNIP 28 rs34964576 (T/C) 422 SLC2A1 0.309 (0.04) 8.88 × 10−15 
cg10318066* 456 NA NA NA NA NA NA NA NA NA 

AL, alternative allele; Chr, chromosome; ID, identifier; MA, minor allele; NA, not applicable.

*

The effect size units are methylation M-values.

**

Results from a meta-analysis between Europeans and South Asians.

Because rs2297976, rs17387775, and rs34964576 associated with TNIPX sites were strongly correlated within each other (R2 = 1 when using either the European or South Asian panels), only rs2297976 was explored further. In EPIPREG, the minor allele of rs2297976 (T) was associated with lower insulin resistance and C-peptide levels (Supplementary Table 10), whereas the minor allele of rs2179174 (G) and rs7006759 (C) had the opposite effect. The minor allele of rs2179174 (G) was also associated with higher fasting insulin and high systolic and diastolic blood pressures. The minor allele of rs7006759 (C) was associated with higher BMI, higher 2-h glucose levels, increased systolic and diastolic blood pressures, and GDM per WHO 1999 criteria (Supplementary Table 10).

We performed look-ups of the meQTLs in GWAS summary data for relevant outcomes (Supplementary Table 11). With nominal significance (P = 0.047 to 1.14 × 10−5), rs2297976 was associated with fasting glucose, 2-h glucose, BMI, fasting insulin, total cholesterol, and LDL cholesterol levels, and both systolic and diastolic blood pressures. The variant rs2179174 was associated with type 1 diabetes, HDL cholesterol level, and systolic blood pressure. The variant rs7006759 was not associated with any of the tested phenotypes.

Mediation Analysis

The average causal mediation effects for all CpG sites were stronger than the average direct effects (Table 5), suggesting that the association between the meQTLs and insulin resistance was mediated by DNA methylation. When studying insulin resistance as a mediator between the meQTLs and DNA methylation, average direct effects were stronger than the average causal mediation effects for all the CpG sites (Table 5).

Table 5

Mediation analyses to evaluate the relationships among the meQTLs, DNA methylation, and insulin resistance

meQTLCpGACME (95%CI)P valueADE (95%CI)P valueTotal effect (95%CI)P value
Causal mediation when using DNA methylation as a mediator between the meQTL and insulin resistance 
 rs2297976 cg02988288 −0.131 (−0.190, −0.080) <2 × 10−16 −2.08 × 10−5 (−0.114, 0.110) 0.998 −0.131 (−0.237, −0.030) 0.013 
 rs2179174 cg04861640 0.181 (0.102, 0.260) <2 × 10−16 0.001 (−0.127, 0.130) 0.978 0.182 (0.076, 0.290) 0.0003 
 rs7006759 cg06690548 0.130 (0.067, 0.210) <2 × 10−16 0.177 (−0.015, 0.370) 0.071 0.308 (0.117, 0.500) 0.002 
 rs2297976 cg19693031 −0.110 (−0.165, −0.060) <2 × 10−16 −0.022 (−0.133, 0.090) 0.707 −0.131 (−0.238, −0.020) 0.016 
 rs2297976 cg26974062 −0.085 (−0.133, −0.040) <2 × 10−16 −0.047 (−0.157, −0.060) 0.408 −0.132 (−0.238, −0.030) 0.016 
Causal mediation when using insulin resistance as a mediator between the meQTL and DNA methylation 
 rs2297976 cg02988288 0.024 (0.004, 0.050) 0.016 0.384 (0.304, 0.460) <2 × 10−16 0.407 (0.327, 0.490) <2 × 10−16 
 rs2179174 cg04861640 −0.023 (−0.041, −0.010) 0.002 −0.437 (−0.495, −0.380) <2 × 10−16 −0.460 (−0.518, −0.40) <2 × 10−16 
 rs7006759 cg06690548 −0.043 (−0.078, −0.010) 0.002 −0.259 (−0.364, −0.150) <2 × 10−16 −0.302 (−0.410, −0.190) <2 × 10−16 
 rs2297976 cg19693031 0.015 (0.002, 0.030) 0.016 0.224 (0.173, 0.280) <2 × 10−16 0.239 (0.187, 0.290) <2 × 10−16 
 rs2297976 cg26974062 0.019 (0.003, 0.040) 0.016 0.276 (0.201, 0.350) <2 × 10−16 0.295 (0.219, 0.370) <2 × 10−16 
meQTLCpGACME (95%CI)P valueADE (95%CI)P valueTotal effect (95%CI)P value
Causal mediation when using DNA methylation as a mediator between the meQTL and insulin resistance 
 rs2297976 cg02988288 −0.131 (−0.190, −0.080) <2 × 10−16 −2.08 × 10−5 (−0.114, 0.110) 0.998 −0.131 (−0.237, −0.030) 0.013 
 rs2179174 cg04861640 0.181 (0.102, 0.260) <2 × 10−16 0.001 (−0.127, 0.130) 0.978 0.182 (0.076, 0.290) 0.0003 
 rs7006759 cg06690548 0.130 (0.067, 0.210) <2 × 10−16 0.177 (−0.015, 0.370) 0.071 0.308 (0.117, 0.500) 0.002 
 rs2297976 cg19693031 −0.110 (−0.165, −0.060) <2 × 10−16 −0.022 (−0.133, 0.090) 0.707 −0.131 (−0.238, −0.020) 0.016 
 rs2297976 cg26974062 −0.085 (−0.133, −0.040) <2 × 10−16 −0.047 (−0.157, −0.060) 0.408 −0.132 (−0.238, −0.030) 0.016 
Causal mediation when using insulin resistance as a mediator between the meQTL and DNA methylation 
 rs2297976 cg02988288 0.024 (0.004, 0.050) 0.016 0.384 (0.304, 0.460) <2 × 10−16 0.407 (0.327, 0.490) <2 × 10−16 
 rs2179174 cg04861640 −0.023 (−0.041, −0.010) 0.002 −0.437 (−0.495, −0.380) <2 × 10−16 −0.460 (−0.518, −0.40) <2 × 10−16 
 rs7006759 cg06690548 −0.043 (−0.078, −0.010) 0.002 −0.259 (−0.364, −0.150) <2 × 10−16 −0.302 (−0.410, −0.190) <2 × 10−16 
 rs2297976 cg19693031 0.015 (0.002, 0.030) 0.016 0.224 (0.173, 0.280) <2 × 10−16 0.239 (0.187, 0.290) <2 × 10−16 
 rs2297976 cg26974062 0.019 (0.003, 0.040) 0.016 0.276 (0.201, 0.350) <2 × 10−16 0.295 (0.219, 0.370) <2 × 10−16 

ACME, average causal mediation effects; ADE, average direct effect.

Relationships With Gene Expression

In MyoGlu, SLC7A11 expression was associated with higher insulin sensitivity (low insulin resistance) in muscle and with low insulin sensitivity in adipose tissue. ZSCAN26 expression was positively associated with high insulin sensitivity in both tissues. From expression quantitative trait methylation look-ups in data from the Biobank-based Integrative Omics Studies (BIOS) consortium higher methylation in cg19693031 in peripheral blood leukocytes was associated with lower TXNIP expression.

From eQTL look-ups in GTEx, the T allele of rs2297976 in SLC2A1 was associated with increased SLC2A1-AS1 expression levels in whole blood, pancreas, both subcutaneous and omental adipose tissue, and for SLC2A1 in pancreas (Supplementary Table 12). The C allele of the cis variant rs2179174 in ZSCAN26 was associated with increased expression of several genes from the zinc-finger family in all the tissues tested (Supplementary Table 12).

We have identified six cross-ancestry CpG sites inversely associated with insulin resistance during pregnancy in peripheral blood leukocytes, of which five were replicated in at least one of three independent cohorts. Both the CpG sites and their associated meQTLs were associated with several cardiometabolic phenotypes in the EPIPREG sample as well as in publicly available GWAS summary data. The meQTLs were also eQTLs in whole blood, subcutaneous adipose tissue, omental adipose tissue, liver, skeletal muscle, and pancreas. On the basis of mediation analysis and look-ups on gene expression, it can be inferred that the association between the meQTLs and insulin resistance was mediated by DNA methylation, presumably through modifying gene expression.

Low methylation of cg06690548 in SLC7A11 in peripheral blood leukocytes has been related to high fasting insulin levels in a nonpregnant population (13). SLC7A11 encodes a cysteine/glutamate plasma membrane transporter for the cellular uptake of cysteine in exchange for glutamate (41). Interestingly, cysteine, glutamate, and glycine conform glutathione, an antioxidant that may be deficient in type 2 diabetes, probably due to inefficient protein turnover (42). Notably, SLC7A11’s mRNA correlated positively in muscle and negatively in adipose tissue with insulin sensitivity, implying that in muscle, high cysteine/glutamate transport may be associated with low insulin resistance and with high insulin resistance in adipose tissue. Although low methylation in cg06690548 has been associated with high SLC7A11 mRNA in blood (43), the gene expression results should be interpreted with caution, because the association between SLC7A11’s mRNA and insulin resistance in peripheral blood leukocytes is currently unknown.

In accordance with our phenotype association analysis, cg06690548 has been related to BMI (44) and systolic and diastolic blood pressures (43). Notably its meQTL rs7006759 is located in PSD3, a gene that harbors genetic variants that have been associated with type 2 diabetes (45). Because rs7006759 and cg06690548 are located on different chromosomes, it is not clear how they may interact to regulate insulin resistance. Interchromosomal associations are the most common reported trans-meQTLs, and it is possible that these trans associations are in long-range linkage disequilibrium with other cis-meQTLs (46). Another possible mechanism is that trans-meQTL could target genes due to chromatin loops which could make them spatially close, thus mimicking a cis-meQTL effect (47). However, whether these mechanisms are true for rs7006759 and cg06690548 remains to be elucidated.

Regarding cg04861640 in ZSCAN26, a previous study reported that methylation of cg04861640 in peripheral blood leukocytes was decreased in patients with diabetes type 1 who had nephropathy (48), suggesting a possible role in the detrimental effects of diabetes. Notably, cg04861640 has been related to rheumatoid arthritis (49) and Sjögren syndrome (50), and its cis-meQTL, rs2179174, is associated with rheumatoid arthritis (51); both are autoimmune diseases recognized by inflammation. These and other autoimmune disorders have been associated with increased type 2 diabetes risk (52). Hence, type 2 diabetes may share mechanisms with a wide spectrum of autoimmune disorders, although the mechanisms are not well understood (52).

Because low methylation in promoters is usually associated with increased transcription (53), we can infer that lower methylation of cg04861640 might lead to higher expression of ZSCAN26, which, in turn, would increase insulin resistance. However, contrasting these possible relationships, high ZSCAN26 mRNA was associated with increased insulin sensitivity in the MyoGlu data. The effect of ZSCAN26 mRNA in peripheral blood leukocytes on insulin resistance should be elucidated to make further conclusions. Moreover, rs2179174 was an eQTL for pancreas, blood, and both omental and subcutaneous adipose tissue for several genes of the zinc-finger family, suggesting an effect on transcription of ZSCAN26 across tissues, although the implications in the context of insulin resistance and related traits are unknown.

Three of the cross-ancestry CpG sites were located in TXNIP, a gene that may induce β-cell apoptosis at high glucose levels, which may promote development of diabetes (54). Consistent with our findings, reduced methylation of cg19693031 in peripheral blood leukocytes was associated with higher insulin resistance in a general Mexican-ancestry, nonpregnant population (12). Furthermore, reduced blood methylation in cg19693031 has been observed in type 2 diabetes in a recent European-based EWAS meta-analysis (9) and in a cross-ancestry EWAS of Europeans and Indian Asians (55). Furthermore, reduced methylation of cg02988288 and cg26974062 in peripheral blood leukocytes has been associated with type 2 diabetes in Filipino individuals (56). In cord blood, reduced methylation at cg02988288 and cg26974062 was positively associated with glucose area under the curve (16) and a principal component composed of both insulin and glucose variables from an oral glucose tolerance test (57).

Low methylation in peripheral blood leukocytes of the three TXNIP CpG sites is related to high TXNIP transcription in type 2 diabetes (56), suggesting TXNIP’s mRNA levels increase in conditions with insulin resistance. TXNIP’s meQTL rs2297976 in SLC2A1 encodes the GLUT1 receptor, which is a major glucose transporter ubiquitously expressed in all tissues, especially in the blood-brain barrier and erythrocytes (58). Interestingly, SLC2A1 has a negative relationship with TXNIP’s mRNA, because SLC2A1 expression is repressed by TXNIP in the presence of glucose (55). The rs2297976 variant was also an eQTL for pancreatic tissue; hence, it is possible that rs2297976 regulates TXNIP transcription through DNA methylation in pancreas.

In line with our phenotype associations, reduced methylation levels at cg19693031 in peripheral blood leukocytes have been related to fasting glucose (13), triglyceride (59), and systolic blood pressure (60) values. Furthermore, methylation at cg19693031 is also reduced in people with metabolic syndrome (61). Thus, cg19693031 is consistently associated with a wide spectrum of cardiometabolic traits.

We replicated five cross-ancestry sites in at least one replication cohort, but only cg19693031 had a concordant direction of the effect across all three replication cohorts. This could be due to underlying differences between cohorts, or to statistical power. Most of the replicated sites were found in BiB, which had the largest sample size, an equivalent ancestry composition, and pregnant women. Furthermore, MENA and Botnia included both sexes, which may have played a role. None of the CpG sites identified in the European EWAS were replicated, suggesting that these may be false positives driven by the moderate inflation observed in this analysis.

Mediation analysis indicated that all five CpG sites mediated the association between the meQTLs and insulin resistance. That gene expression of TXNIP is associated with type 2 diabetes in peripheral blood leukocytes, whereas gene expression of SLC7A11 and ZSCAN26 in both muscle and adipose tissue is associated with insulin sensitivity, may suggest that the meQTLs alter DNA methylation, subsequently altering gene expression, which ultimately may affect insulin resistance (Fig. 3 and Supplementary Fig. 4). However, transcription levels may not necessarily directly affect insulin resistance, because other possible pathways and traits may be involved. The meQTLs were not strongly associated with insulin resistance independently of DNA methylation; hence, other environmental factors may influence the methylation of these CpG sites, which, in turn, could influence insulin resistance. Hence, Mendelian randomization studies with robust instruments should be performed to explore causality.

Figure 3

Proposed directionality of the meQTLs and CpG sites associated with insulin resistance (IR) based on the causal mediation analysis conclusions. The meQTLs would influence DNA methylation (CpG), whereas DNA methylation regulates IR potentially through gene expression (mRNA).

Figure 3

Proposed directionality of the meQTLs and CpG sites associated with insulin resistance (IR) based on the causal mediation analysis conclusions. The meQTLs would influence DNA methylation (CpG), whereas DNA methylation regulates IR potentially through gene expression (mRNA).

Close modal

Major strengths of this work are the cross-ancestry approach and the successful replication in independent cohorts. Integrating data from publicly available sources enabled us to leverage more information about the CpG sites identified in this study. Important limitations include the cross-sectional design and observational nature, which make it difficult to draw a conclusion about the directionality of the associations. Because of the relatively small sample size, we are only able to detect large and medium effect sizes for the association between DNA methylation and insulin resistance. Insulin resistance was calculated with different methods across the replication cohorts, and MENA and Botnia included both men and nonpregnant women, which may partly explain why some sites were not consistently replicated across all cohorts.

In conclusion, the cross-ancestry EWAS identified five CpG sites associated with insulin resistance, which were replicated in independent cohorts. These five CpG sites showed a decrease in methylation at high insulin resistance levels and were genetically regulated. Both the CpG sites and their meQTLs showed associations with several cardiometabolic phenotypes, whereas the meQTLs were eQTLs for different tissues. Furthermore, expression of the genes in which the CpG sites are annotated showed associations with cardiometabolic traits. These relationships, along with the mediation analysis, suggest that the gene variants may affect insulin resistance levels through DNA methylation, presumably by altering gene transcription.

This article contains supplementary material online at https://doi.org/10.2337/figshare.21733004.

R.B.P. and C.S. contributed equally as senior authors on this work.

Acknowledgments. The authors thank the women who participated in the STORK Groruddalen study, Maria Sterner, Malin Neptin, and Gabriella Gremsperger at the Genomics Diabetes and Endocrinology CRC, Malmö, for the wet laboratory experiments of the bead chips, and Leif C. Groop, Lund University Diabetes Centre, Malmö, Sweden, for facilitating the wet laboratory experiments. The BiB study is only possible because of the enthusiasm and commitment of the Children and Parents in BiB. We are grateful to all the participants, teachers, school staff, health professionals and researchers who have made BiB happen.

Funding. EPIPREG is supported by the South-Eastern Norway Regional Health Authority (grant 2019092), and the Norwegian Diabetes Association. BiB receives core infrastructure funding from the Wellcome Trust (WT101597MA), a joint grant from the UK Medical Research Council (MRC) and Economic and Social Science Research Council (MR/N024397/1), the British Heart Foundation (CS/16/4/32482), and the National Institute for Health Research under its Applied Research Collaboration for Yorkshire and Humber and the Clinical Research Network. DNA methylation data were funded by the UK MRC via the Integrative Epidemiology Unit (MC_UU_12013/5). H.R.E. works in the MRC Integrative Epidemiology Unit at the University of Bristol, which is supported by the MRC and the University of Bristol (MC_UU_00011/5). R.P.B. received funding form EFSD/Novo Nordisk Programme for Diabetes Research in Europe, the Swedish Research Council (2021-02623), and the Crafoord Foundation. The Botnia Study has been financially supported by grants from the Sigrid Juselius Foundation, Folkhalsan Research Foundation, Nordic Center of Excellence in Disease Genetics, EU (EXGENESIS), Finnish Diabetes Research Foundation, Foundation for Life and Health in Finland, Finnish Medical Society, Helsinki University Central Hospital Research Foundation, Perklén Foundation, Ollqvist Foundation, and Narpes Health Care Foundation. The study has also been supported by the Municipal Heath Care Center and Hospital in Jakobstad and Health Care Centers in Vasa, Narpes, and Korsholm.

Duality of Interest. No potential conflicts of interest relevant to this article were reported.

The funders played no role in the study design and conduct, in these analyses, or in the decision to submit the manuscript for publication.

Author Contributions. N.F.-B., C.S., and R.B.P. contributed to the study conceptualization and design of this sub-study. C.S., E.Q., and K.I.B. conceptualized and C.S. and K.I.B. designed the EPIPREG sample. C.S., E.Q., and K.I.B. acquired genotype and DNA methylation data in STORK G. N.F.-B. conducted the statistical analyses in EPIPREG, drafted the manuscript, performed the analyses in MENA, and did the postimputation QC. H.R.E. contributed to study design and conducted the analysis in the BiB cohort. R.B.P. conducted the analysis in Botnia and realized the wet laboratory experiments regarding the methylation and genotyping chips. S.L.-Ø. curated EPIPREG data and performed the correlation analyses in the MyoGlu study. J.O.O. contributed to the data visualization and interpretation. G.-H.M. performed the QC of the genomic data and imputation in EPIPREG. K.I.B. and A.K.J. designed the STORK G project. L.S. contributed with data acquisition in STORK-G. C.A.D. and K.I.B. designed and administered the MyoGlu study. C.S. is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. All authors revised the manuscript critically and approved the final version.

Prior Presentation. This work was presented at the 57th European Association for the Study of Diabetes Annual Meeting, virtual, 28 September to 1 October 2021.

1.
DeFronzo
RA
.
Insulin resistance, lipotoxicity, type 2 diabetes and atherosclerosis: the missing links. The Claude Bernard Lecture 2009
.
Diabetologia
2010
;
53
:
1270
1287
2.
Cerf
ME
.
Beta cell dysfunction and insulin resistance
.
Front Endocrinol (Lausanne)
2013
;
4
:
37
3.
Catalano
PM
,
Tyzbir
ED
,
Roman
NM
,
Amini
SB
,
Sims
EAH
.
Longitudinal changes in insulin release and insulin resistance in nonobese pregnant women
.
Am J Obstet Gynecol
1991
;
165
:
1667
1672
4.
Kampmann
U
,
Knorr
S
,
Fuglsang
J
,
Ovesen
P
.
Determinants of maternal insulin resistance during pregnancy: an updated overview
.
J Diabetes Res
2019
;
2019
:
5320156
5.
Buchanan
TA
,
Xiang
AH
.
Gestational diabetes mellitus
.
J Clin Invest
2005
;
115
:
485
491
6.
Vounzoulaki
E
,
Khunti
K
,
Abner
SC
,
Tan
BK
,
Davies
MJ
,
Gillies
CL
.
Progression to type 2 diabetes in women with a known history of gestational diabetes: systematic review and meta-analysis
.
BMJ
2020
;
369
:
m1361
7.
Mahajan
A
,
Taliun
D
,
Thurner
M
, et al
.
Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps
.
Nat Genet
2018
;
50
:
1505
1513
8.
Beulens
JWJ
,
Pinho
MGM
,
Abreu
TC
, et al
.
Environmental risk factors of type 2 diabetes-an exposome approach
.
Diabetologia
2022
;
65
:
263
274
9.
Juvinao-Quintero
DL
,
Marioni
RE
,
Ochoa-Rosales
C
, et al
.
DNA methylation of blood cells is associated with prevalent type 2 diabetes in a meta-analysis of four European cohorts
.
Clin Epigenetics
2021
;
13
:
40
10.
Cardona
A
,
Day
FR
,
Perry
JRB
, et al
.
Epigenome-wide association study of incident type 2 diabetes in a British Population: EPIC-Norfolk Study
.
Diabetes
2019
;
68
:
2315
2326
11.
Arpón
A
,
Milagro
FI
,
Ramos-Lopez
O
, et al
.
Epigenome-wide association study in peripheral white blood cells involving insulin resistance
.
Sci Rep
2019
;
9
:
2445
12.
Kulkarni
H
,
Kos
MZ
,
Neary
J
, et al
.
Novel epigenetic determinants of type 2 diabetes in Mexican-American families
.
Hum Mol Genet
2015
;
24
:
5330
5344
13.
Liu
J
,
Carnero-Montoro
E
,
van Dongen
J
, et al
.
An integrative cross-omics analysis of DNA methylation sites of glucose and insulin homeostasis
.
Nat Commun
2019
;
10
:
2581
14.
Antoun
E
,
Kitaba
NT
,
Titcombe
P
, et al.;
UPBEAT Consortium
.
Maternal dysglycaemia, changes in the infant’s epigenome modified with a diet and physical activity intervention in pregnancy: secondary analysis of a randomised control trial
.
PLoS Med
2020
;
17
:
e1003229
15.
Howe
CG
,
Cox
B
,
Fore
R
, et al
.
Maternal gestational diabetes mellitus and newborn DNA methylation: findings from the Pregnancy and Childhood Epigenetics Consortium
.
Diabetes Care
2020
;
43
:
98
105
16.
Tobi
EW
,
Juvinao-Quintero
DL
,
Ronkainen
J
, et al
.
Maternal glycemic dysregulation during pregnancy and neonatal blood DNA methylation: meta-analyses of epigenome-wide association studies
.
Diabetes Care
2022
;
45
:
614
623
17.
Arora
GP
,
Åkerlund
M
,
Brøns
C
, et al
.
Phenotypic and genotypic differences between Indian and Scandinavian women with gestational diabetes mellitus
.
J Intern Med
2019
;
286
:
192
206
18.
Rai
AS
,
Sletner
L
,
Jenum
AK
, et al
.
Identifying women with gestational diabetes based on maternal characteristics: an analysis of four Norwegian prospective studies
.
BMC Pregnancy Childbirth
2021
;
21
:
615
19.
Kanaya
AM
,
Herrington
D
,
Vittinghoff
E
, et al
.
Understanding the high prevalence of diabetes in U.S. South Asians compared with four racial/ethnic groups: the MASALA and MESA studies
.
Diabetes Care
2014
;
37
:
1621
1628
20.
Prasad
RB
,
Asplund
O
,
Shukla
SR
, et al
.
Subgroups of patients with young-onset type 2 diabetes in India reveal insulin deficiency as a major driver
.
Diabetologia
2022
;
65
:
254
21.
Hidalgo
B
,
Irvin
MR
,
Sha
J
, et al
.
Epigenome-wide association study of fasting measures of glucose, insulin, and HOMA-IR in the Genetics of Lipid Lowering Drugs and Diet Network study
.
Diabetes
2014
;
63
:
801
807
22.
Kriebel
J
,
Herder
C
,
Rathmann
W
, et al
.
Association between DNA methylation in whole blood and measures of glucose metabolism: KORA F4 Study
.
PLoS One
2016
;
11
:
e0152314
23.
Jenum
AK
,
Sletner
L
,
Voldner
N
, et al
.
The STORK Groruddalen research programme: a population-based cohort study of gestational diabetes, physical activity, and obesity in pregnancy in a multiethnic population. Rationale, methods, study population, and participation rates
.
Scand J Public Health
2010
;
38
:
60
70
24.
Fragoso-Bargas
N
,
Opsahl
JO
,
Kiryushchenko
N
, et al
.
Cohort profile: Epigenetics in Pregnancy (EPIPREG) - population-based sample of European and South Asian pregnant women with epigenome-wide DNA methylation (850k) in peripheral blood leukocytes
.
PLoS One
2021
;
16
:
e0256158
25.
Mørkrid
K
,
Jenum
AK
,
Sletner
L
, et al
.
Failure to increase insulin secretory capacity during pregnancy-induced insulin resistance is associated with ethnicity and gestational diabetes
.
Eur J Endocrinol
2012
;
167
:
579
588
26.
Sharma
A
,
Lee-Ødegård
S
,
Qvigstad
E
, et al
.
β-Cell function, hepatic insulin clearance, and insulin sensitivity in South Asian and Nordic Women after gestational diabetes mellitus
.
Diabetes
2022
;
71
:
2530
2538
27.
Houseman
EA
,
Accomando
WP
,
Koestler
DC
, et al
.
DNA methylation arrays as surrogate measures of cell mixture distribution
.
BMC Bioinformatics
2012
;
13
:
86
28.
Pidsley
R
,
Zotenko
E
,
Peters
TJ
, et al
.
Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling
.
Genome Biol
2016
;
17
:
208
29.
Auton
A
,
Brooks
LD
,
Durbin
RM
, et al.;
1000 Genomes Project Consortium
.
A global reference for human genetic variation
.
Nature
2015
;
526
:
68
74
30.
Born in Bradford Collaborative Group
.
Born in Bradford, a cohort study of babies born in Bradford, and their parents: protocol for the recruitment phase
.
BMC Public Health
2008
;
8
:
327
31.
Wright
J
,
Small
N
,
Raynor
P
, et al.;
Born in Bradford Scientific Collaborators Group
.
Cohort pProfile: the Born in Bradford multi-ethnic family cohort study
.
Int J Epidemiol
2013
;
42
:
978
991
32.
Kuznetsova
A
,
Brockhoff
PB
,
Christensen
RHB
.
lmerTest Package: tests in linear mixed effects models
.
J Stat Softw
2017
;
82
:
1
26
33.
Peterson
RE
,
Kuchenbaecker
K
,
Walters
RK
, et al
.
Genome-wide association studies in ancestrally diverse populations: opportunities, methods, pitfalls, and recommendations
.
Cell
2019
;
179
:
589
603
34.
Marigorta
UM
,
Navarro
A
.
High trans-ethnic replicability of GWAS results implies common causal variants
.
PLoS Genet
2013
;
9
:
e1003566
35.
Schaid
DJ
,
Chen
W
,
Larson
NB
.
From genome-wide associations to candidate causal variants by statistical fine-mapping
.
Nat Rev Genet
2018
;
19
:
491
504
36.
Pan
H
,
Holbrook
JD
,
Karnani
N
,
Kwoh
CK
.
Gene, Environment and Methylation (GEM): a tool suite to efficiently navigate large scale epigenome wide association studies and integrate genotype and interaction between genotype and environment
.
BMC Bioinformatics
2016
;
17
:
299
37.
Willer
CJ
,
Li
Y
,
Abecasis
GR
.
METAL: fast and efficient meta-analysis of genomewide association scans
.
Bioinformatics
2010
;
26
:
2190
2191
38.
Langleite
TM
,
Jensen
J
,
Norheim
F
, et al
.
Insulin sensitivity, body composition and adipose depots following 12 w combined endurance and strength training in dysglycemic and normoglycemic sedentary men
.
Arch Physiol Biochem
2016
;
122
:
167
179
39.
Lee
S
,
Gulseth
HL
,
Langleite
TM
, et al
.
Branched-chain amino acid metabolism, insulin sensitivity and liver fat response to exercise training in sedentary dysglycaemic and normoglycaemic men
.
Diabetologia
2021
;
64
:
410
423
40.
Lee
S
,
Norheim
F
,
Langleite
TM
,
Gulseth
HL
,
Birkeland
KI
,
Drevon
CA
.
Effects of long-term exercise on plasma adipokine levels and inflammation-related gene expression in subcutaneous adipose tissue in sedentary dysglycaemic, overweight men and sedentary normoglycaemic men of healthy weight
.
Diabetologia
2019
;
62
:
1048
1064
41.
Koppula
P
,
Zhang
Y
,
Zhuang
L
,
Gan
B
.
Amino acid transporter SLC7A11/xCT at the crossroads of regulating redox homeostasis and nutrient dependency of cancer
.
Cancer Commun (Lond)
2018
;
38
:
12
42.
Sekhar
RV
,
McKay
SV
,
Patel
SG
, et al
.
Glutathione synthesis is diminished in patients with uncontrolled diabetes and restored by dietary supplementation with cysteine and glycine
.
Diabetes Care
2011
;
34
:
162
167
43.
Richard
MA
,
Huan
T
,
Ligthart
S
, et al.;
BIOS Consortium
.
DNA methylation analysis identifies loci for blood pressure regulation
.
Am J Hum Genet
2017
;
101
:
888
902
44.
Wahl
S
,
Drong
A
,
Lehne
B
, et al
.
Epigenome-wide association study of body mass index, and the adverse outcomes of adiposity
.
Nature
2017
;
541
:
81
86
45.
Gong
S
,
Xu
C
,
Wang
L
, et al
.
Genetic association analysis of polymorphisms in PSD3 gene with obesity, type 2 diabetes, and HDL cholesterol
.
Diabetes Res Clin Pract
2017
;
126
:
105
114
46.
Villicaña
S
,
Bell
JT
.
Genetic impacts on DNA methylation: research findings and future perspectives
.
Genome Biol
2021
;
22
:
127
47.
Hoffmann
A
,
Ziller
M
,
Spengler
D
.
The future is the past: methylation QTLs in schizophrenia
.
Genes (Basel)
2016
;
7
:
104
48.
Smyth
LJ
,
Patterson
CC
,
Swan
EJ
,
Maxwell
AP
,
McKnight
AJ
.
DNA methylation associated with diabetic kidney disease in blood-derived DNA
.
Front Cell Dev Biol
2020
;
8
:
561907
49.
Liu
Y
,
Aryee
MJ
,
Padyukov
L
, et al
.
Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis
.
Nat Biotechnol
2013
;
31
:
142
147
50.
Imgenberg-Kreuz
J
,
Sandling
JK
,
Almlöf
JC
, et al
.
Genome-wide DNA methylation analysis in multiple tissues in primary Sjögren’s syndrome reveals regulatory effects at interferon-induced genes
.
Ann Rheum Dis
2016
;
75
:
2029
2036
51.
Okada
Y
,
Wu
D
,
Trynka
G
, et al.;
RACI consortium
;
GARNET consortium
.
Genetics of rheumatoid arthritis contributes to biology and drug discovery
.
Nature
2014
;
506
:
376
381
52.
Hemminki
K
,
Liu
X
,
Försti
A
,
Sundquist
J
,
Sundquist
K
,
Ji
J
.
Subsequent type 2 diabetes in patients with autoimmune disease
.
Sci Rep
2015
;
5
:
13871
53.
Clermont
P-L
,
Parolia
A
,
Liu
HH
,
Helgason
CD
.
DNA methylation at enhancer regions: novel avenues for epigenetic biomarker development
.
Front Biosci (Landmark Ed)
2016
;
21
:
430
446
54.
Cha-Molstad
H
,
Saxena
G
,
Chen
J
,
Shalev
A
.
Glucose-stimulated expression of Txnip is mediated by carbohydrate response element-binding protein, p300, and histone H4 acetylation in pancreatic beta cells
.
J Biol Chem
2009
;
284
:
16898
16905
55.
Chambers
JC
,
Loh
M
,
Lehne
B
, et al
.
Epigenome-wide association of DNA methylation markers in peripheral blood from Indian Asians and Europeans with incident type 2 diabetes: a nested case-control study
.
Lancet Diabetes Endocrinol
2015
;
3
:
526
534
56.
Albao
DS
,
Cutiongco-de la Paz
EM
,
Mercado
ME
, et al
.
Methylation changes in the peripheral blood of Filipinos with type 2 diabetes suggest spurious transcription initiation at TXNIP
.
Hum Mol Genet
2019
;
28
:
4208
4218
57.
Juvinao-Quintero
DL
,
Cardenas
A
,
Perron
P
,
Bouchard
L
,
Lutz
SM
,
Hivert
MF
.
Associations between an integrated component of maternal glycemic regulation in pregnancy and cord blood DNA methylation
.
Epigenomics
2021
;
13
:
1459
1472
58.
Pragallapati
S
,
Manyam
R
.
Glucose transporter 1 in health and disease
.
J Oral Maxillofac Pathol
2019
;
23
:
443
449
59.
Hedman
AK
,
Mendelson
MM
,
Marioni
RE
,
Gustafsson
S
,
Joehanes
R
,
Irvin
MR
, et al
.
Epigenetic patterns in blood associated with lipid traits predict incident coronary heart disease events and are enriched for results from genome-wide association studies
.
Circ Cardiovasc Genet
2017
;
10
:
e001487
60.
Huang
Y
,
Ollikainen
M
,
Muniandy
M
, et al
.
Identification, heritability, and relation with gene expression of novel DNA methylation loci for blood pressure
.
Hypertension
2020
;
76
:
195
205
61.
Nuotio
M-L
,
Pervjakova
N
,
Joensuu
A
, et al
.
An epigenome-wide association study of metabolic syndrome and its components
.
Sci Rep
2020
;
10
:
20567
Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered. More information is available at https://www.diabetesjournals.org/journals/pages/license.