Type 1 diabetes (T1D) polygenic risk scores (PRS) effectively discriminate T1D from both type 2 and monogenic diabetes (1) and can aid identification of T1D in the population, serving as a valuable tool for screening and intervention trials (2). The Genetic Risk Score 2 (GRS2) (3) combining 67 single nucleotide variants is widely adopted in translational studies, yet absence of a consistent method for generation is a barrier to broader clinical and research utility. Study-specific differences limit the availability of variants, particularly those in the HLA region (6p21.3) with strong disease-modifying effects. Differences in genotyping coverage and accuracy arise from the choice of technology (arrays or whole-genome sequencing [WGS]) and access to reference panels used to infer missing genotypes (imputation), often constrained by global data access restrictions. This variability limits standardized risk assessment across research and clinical studies. We present GRS2x, an accessible approach for standardized measurement of T1D polygenic risk.

We used three widely available reference panels (1000 Genomes, Haplotype Reference Consortium [HRC], and Trans-Omics for Precision Medicine [TOPMed] [versions TOPMed-r2 and TOPMed-r3]) to replace variants from GRS2. Insertions and deletions (more than two base pairs), multinucleotide variants, and variants not in all reference panels were replaced (19 of 67), with linkage disequilibrium (as previously described [3]). Groups of variants partitioned by genetic locus (HLA class II, HLA class I, and non-HLA) were used to define partitioned polygenic scores (pPS), with total GRS2x calculated as their sum (Fig. 1A). We implemented algorithms to address two critical limitations: 1) incomplete sets of variants prohibit standardized risk assessment and 2) variants marking HLA haplotypes with imperfect linkage disequilibrium can lead to uncertain predictions (more than two HLA haplotypes) in individuals, often resulting in exclusion. Mean effect contribution for missing variants was estimated on the basis of Hardy-Weinberg equilibrium, based on reference frequencies (TOPMed). After assigning HLA predictions, we iteratively removed least likely genotypes based on population frequency (NMPD [National Marrow Donor Program]). We then applied normalization to rescale GRS2x based on theoretical minimum and maximum relative risk values. Performance in identifying cases was assessed using area under the receiver operating characteristic curve (AUROC).

Figure 1

A: Construction of the GRS2x, the sum of partitions defined by locus, after application of missing variant score estimation and normalization. B: pPS grouped by locus are independently discriminative of T1D in individuals from the UKB of European ancestry (EUR) with WGS data. C: GRS2x discriminative performance in comparison with that of GRS2 (Type 1 Diabetes Genetics Consortium [T1DGC]), with stratification by genotyping technology (UKB), and with application to MA (AoU).

Figure 1

A: Construction of the GRS2x, the sum of partitions defined by locus, after application of missing variant score estimation and normalization. B: pPS grouped by locus are independently discriminative of T1D in individuals from the UKB of European ancestry (EUR) with WGS data. C: GRS2x discriminative performance in comparison with that of GRS2 (Type 1 Diabetes Genetics Consortium [T1DGC]), with stratification by genotyping technology (UKB), and with application to MA (AoU).

Close modal

We measured case-control discriminative performance of GRS2x using array genotyping from the Type 1 Diabetes Genetics Consortium (T1D 6,483, control 9,247; age of onset <20 years) after TOPMed imputation. Discrimination of T1D improved marginally (AUROC: GRS2x 0.924 vs. GRS2 0.922; P = 2.96 × 10−10) and remained consistent across reference panels, matching results previously reported for GRS2 after 1000 Genomes plus HRC imputation (3) (Fig. 1C).

We further validated GRS2x, with stratification by genotyping technology of the UK Biobank (UKB) (4). We replicated published case-control definitions (T1D 408, control 365,622) (3) from electronic health records (EHR) for European-ancestry individuals with array (n = 367,318) and/or WGS (n = 366,030) genotyping available. Performance of GRS2x in identifying T1D was stable (AUROC: array 0.931 vs. WGS 0.932), despite the absence of three variants in the array data set (Fig. 1C). In individuals with genotypes from both sources, GRS2x was highly concordant (R = 0.96) and rank order of individual-level risk predictions was preserved (ρ = 0.96). GRS2x further resolved conflicting HLA class II genotypes, allowing score assignment for additional individuals (n = 5,550 [1.14%]). Most T1D discrimination was driven by pPS encoding HLA class II (AUROC 0.90), over HLA class I and non-HLA variants (AUROC 0.55 and 0.75, respectively) (Fig. 1B).

For assessment of GRS2x multiancestry performance, case-control definitions (T1D 147, control 317,817) were generated in the All of Us Research Program (AoU) (5) (EHR-recorded data: T1D, age at EHR incidence <30 years, prescribed insulin, no noninsulin glucose-lowering agents). Sensitivity and specificity were limited by incomplete EHR data and the use of age at recorded incidence as a proxy for juvenile onset. Identification of T1D in the multiancestry population (MA) was strong but reduced in comparison with European-only ancestry population (EUR) (nT1D = 103) (AUROC: MA 0.860, EUR 0.895) (Fig. 1C), though sample sizes were insufficient for further ancestry stratification. Rank-based concordance between GRS2x and GRS2 was marginally lower for non-Europeans (ρEUR = 0.906, ρnon-EUR = 0.900; P < 2.22 × 10−16), suggesting subtle ancestry-related shifts in individual-level risk precision.

We have demonstrated that GRS2x enables standardized measurement of T1D polygenic risk across diverse studies. While complementary efforts aim to enhance discriminative performance, addressing practical limitations provides immediate utility in reducing study-specific variability. Standardization is crucial for clinical translation; however, validated reference ranges must be derived before thresholds for clinical action can be applied. Stabilizing individual-level precision is also critical for clinical consistency. Both individual- and population-level performance may improve with incorporation of greater ancestry-related allelic diversity, particularly from HLA, where attributable risk is variable by ancestry and age of onset. Since our study was limited to early-onset cases, performance for late-onset cases is less clear; however, immediate utility for early-life screening studies is evident. GRS2x offers a robust, unified method for T1D polygenic risk assessment for ongoing and future studies.

Acknowledgments. We gratefully acknowledge AoU participants for their contributions, without whom this research would not have been possible. We also thank the National Institutes of Health AoU for making available the participant data examined in this study. This research was conducted with use of the UKB resource under application no. 9072. We also acknowledge the contributions of the consortium working on the development of the NHLBI BioData Catalyst (BDC) ecosystem.

A.L.G. is an editor of Diabetes Care but was not involved in any of the decisions regarding review of the manuscript or its acceptance.

Funding. A.J.D. is funded by National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) grant K23DK140643. R.A.O. is funded by NIDDK (grants R01DK121843 and R01DK124395), Breakthrough T1D (4-SRA-2023-1375-M-B, 3-SRA-2022-1241-S-B, 2-SRA-2022-1261-S-B, 2-SRA-2022-1258-M-B, and 2-SRA-2024-1620-S-B), and The Leona M. and Harry B. Helmsley Charitable Trust (2016PG-T1D049, 2018PG-T1D049, 2103-05059, and G-2404-06858). H.I.O. is funded by NIDDK grant T32DK007217 and Stanford Maternal and Child Health Research Institute. A.K.M. is supported by the Foundation for the National Institutes of Health (FNIH) with funding from AMP CMD RFP 2 and National Human Genome Research Institute (NHGRI) grant U01HG011723. J.M.M. is supported by America Diabetes Association grant 11-22-ICTSPM-16 and by NHGRI grant U01HG011723 and NIDDK grants R01DK137993 and U01 DK140757, AMP CMD RFP 6 from the FNIH, and a Medical University of Bialystok grant from the Ministry of Science and Higher Education (Poland). M.A.R. is supported by NHGRI grant R01HG010140 and National Institute of Mental Health grant R01MH124244. M.S.U. is funded by Doris Duke Charitable Foundation Clinical Scientist Development Award 2022063, NHGRI grant U01HG011723, and NIDDK (grants U54DK118612, UM1DK126185, and U01DK140757). A.L.G. is funded by NIDDK (grants UM1-1DK126185 and P30 DK116074). S.A.S. is funded by the Larry L. Hillblom Foundation (2024-D-017-FEL). This study/research is funded/supported by the National Institute for Health and Care Research (NIHR) Exeter Biomedical Research Centre.

The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care (U.K.).

Duality of Interest. A.M.L. is funded by a PhD studentship from Randox Laboratories. A.M.L., R.A.O., and M.N.W. hold funding from Randox Laboratories to study translation of autoimmune genetic scores. R.A.O. is funded by Randox Laboratories. The spouse of A.L.G. is employed by Genentech and holds stock options in Roche. The University of Exeter has a licensing and royalty agreement with Randox Laboratories relating to a 10-variant T1D polygenic risk score. No other potential conflicts of interest relevant to this article were reported.

Author Contributions. A.M.L., R.A.O., M.N.W., A.L.G., and S.A.S. contributed to analysis and wrote the manuscript. R.A.O., A.L.G., and S.A.S. designed the study. A.J.D., H.I.O., D.P.F., K.A., A.K.M., J.M.M., M.A.R., and M.S.U. contributed to analysis and reviewed the manuscript. All authors contributed to the discussion and reviewed or edited the manuscript. S.A.S. is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Prior Presentation. Parts of this study were presented at the Human Islet Research Network 2025 annual meeting, Changing the Course in Type 1 Diabetes, Bethesda, MD, 13–14 January 2025.

Data and Resource Availability. GRS2x and the associated single nucleotide variant lists are available through the Polygenic Risk Score Extension for Diabetes Mellitus (PRSedm) package, which can be accessed on GitHub (https://github.com/sethsh7/prsedm), Python Package Index (PyPI) (https://pypi.org/project/prsedm/), and Anaconda (https://anaconda.org/sethsh7/prsedm). Notebooks for generating GRS2x using data from AoU and the UKB are also available on GitHub.

Handling Editors. The journal editor responsible for overseeing the review of the manuscript was Stephen S. Rich.

1.
Luckett
AM
,
Weedon
MN
,
Hawkes
G
, et al
.
Utility of genetic risk scores in type 1 diabetes
.
Diabetologia
2023
;
66
:
1589
1600
2.
Sims
EK
,
Besser
REJ
,
Dayan
C
, et al.;
NIDDK Type 1 Diabetes TrialNet Study Group
.
Screening for type 1 diabetes in the general population: a status report and perspective
.
Diabetes
2022
;
71
:
610
623
3.
Sharp
SA
,
Rich
SS
,
Wood
AR
, et al
.
Development and standardization of an improved type 1 diabetes genetic risk score for use in newborn screening and incident diagnosis
.
Diabetes Care
2019
;
42
:
200
207
4.
Halldorsson
BV
,
Eggertsson
HP
,
Moore
KHS
, et al.;
DBDS Genetic Consortium
.
The sequences of 150,119 genomes in the UK Biobank
.
Nature
2022
;
607
:
732
740
5.
All of Us Research Program Genomics Investigators
.
Genomic data in the All of Us Research Program
.
Nature
2024
;
627
:
340
346
Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered. More information is available at https://www.diabetesjournals.org/journals/pages/license.