OBJECTIVE

Despite extensive evidence for genetic susceptibility to diabetic nephropathy, the identification of susceptibility genes and their variants has had limited success. To search for genes that contribute to diabetic nephropathy, a genome-wide association scan was implemented on the Genetics of Kidneys in Diabetes collection.

RESEARCH DESIGN AND METHODS

We genotyped ∼360,000 single nucleotide polymorphisms (SNPs) in 820 case subjects (284 with proteinuria and 536 with end-stage renal disease) and 885 control subjects with type 1 diabetes. Confirmation of implicated SNPs was sought in 1,304 participants of the Diabetes Control and Complications Trial (DCCT)/Epidemiology of Diabetes Interventions and Complications (EDIC) study, a long-term, prospective investigation of the development of diabetes-associated complications.

RESULTS

A total of 13 SNPs located in four genomic loci were associated with diabetic nephropathy with P < 1 × 10−5. The strongest association was at the FRMD3 (4.1 protein ezrin, radixin, moesin [FERM] domain containing 3) locus (odds ratio [OR] = 1.45, P = 5.0 × 10−7). A strong association was also identified at the CARS (cysteinyl-tRNA synthetase) locus (OR = 1.36, P = 3.1 × 10−6). Associations between both loci and time to onset of diabetic nephropathy were supported in the DCCT/EDIC study (hazard ratio [HR] = 1.33, P = 0.02, and HR = 1.32, P = 0.01, respectively). We demonstratedexpression of both FRMD3 and CARS in human kidney.

CONCLUSIONS

We identified genetic associations for susceptibility to diabetic nephropathy at two novel candidate loci near the FRMD3 and CARS genes. Their identification implicates previously unsuspected pathways in the pathogenesis of this important late complication of type 1 diabetes.

Diabetic nephropathy is the leading contributor to end-stage renal disease (ESRD) in the U.S. (1). Clinically, diabetic nephropathy is manifest as a progressive disease process that advances through characteristic stages. It begins with microalbuminuria (leakage of small amounts of albumin into the urine) and progresses to overt proteinuria. In a large proportion of these patients, renal function declines and continues to deteriorate until ESRD is reached, and replacement therapy is indicated (2,4). Overall, ESRD develops in ∼20% of all patients with type 1 diabetes (5,6).

Despite evidence that genetic susceptibility plays a role in the development of diabetic nephropathy in type 1 diabetes (7,9), success in identifying the responsible genetic variants has been limited (10,11). This has been attributable, in part, to the small size of the DNA collections available to individual research groups and the narrow focus of the searches on candidate genes. Another challenge that has received little attention in previous studies is the possibility that successive stages of diabetic nephropathy are influenced by different genetic factors (12,13).

To conduct a statistically robust study that provides genome-wide coverage for detection of common variants that may have relatively small, but pathogenically significant, effect on risk of diabetic nephropathy in type 1 diabetes, the Genetics of Kidneys in Diabetes (GoKinD) collection was established (14). A genome-wide scan of this collection was supported by the Genetic Association Information Network (GAIN) initiative (15). This report presents 1) results of this genome-wide association scan in the GoKinD collection, 2) replication of the significant associations in this scan with time to onset of diabetes-associated complications (severe nephropathy) in theDiabetes Control and Complications Trial (DCCT)/Epidemiology of Diabetes Interventions and Complications (EDIC) study, and 3) characterization of expression of the identified candidate diabetic nephropathy genes in normal human cell lines.

Subjects for the GoKinD collection were recruited through two centers with different methods of ascertainment and recruitment (14). The George Washington University (GWU) Biostatistics Center coordinated the recruitment of volunteers (through mass media advertisement) living throughout the U.S. (excluding New England) and Canada to 1 of 27 clinical centers located across the U.S. and Canada. The Section of Genetics and Epidemiology at the Joslin Diabetes Center (JDC) recruited and examined patients of the Joslin Clinic from New England who were already enrolled in the Joslin Kidney Study on the Genetics of Diabetic Nephropathy, a clinic-based cohort study in which case subjects with diabetic nephropathy and a random sampling of eligible control subjects were identified and recruited (16).

A detailed description of the GoKinD collection has been published (14). Briefly, subjects enrolled in GoKinD had type 1 diabetes diagnosed before age 31, began insulin treatment within 1 year of their diagnosis, and were between 18 and 59 years of age at the time of enrollment. Participation in the DCCT/EDIC study was an exclusion criterion so that the two study populations would be independent. Case subjects with diabetic nephropathy had either persistent proteinuria, defined by a urinary albumin-to-creatinine ratio ≥300 μg/mg in two of the last three measurements taken at least 1 month apart, or ESRD (dialysis or renal transplant). Control subjects had type 1 diabetes for at least 15 years and normoalbuminuria, defined by an albumin-to-creatinine ratio <20 μg/mg in two of the last three measurements taken at least 1 month apart (if a third measurement was required, a value <40 μg/mg was necessary for inclusion), without ever having been treated with ACE inhibitors or angiotensin receptor blockers, and they were not being treated with antihypertensive medication at the time of recruitment into the study. For additional information regarding the definition of case and control subjects used in this analysis, refer to the report by Mueller et al. (14). In total, 1,879 subjects (935 case and 944 control subjects) were recruited into GoKinD. The GWU panel included 437 case subjects with diabetic nephropathy (58 with proteinuria and 379 with ESRD) and 446 control subjects; the JDC panel included 498 case subjects with diabetic nephropathy (268 with proteinuria and 230 with ESRD) and 498 control subjects. Further details are also provided in the supplementary information, which is available in an online appendix at http://diabetes.diabetesjournals.org/cgi/content/full/db08-1514/DC1.

Confirmation of our findings in the GoKinD collection was sought in genome-wide association data from the DCCT/EDIC study, a long-term, prospective investigation of the development of diabetes-associated complications (17,18). Of the original DCCT cohort recruited between 1983 and 1989, 1,375 subjects (95%) were retained in the EDIC follow-up study. Participants in EDIC underwent baseline examinations between 1994 and 1995 and have since participated in annual follow-up examinations to assess the development or progression of complications. As of EDIC year 12 (2005), this cohort had 16–22 years of follow-up, and 132 cases of severe nephropathy (proteinuria or ESRD) had been documented in 1,304 Caucasian DCCT/EDIC participants. This phenotype is the closest to the phenotype used in the GoKinD collection. Detailed clinical characteristics of the DCCT/EDIC study have been published (13,17,18). Additional details are also provided in the supplementary information.

Genotyping.

The GoKinD collection was genotyped on an Affymetrix 5.0 500K single nucleotide polymorphism (SNP) array by the GAIN genotyping laboratory at the Eli and Edythe L. Broad Institute (Cambridge, MA). A description of study genotyping is available in the supplementary information. Additionally, two SNPs, rs39075 and rs1888747, were genotyped in the GoKinD collection using TaqMan (Applied Biosystems, Foster City, CA) technology by the genetics core of the Diabetes and Endocrinology Research Center at the JDC in accordance with the manufacturer's protocols. DNA samples used for the genotyping of these SNPs in the GoKinD collection were obtained through the National Institute of Diabetes and Digestive and Kidney Diseases central repository (www.niddkrepository.org).

SNP quality control.

After internal quality control, the GAIN genotyping laboratory released genotypes for 467,144 SNPs. Several quality control metrics, including filters for minor allele frequency < 0.01, rejection of Hardy-Weinberg assumptions (P≤ 10−5), and differential rates of missing data (by case/control status) were applied to these data. After reconciliation of SNPs eliminated by these analyses, the resulting data contained 359,193 autosomal SNPs. More details are available in the supplementary information and supplementary Table 1.

Sample quality control and SNP imputation.

The application of quality control criteria reduced the GoKinD population from 1,879 to 1,705 individuals of European ancestry. Samples from the two GoKinD panels that constitute this sample are 379 GWU case subjects (49 with proteinuria and 330 with ESRD), 413 GWU control subjects, 441 JDC case subjects (235 with proteinuria and 206 with ESRD), and 472 JDC control subjects. The measured genotypes for these individuals were augmented by imputation of ungenotyped SNPs across two mega–base pair regions flanking each of the lead SNPs in the GoKinD collection (where linkage disequilibrium, as measured by r2, decayed to <0.20 for all lead SNPs). A total of 8,245 imputed HapMap SNPs across these four loci were included in our association analysis. Further details of sample quality control and imputation procedures are available in the supplementary information and supplementary Table 2.

Statistical analysis.

GoKinD samples were collected under two separate ascertainment protocols (JDC and GWU panels) so that tests for latent residual stratification were performed. Cochran-Armitage tests of trend for the JDC versus GWU control subjects, and JDC versus GWU case subjects, revealed an overdispersion in the test statistics for both control and case subjects compared with the complete null. The median genomic control parameters were estimated at λ = 1.13 for control subjects and λ = 1.097 for case subjects. Permutation analysis within the control subjects and within the case subjects resulted in a stratification significance of P < 10−3 for both case and control subjects. Therefore, the primary association analysis used in the study was a stratified test of association combining case-control tests of allele frequencies in JDC and GWU strata. Combined P values and odds ratios (ORs) were calculated using a Cochran-Mantel-Haenszel procedure. Homogeneity across strata was assessed using the Breslow-Day statistic. All genome-wide statistical association analyses were performed using PLINK and R (19). Further details of quality control procedures, software, statistical analysis and adjustments, and cluster plots are available in the supplementary information. Data from our analysis of the GoKinD collection are available for specific SNPs and/or genes upon request.

Genotyping and analysis of the DCCT/EDIC data.

Genotypes of the DCCT/EDIC study participants were generated with the Illumina Human1M Beadchip. Briefly, quality control measures resulted in 840,354 SNPs suitable for statistical analysis. Population substructure was assessed to ensure that all included samples were of European ancestry. Multivariate Cox proportional hazard analyses were performed on data from 1,304 Caucasian subjects using time to onset of severe nephropathy, defined by an albumin excretion rate (AER) >300 mg/24 h on at least two consecutive examinations or dialysis/renal transplant with prior persistent microalbuminuria (two consecutive AERs >30 mg/24 h) as the outcome phenotype (n = 132). Among those with severe nephropathy, 116 subjects developed only proteinuria (AER >300 mg/24 h), whereas 16 progressed to ESRD. The DCCT cohort (primary prevention versus secondary intervention), treatment (intensive versus conventional), and interaction between cohort and treatment were used as covariates in the analysis of the effect of an independent additive SNP genetic factor. This model was examined for all associated loci in GoKinD and subsequently tested for both statistical significance and the same direction of effect for associated alleles (20,22).

Gene expression.

The expression of candidate genes was examined in four primary human cell lines derived from cells that have been implicated in the pathogenesis of kidney complications (endothelial cells from the iliac artery, adult dermal fibroblasts, mesangial cells, and epithelial cells from proximal tubules) by quantitative real-time PCR. Sources of these cells, cell culture conditions, and protocols used in these experiments are available in the supplementary information.

Genome-wide association scans for genes associated with diabetic nephropathy in type 1 diabetes.

The application of metrics for SNP and sample quality resulted in the analysis of 359,193 autosomal SNPs and 1,705 GoKinD samples of European ancestry (885 control subjects and 820 case subjects) (see research design and methods and the supplementary information). Clinical characteristics of the JDC and GWU panels are summarized in Table 1. Because different ascertainment protocols were used by the JDC and GWU, the resulting data were found to exhibit significant stratification. As a result, the primary association analyses were conducted using a stratified test of association.

TABLE 1

Baseline clinical characteristics of the GoKinD collection

JDC GoKinD control subjectsJDC GoKinD case subjectsGWU GoKinD control subjectsGWU GoKinD case subjects
n 472 441 413 379 
Men/women 220/252 233/208 143/270 190/189 
Age at diagnosis of type 1 diabetes (years) 12.9 ± 7.5 11.5 ± 6.7 12.9 ± 7.2 12.1 ± 6.6 
Duration of type 1 diabetes (years)* 26.4 ± 8.0 27.0 ± 7.8 24.2 ± 7.3 25.9 ± 7.6 
Age at examination (years) 39.4 ± 6.5 42.0 ± 7.1 37.1 ± 8.8 44.4 ± 6.5 
Laser treatment (%) 22 84 10 85 
A1C(%) 7.5 ± 1.7 8.5 ± 1.6 7.4 ± 1.1 7.8 ± 1.4 
ACR in control subjects (μg/mg) 7.0 ± 3.7 — 6.3 ± 3.6 — 
Case subjects with proteinuria/ESRD  235/206  49/330 
ACR in subjects with proteinuria (μg/mg)  1,571 ± 1,534  1,282 ± 1,096 
ESRD duration (years)  7.5 ± 5.4  7.2 ± 5.6 
Kidney transplant (%)  86  95 
JDC GoKinD control subjectsJDC GoKinD case subjectsGWU GoKinD control subjectsGWU GoKinD case subjects
n 472 441 413 379 
Men/women 220/252 233/208 143/270 190/189 
Age at diagnosis of type 1 diabetes (years) 12.9 ± 7.5 11.5 ± 6.7 12.9 ± 7.2 12.1 ± 6.6 
Duration of type 1 diabetes (years)* 26.4 ± 8.0 27.0 ± 7.8 24.2 ± 7.3 25.9 ± 7.6 
Age at examination (years) 39.4 ± 6.5 42.0 ± 7.1 37.1 ± 8.8 44.4 ± 6.5 
Laser treatment (%) 22 84 10 85 
A1C(%) 7.5 ± 1.7 8.5 ± 1.6 7.4 ± 1.1 7.8 ± 1.4 
ACR in control subjects (μg/mg) 7.0 ± 3.7 — 6.3 ± 3.6 — 
Case subjects with proteinuria/ESRD  235/206  49/330 
ACR in subjects with proteinuria (μg/mg)  1,571 ± 1,534  1,282 ± 1,096 
ESRD duration (years)  7.5 ± 5.4  7.2 ± 5.6 
Kidney transplant (%)  86  95 

Clinical characteristics are the mean values ± SD for all Caucasian patients (n = 1,705) included in the current analysis after the application of quality control metrics (see supplemental information regarding sample quality control analysis and population substructure and ancestry analysis).

*The duration of type 1 diabetes in control subjects and in subjects with proteinuria is based on the duration at examination. Among ESRD case subjects, this is based on the duration of type 1 diabetes at the onset of ESRD. All other clinical characteristics are based on measurements performed at examination.

†Mean A1C values do not include data from case subjects that have undergone pancreas transplantation (11% of JDC case subjects and 58% of GWU case subjects);

‡percentages are of ESRD group.

Although no SNP achieved genome-wide significance (0.05/359,193 = 1.4 × 10−7), the primary association analysis identified 11 SNPs representing four distinct chromosomal regions with P < 1 × 10−5 (Fig. 1 and Table 2), which were considered for replication. The strongest association with diabetic nephropathy occurred on chromosome 9q with rs10868025 (OR = 1.45, P = 5.0 × 10−7). This SNP is located near the 5′ end of the 4.1 protein ezrin, radixin, moesin (FERM) domain–containing 3 (FRMD3) gene.

FIG. 1.

Summary of genome-wide association scan results in the GoKinD collection. The –log10P values calculated using the Cochran-Mantel-Haenszel method (adjusting for sex and GoKinD subcollection [JDC/GWU]) across the entire genome are shown for the combined GoKinD collection. The horizontal dashed line corresponds to a –log10P value = 5.0 (P = 1 × 10−5). SNPs shown in green (n = 11) exceed this threshold (because of the resolution of this image, some of the SNPs located on chromosome 13 [n = 7] appear indistinguishable).

FIG. 1.

Summary of genome-wide association scan results in the GoKinD collection. The –log10P values calculated using the Cochran-Mantel-Haenszel method (adjusting for sex and GoKinD subcollection [JDC/GWU]) across the entire genome are shown for the combined GoKinD collection. The horizontal dashed line corresponds to a –log10P value = 5.0 (P = 1 × 10−5). SNPs shown in green (n = 11) exceed this threshold (because of the resolution of this image, some of the SNPs located on chromosome 13 [n = 7] appear indistinguishable).

TABLE 2

Summary of SNPs associated with diabetic nephropathy in the GoKinD collection

Locus
Risk allele frequencies and P values for control and case subjects by panel
P values and ORs for combined analysis
GWU GoKinD
JDC GoKinD
SNPChromosomePosition (Mb)Nearest gene(s)Risk allele (non–risk allele)Control subjectsCase subjectsPControl subjectsCase subjectsPPOR (95% CI)
n — — — — 413 379 — 472 441 — — — 
rs39059 7p 29.2 CPVL/CHN2 A(G) 0.61 0.69 8.8 × 10−4 0.60 0.67 1.7 × 10−3 5.0 × 10−6 1.39 (1.20–1.61) 
rs39075 7p 29.2 CPVL/CHN2 G(A) 0.57 0.66 2.0 × 10−4 0.57 0.64 8.2 × 10−4 6.5 × 10−7 1.43 (1.23–1.64) 
rs1888747 9q 85.3 FRMD3 G(C) 0.68 0.73 3.6 × 10−3 0.66 0.74 4.4 × 10−5 6.3 × 10−7 1.45 (1.25–1.67) 
rs10868025 9q 85.4 FRMD3 A(G) 0.59 0.66 1.9 × 10−3 0.56 0.66 7.2 × 10−5 5.0 × 10−7 1.45 (1.25–1.67) 
rs739401 11p 3.0 CARS C(T) 0.46 0.54 4.7 × 10−4 0.49 0.55 3.6 × 10−3 6.4 × 10−6 1.36 (1.19–1.56) 
rs451041 11p 3.0 CARS A(G) 0.46 0.54 6.9 × 10−4 0.48 0.56 1.3 × 10−3 3.1 × 10−6 1.36 (1.19–1.56) 
rs1041466 13q 109.0 No gene G(A) 0.39 0.47 3.6 × 10−3 0.43 0.51 2.7 × 10−4 3.2 × 10−6 1.38 (1.20–1.58) 
rs1411766/rs17412858 13q 109.1 No gene A(G)/G(A) 0.31 0.39 8.5 × 10−4 0.32 0.40 6.4 × 10−4 1.8 × 10−6 1.41 (1.23–1.63) 
rs6492208/rs2391777§ 13q 109.1 No gene T(C)/G(A) 0.55 0.62 8.7 × 10−3 0.56 0.65 1.9 × 10−4 6.1 × 10−6 1.37 (1.20–1.59) 
rs7989848 13q 109.1 No gene A(G) 0.49 0.56 2.0 × 10−3 0.50 0.57 1.1 × 10−3 7.0 × 10−6 1.37 (1.19–1.56) 
rs9521445 13q 109.1 No gene A(C) 0.47 0.54 2.1 × 10−3 0.47 0.55 4.2 × 10−4 2.9 × 10−6 1.38 (1.20–1.58) 
Locus
Risk allele frequencies and P values for control and case subjects by panel
P values and ORs for combined analysis
GWU GoKinD
JDC GoKinD
SNPChromosomePosition (Mb)Nearest gene(s)Risk allele (non–risk allele)Control subjectsCase subjectsPControl subjectsCase subjectsPPOR (95% CI)
n — — — — 413 379 — 472 441 — — — 
rs39059 7p 29.2 CPVL/CHN2 A(G) 0.61 0.69 8.8 × 10−4 0.60 0.67 1.7 × 10−3 5.0 × 10−6 1.39 (1.20–1.61) 
rs39075 7p 29.2 CPVL/CHN2 G(A) 0.57 0.66 2.0 × 10−4 0.57 0.64 8.2 × 10−4 6.5 × 10−7 1.43 (1.23–1.64) 
rs1888747 9q 85.3 FRMD3 G(C) 0.68 0.73 3.6 × 10−3 0.66 0.74 4.4 × 10−5 6.3 × 10−7 1.45 (1.25–1.67) 
rs10868025 9q 85.4 FRMD3 A(G) 0.59 0.66 1.9 × 10−3 0.56 0.66 7.2 × 10−5 5.0 × 10−7 1.45 (1.25–1.67) 
rs739401 11p 3.0 CARS C(T) 0.46 0.54 4.7 × 10−4 0.49 0.55 3.6 × 10−3 6.4 × 10−6 1.36 (1.19–1.56) 
rs451041 11p 3.0 CARS A(G) 0.46 0.54 6.9 × 10−4 0.48 0.56 1.3 × 10−3 3.1 × 10−6 1.36 (1.19–1.56) 
rs1041466 13q 109.0 No gene G(A) 0.39 0.47 3.6 × 10−3 0.43 0.51 2.7 × 10−4 3.2 × 10−6 1.38 (1.20–1.58) 
rs1411766/rs17412858 13q 109.1 No gene A(G)/G(A) 0.31 0.39 8.5 × 10−4 0.32 0.40 6.4 × 10−4 1.8 × 10−6 1.41 (1.23–1.63) 
rs6492208/rs2391777§ 13q 109.1 No gene T(C)/G(A) 0.55 0.62 8.7 × 10−3 0.56 0.65 1.9 × 10−4 6.1 × 10−6 1.37 (1.20–1.59) 
rs7989848 13q 109.1 No gene A(G) 0.49 0.56 2.0 × 10−3 0.50 0.57 1.1 × 10−3 7.0 × 10−6 1.37 (1.19–1.56) 
rs9521445 13q 109.1 No gene A(C) 0.47 0.54 2.1 × 10−3 0.47 0.55 4.2 × 10−4 2.9 × 10−6 1.38 (1.20–1.58) 

The most strongly associated SNPs from the combined analysis of the GWU and JDC GoKinD panels are presented along with the risk allele frequencies and P values (calculated using the Cochran-Mantel-Haenszel method, adjusting for sex, between case and control subjects within each collection) for each separate collection. Combined P values and ORs were calculated using the Cochran-Mantel-Haenszel method. Chromosomal locations, SNP positions, and gene annotations are in reference to NCBI Build 36.1. A summary of the genotype frequencies for the most strongly associated SNPs in the GoKinD collection are presented in supplementary Table 3.

†rs39075 and rs1888747 were identified through imputation and genotyped using Taqman assays in the GoKinD collection;

‡rs1411766 and rs17412858 were both genotyped on the Affymetrix array and are in complete linkage disequilibrium (r2 = 1.0);

§rs6492208 and rs2391777 were both genotyped on the Affymetrix array and are in complete linkage disequilibrium (r2 = 1.0).

Three additional genomic regions located on chromosomes 7p, 11p, and 13q were also associated with diabetic nephropathy. The rs39059 SNP (OR = 1.39, P = 5.0 × 10−6) localizes to the first intron of CHN2 (β-chimerin) isoform 2 and upstream of an alternatively spliced CPVL (serine carboxypeptidase vitellogenic-like) transcript on chromosome 7p. The rs451041 SNP (OR = 1.36, P = 3.1 × 10−6) is located on chromosome 11p in an intronic region of the CARS (cysteinyl-tRNA synthetase) gene. And, finally, the region bounded by rs1411766/rs1742858 (OR = 1.41, P = 1.8 × 10−6) is located in a 42 kb intergenic region on chromosome 13q.

Analyses of the imputed SNPs in our lead loci identified 11 additional SNPs that were highly correlated withthe original associations (P < 1 × 10−5). Of these, two were more strongly associated with diabetic nephropathy than our lead genotyped SNPs. Imputed SNP rs1888747 (chromosome 9q), which is in partial linkage disequilibrium (r2 = 0.81) with rs10868025, was more strongly associated with diabetic nephropathy than the original SNP (P = 4.7 × 10−7) (Fig. 2,B). Similarly, two imputed SNPs in the 7p region (rs39075 and rs39076) were also more strongly associated than the original SNP in that region (rs39059) (Fig. 2,A). Both imputed SNPs were genotyped in the GoKinD samples, and the associations with the imputed data were confirmed (rs39075, P = 6.5 × 10−7; and rs1888747, P = 6.3 × 10−7) (Table 2).

FIG. 2.

Summary of genome-wide association results for the chromosome 7p, 9q, 11p, and 13q loci. A: Genome-wide association scan and imputed data for the chromosome 7p locus. rs39059 (solid red triangle) is located at position IVS1 + 21350 relative to exon 1 of CHN2 isoform 2 and is in tight linkage disequilibrium with rs39075 (r2 = 0.96), located at position IVS1 + 42572. rs39059 and rs39075 reside −69,318 and −90,540 kb, respectively, upstream of CPVL isoforms 1 and 2. A third alternate transcript (isoform 3) is predicted for CPVL and contains an exon that extends to intron 1 of CHN2. rs39059 and rs39075 are located at positions −20579 and −41801, respectively, relative to this transcript. ▲, SNPs genotyped on the Affymetrix array (n = 163); △, imputed SNPs (n = 694). *Imputed SNP rs39075 was genotyped in the GoKinD samples to confirm the imputation. B: Genome-wide association scan and imputed data for the chromosome 9q locus. A total of 100 genotyped SNPs from the Affymetrix array data and 450 imputed SNPs are shown. rs10868025 (solid red triangle) is located at position −10829 relative to FRMD3's transcription start site. rs10868025 is in complete linkage disequilibrium (r2 = 1.0) and only 253 bp from imputed SNP rs13289150 (△ superimposed on rs10868025). rs1888747, located at position −2204, is in partial linkage disequilibrium (r2 = 0.81) with rs10868025. *Imputed SNP rs1888747 was genotyped in the GoKinD samples to confirm the imputation. C: Genome-wide association scan and imputed data for the chromosome 11p locus. A total of 33 genotyped SNPs from the Affymetrix array data and 190 imputed SNPs are shown. rs739401 and rs451041 (solid red triangles) are in strong linkage disequilibrium (r2 = 0.97). rs739401 is located in intron 16 (isoforms a and c)/17 (isoforms b and d) of the CARS gene (position IVS16 + 687/IVS17 + 687). rs451041 is located in intron 4 (isoforms a and c)/5 (isoforms b and d), position IVS4 − 203/IVS5 − 203). D: Genome-wide association scan and imputed data for the chromosome 13q locus. A total of 68 genotyped SNPs from the Affymetrix array data and 268 imputed SNPs are shown. Seven lead SNPs (rs1041466, rs1411766, rs17412858, rs6492208, rs2391777, rs7989848, and rs9521445) from this region are indicated in red. rs1411766 and rs17412858 are in complete linkage disequilibrium (r2 = 1.0). Similarly, rs6492208 and rs2391777 are in complete linkage disequilibrium (r2 = 1.0). rs7989848 and rs9521445 are in strong linkage disequilibrium (r2 = 0.87), whereas only modest linkage disequilibrium exists between all other SNP pairs (r2 = 0.30–0.65). The two nearest genes are MYO16 and IRS2, located ∼384 kb centromeric and 120 kb telomeric of this region, respectively.

FIG. 2.

Summary of genome-wide association results for the chromosome 7p, 9q, 11p, and 13q loci. A: Genome-wide association scan and imputed data for the chromosome 7p locus. rs39059 (solid red triangle) is located at position IVS1 + 21350 relative to exon 1 of CHN2 isoform 2 and is in tight linkage disequilibrium with rs39075 (r2 = 0.96), located at position IVS1 + 42572. rs39059 and rs39075 reside −69,318 and −90,540 kb, respectively, upstream of CPVL isoforms 1 and 2. A third alternate transcript (isoform 3) is predicted for CPVL and contains an exon that extends to intron 1 of CHN2. rs39059 and rs39075 are located at positions −20579 and −41801, respectively, relative to this transcript. ▲, SNPs genotyped on the Affymetrix array (n = 163); △, imputed SNPs (n = 694). *Imputed SNP rs39075 was genotyped in the GoKinD samples to confirm the imputation. B: Genome-wide association scan and imputed data for the chromosome 9q locus. A total of 100 genotyped SNPs from the Affymetrix array data and 450 imputed SNPs are shown. rs10868025 (solid red triangle) is located at position −10829 relative to FRMD3's transcription start site. rs10868025 is in complete linkage disequilibrium (r2 = 1.0) and only 253 bp from imputed SNP rs13289150 (△ superimposed on rs10868025). rs1888747, located at position −2204, is in partial linkage disequilibrium (r2 = 0.81) with rs10868025. *Imputed SNP rs1888747 was genotyped in the GoKinD samples to confirm the imputation. C: Genome-wide association scan and imputed data for the chromosome 11p locus. A total of 33 genotyped SNPs from the Affymetrix array data and 190 imputed SNPs are shown. rs739401 and rs451041 (solid red triangles) are in strong linkage disequilibrium (r2 = 0.97). rs739401 is located in intron 16 (isoforms a and c)/17 (isoforms b and d) of the CARS gene (position IVS16 + 687/IVS17 + 687). rs451041 is located in intron 4 (isoforms a and c)/5 (isoforms b and d), position IVS4 − 203/IVS5 − 203). D: Genome-wide association scan and imputed data for the chromosome 13q locus. A total of 68 genotyped SNPs from the Affymetrix array data and 268 imputed SNPs are shown. Seven lead SNPs (rs1041466, rs1411766, rs17412858, rs6492208, rs2391777, rs7989848, and rs9521445) from this region are indicated in red. rs1411766 and rs17412858 are in complete linkage disequilibrium (r2 = 1.0). Similarly, rs6492208 and rs2391777 are in complete linkage disequilibrium (r2 = 1.0). rs7989848 and rs9521445 are in strong linkage disequilibrium (r2 = 0.87), whereas only modest linkage disequilibrium exists between all other SNP pairs (r2 = 0.30–0.65). The two nearest genes are MYO16 and IRS2, located ∼384 kb centromeric and 120 kb telomeric of this region, respectively.

If the etiology of diabetic nephropathy involves the interaction of a locus with the cumulative effect of hyperglycemia, the association of the locus with diabetic nephropathy can vary according to diabetes duration at diabetic nephropathy onset, such that it is strongest in early-appearing case subjects and diminishes in later ones, even reversing in direction in very late-appearing case subjects (23). We examined the SNPs in Table 2 according to diabetes duration by stratifying case and control subjects across tertiles of diabetes duration (at the onset of ESRD or at enrollment into GoKinD for proteinuria patients and control subjects). The strength of the associations was consistent across these strata (data not shown).

Additionally, if a locus influences mortality risk, the high mortality experienced by patients with ESRD would alter its association with diabetic nephropathy according to the duration of survival with ESRD and may mask the effect of a diabetic nephropathy risk allele or produce a false association. For this reason, we also analyzed the lead SNPs in Table 2 according to duration of ESRD. For each of these SNPs, the ORs were consistent across tertiles of ESRD duration (supplementary Table 4), a pattern consistent with the absence of survival bias. However, the current study is underpowered to formally exclude the presence of such effects.

Confirmation of associated type 1 diabetic nephropathy SNPs in the DCCT/EDIC study.

Data from a genome-wide association scan of the DCCT/EDIC study were used to assess whether genome regions identified in the GoKinD collection were associated with advanced diabetic nephropathy in an independent collection. Among the 11 SNPs identified in GoKinD, eight were included on the Illumina array used in the DCCT/EDIC study (Table 3). The three SNPs not included on this platform, rs39059, rs739401, and rs9521445, were in strong linkage disequilibrium (r2 ≥ 0.87) with rs39075, rs451041, and rs7989848, respectively. Analysis of time to onset of severe nephropathy confirmed the significant associations with diabetic nephropathy in GoKinD for rs1888746 (FRMD3, P = 0.02), rs13289150 (FRMD3, P = 0.05), and rs451041 (CARS, P = 0.01).

TABLE 3

HRs for the development of severe nephropathy during 16–22 years of follow-up in the DCCT/EDIC study for SNPs associated with diabetic nephropathy in GoKinD

Locus
P values and HRs in DCCT/EDIC collection severe nephropathy
SNPChromosomePosition (Mb)Nearest gene(s)Risk allele*Frequency of risk alleleP (one-sided)HR
rs39075 7p 29.2 CPVL/CHN2 0.60 NS 0.85 
rs1888746 9q 85.3 FRMD3 0.70 0.02 1.33 
rs13289150§ 9q 85.4 FRMD3 0.62 0.05 1.23 
rs451041 11p 3.0 CARS 0.51 0.01 1.32 
rs1041466 13q 109.0 No gene 0.47 0.11 1.22 
rs1411766 13q 109.1 No gene 0.36 0.11 1.17 
rs6492208 13q 109.1 No gene 0.61 NS 0.90 
rs7989848 13q 109.1 No gene 0.53 NS 0.93 
Locus
P values and HRs in DCCT/EDIC collection severe nephropathy
SNPChromosomePosition (Mb)Nearest gene(s)Risk allele*Frequency of risk alleleP (one-sided)HR
rs39075 7p 29.2 CPVL/CHN2 0.60 NS 0.85 
rs1888746 9q 85.3 FRMD3 0.70 0.02 1.33 
rs13289150§ 9q 85.4 FRMD3 0.62 0.05 1.23 
rs451041 11p 3.0 CARS 0.51 0.01 1.32 
rs1041466 13q 109.0 No gene 0.47 0.11 1.22 
rs1411766 13q 109.1 No gene 0.36 0.11 1.17 
rs6492208 13q 109.1 No gene 0.61 NS 0.90 
rs7989848 13q 109.1 No gene 0.53 NS 0.93 

Data are from multivariate Cox proportional hazard analysis of time to onset of severe nephropathy. As of 2005, the number of severe nephropathy cases was 132 (vs. 1,172 censored). Chromosomal locations, SNP positions, and gene annotations are in reference to NCBI Build 36.1.

*The risk alleles that are presented are in reference to those identified in the GoKinD collection;

†one-sided P values (consistent with the current “best practices” for replication in GWA scans) (20,22) are used to test for the same direction of effect as in the GoKinD collection;

‡rs1888746 was genotyped on an Illumina array in DCCT/EDIC and is in complete linkage disequilibrium (r2 = 1.0) with rs1888747 (genotyped using a Taqman assay in GoKinD);

§rs13289150 was genotyped on an Illumina array in DCCT/EDIC and is in complete linkage disequilibrium (r2 = 1.0) with rs10868025 (genotyped on an Affymetrix array in GoKinD).

Analysis of candidate diabetic nephropathy geneexpression.

Previous studies, as well as publicly available gene expression data (www.ncbi.nlm.nih.gov/geo), have shown that genes closest to the lead SNPs identified in GoKinD are expressed in a variety of human tissues, including kidney (24,26). To further test whether these candidate genes may be involved in the development of diabetic nephropathy, we examined their expression in cell lines relevant to this disease. The expression of CHN2, CPVL, FRMD3, and CARS was examined in four primary human cell lines: iliac artery endothelial cells, adult dermal fibroblasts, mesangial cells, and renal proximal tubule cells. Our data show that CARS expression was high in all four of the cell lines that we examined (Table 4). FRMD3 expression was also detected in each cell type, with its highest expression being observed in renal proximal tubule cells. Of the two candidate diabetic nephropathy genes located in chromosome 7p region, neither was detected in mesangial cells, whereas CPVL expression was greatest in proximal tubule cells.

TABLE 4

Relative gene expression of novel candidate diabetic nephropathy susceptibility genes in primary human cell lines

Cell lineCPVL Ct valuesCHN2 Ct valuesFRMD3 Ct valuesCARS Ct values
Endothelial cells, iliac 17.3 ± 0.7 12.5 ± 0.1 12.5 ± 0.3 8.4 ± 0.4 
Fibroblasts 15.7 ± 0.1 n.e. 11.9 ± 0.7 6.5 ± 0.4 
Mesangial cells n.e. n.e. 12.9 ± 0.5 7.2 ± 0.1 
Proximal tubules 8.6 ± 0.1 13.1 ± 0.1 10.2 ± 0.2 8.0 ± 0.1 
Cell lineCPVL Ct valuesCHN2 Ct valuesFRMD3 Ct valuesCARS Ct values
Endothelial cells, iliac 17.3 ± 0.7 12.5 ± 0.1 12.5 ± 0.3 8.4 ± 0.4 
Fibroblasts 15.7 ± 0.1 n.e. 11.9 ± 0.7 6.5 ± 0.4 
Mesangial cells n.e. n.e. 12.9 ± 0.5 7.2 ± 0.1 
Proximal tubules 8.6 ± 0.1 13.1 ± 0.1 10.2 ± 0.2 8.0 ± 0.1 

The relative gene expression of CPVL, CHN2, FRMD3, and CARS was determined in four primary human cell lines using real-time PCR. Relative gene expression of each gene was calculated in reference to a normalization control (β-actin) and is presented as the mean ΔCt (Ctvalue from each gene minus Ct value from the normalization control) and SD from three separate experiments. Mean ΔCt = 0 equals high relative gene expression (i.e., expression similar to β-actin). n.e., not expressed (mean ΔCt >40.0).

In this report, we describe the results of a genome-wide association scan in the GoKinD collection to identify loci associated with risk of diabetic nephropathy in type 1 diabetes. The most significant associations were identified with variants located within four distinct chromosomal regions. Although the biology underlying these associations remains to be elucidated, they implicate CHN2/CPVL, FRMD3, CARS, and an intergenic region on chromosome 13q as novel genes/genetic regions involved in the pathogenesis of diabetic nephropathy. None of these loci overlap with previously reported associations between candidate genes and the development of any stage of diabetic nephropathy (10,11). Importantly, replication in a Cox proportional hazard analysis of the associations at the FRMD3 and CARS loci with time to the onset of severe nephropathy in the DCCT/EDIC study bolsters the significance of these two findings; that two studies having such different designs (one a case-control study and the other a prospective cohort study) yielded similar ORs strengthens confidence in this conclusion.

FRMD3 encodes the 4.1O protein, a structural protein with unknown function and a member of the 4.1 family of proteins (26). Members of the 4.1 protein family have well-characterized roles as cytoskeletal proteins, maintaining both cellular shape and form, in a variety of cell types, including mouse nephron (27,28). Although membership of the 4.1O protein in this family has recently been questioned, it does contain a FERM domain, which is a module that is integral in maintaining cell integrity through its interactions with transmembrane proteins and actin filaments (29,30). FRMD3 is detectable in adult ovaries as well as in fetal skeletal muscle, brain, and thymus (26). Our data extend the expression profile of FRMD3 to specifically include mesangial and proximal tubular cells. Interestingly, among 18 genes that contain FERM domains, including several members of the 4.1 protein family, we identified nominally significant associations with diabetic nephropathy for SNPs located in eight of these genes (supplementary Table 5), including FARP2 (FERM, RhoGEF and pleckstrin domain protein 2; P = 3.0 × 10−4) and EPB41L2 (erythrocyte membrane protein band 4.1-like 2; P = 2.3 × 10−4). Although these findings require further study, including replication in additional collections, it is interesting to speculate that these data may point to the involvement of new, previously unsuspected pathways in the pathogenesis of diabetic nephropathy.

The CARS gene encodes cysteinyl-tRNA synthetase, one of several aminoacyl-tRNA synthetases (ARSs) that have been identified in humans (31,32). ARSs are important regulators of intracellular amino acid concentrations and protein biosynthesis in both the cytoplasm and mitochondria (a process facilitated by specialized mitochondria-specific and bifunctional ARSs). In the initial steps of protein translation, the function of these enzymes is to attach amino acids to their cognate tRNA molecules. To date, both autosomal dominant and recessive mutations in ARS-encoding genes have been identified only in neurodegenerative disease, including missense changes in glycyl-tRNA synthetase (GARS) and both missense mutations and in-frame deletions in tyrosyl-tRNA synthetase (YARS) in Charcot-Marie-Tooth disease (32).

CARS has been implicated in cystinosis, an autosomal recessive renal tubule disorder caused by the accumulation of free cystine in cellular lysosomes (33,34). A recent study identified defects in lysosomal cystine transport as the primary cause of the disease (35). However, ESRD is prominent in this disorder, and such an outcome may be due to vulnerability of specific renal cells to damage by excess cystine. Interesting, in this light, is the observation that of all the associated SNPs, only those in the CARS locus were associated primarily with ESRD (supplementary Table 4). CARS is expressed in mesangial and proximal tubule cells. Further work is needed to characterize the role of CARS in the pathway that is involved in the development of ESRD in diabetes. Similar to the set of genes containing FERM domains, analysis of 21 ARS genes identified nominally significant associations with diabetic nephropathy for SNPs located in four members of this class of genes (supplementary Table 6), with the most significant association (P = 9.1 × 10−3) occurring at the TARS (threonyl-tRNA synthetase) locus.

Two additional loci were strongly associated with diabetic nephropathy in both panels of the GoKinD collection. Of the two genes located on chromosome 7p, CPVL, a carboxypeptidase that is highly expressed in the kidney and, more specifically, in proximal tubules, is a particularly interesting candidate gene. Other carboxypeptidases, such as ACE and bradykinin, are important regulators of renal hemodynamics and have previously been implicated in the pathogenesis of diabetic nephropathy (36,37). The last diabetic nephropathy–associated locus involves multiple SNPs within a 33 kb haplotype block on chromosome 13q. Previously, genomic deletions of this locus have been linked to congenital renal abnormalities (38). The two genes closest to the associated SNPs, MYO16 (myosin heavy-chain Myr 8) and IRS2 (insulin receptor substrate 2), are located ∼384 kb centromeric and 120 kb telomeric of this region, respectively. Although there is little linkage disequilibrium between the variants within this block and those in the vicinity of either MYO16 or IRS2, the multiple signals identified in this region give credence to the association detected in our analysis. Additional experiments are needed to characterize the nature of these associations further.

The findings presented in our study contribute to understanding the genetic susceptibility of diabetic nephropathy in type 1 diabetes. As has been reported for other complex genetic disorders, no single major gene that contributes to an increased risk of disease emerged (20,39). However, given the incomplete coverage of the genome by the genotyping platform and the suboptimal study design (prevalent rather than incident cases of ESRD), detection of any existing major gene effect was not guaranteed. For example, because most of the case subjects with ESRD had survived many years on dialysis or with a kidney transplant, a disease allele that not only increased susceptibility to diabetic nephropathy but also increased mortality in patients with ESRD could go undetected. Appreciably, the SNPs that we identified in the GoKinD collection were mortality neutral (supplementary Table 4). The optimal study design for detecting all disease loci, regardless of their effect on mortality, would be a large cohort of incident ESRD case subjects. Such a data set is currently unavailable.

There are other limitations to this study as well. The GoKinD collection is heavily weighted with case subjects with ESRD; thus, the small number of case subjects with proteinuria limited our ability to detect variants primarily associated with the risk of proteinuria. Second, because of the limited power of the DCCT/EDIC study and the need to contain inflation of the α-error in seeking replication for multiple SNPs in this dataset, our replication efforts refrained from considering SNPs less significant than P = 1 × 10−5. It is certainly possible that additional variants among those not meeting this threshold may truly be associated with diabetic nephropathy; however, given these limitations, these variants remain to be identified. Similarly, despite replication in the DCCT/EDIC cohort, we acknowledge that positive associations at both the FRMD3 and CARS loci require additional study to be certain of these findings. Third, although the locations of the variants confirmed in this study implicate both FRMD3 and CARS as novel genes involved in the pathogenesis of diabetic nephropathy, the underlying mechanisms of disease of these associations need to be elucidated. And, finally, although confirmation in DCCT/EDIC has been achieved for variations near FRMD3 and CARS, additional cohorts, particularly non-Caucasian, would be useful to further characterize the pathogenic role of these, and other, candidate genes identified in the GoKinD collection.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

This work was supported by the following grants from the National Institutes of Health (NIH): DK77532 (to A.S.K.), DK36836 (to the Genetics Core of the Diabetes and Endocrinology Research Center at the JDC), DK-62204 (to A.D.P.), and DK-077510 (to A.D.P.) and from the Foundation for NIH (FNIH): 06GAIN0 (to J.H.W.). We acknowledge JDC NIH Training Grant T32 DK007260-31 (to M.G.P.) and a CIHR (Canadian Institutes of Health Research) scholarship (to L.M.) and senior investigator award (to S.B.B.). We also acknowledge support from the Canadian Network of Centres of Excellence in Mathematics (to S.B.B.). The GoKinD study was conducted by the GoKinD Investigators and supported by the Juvenile Diabetes Research Foundation (JDRF) and by funding from the Centers for Disease Control (CDC; PL 105-33, 106-554, and 107-360 administered by the National Institute of Diabetes and Digestive and Kidney Diseases [NIDDK]). The GoKinD collection of DNA was genotyped through the GAIN program, with the support of the FNIH and NIDDK. The GAIN database was accessed through the NCBI (National Center for Biotechnology Information). The DCCT/EDIC study is supported by contracts with the Division of Diabetes, Endocrinology, and Metabolic Diseases of the NIDDK and the General Clinical Research Centers Program, National Center for Research Resources. The content and conclusions presented in this manuscript do not necessarily reflect the opinions or views of the JDRF, CDC, FNIH, NIDDK, or NCBI.

No potential conflicts of interest relevant to this article were reported.

1.
Jones
CA
,
Krolewski
AS
,
Rogus
J
, et al
:
Epidemic of end-stage renal disease in people with diabetes in the United States population: do we know the cause?
Kidney Int
2005
; 
67
:
1684
1691
2.
Perkins
BA
,
Ficociello
LH
,
Silva
KH
, et al
:
Regression of microalbuminuria in type 1 diabetes
.
N Engl J Med
2003
; 
348
:
2285
2293
3.
Parving
HH
,
Mauer
M
,
Ritz
E
:
Diabetic nephropathy
. In
Brenner and Rector's The Kidney
. 7th ed.
Brenner
BM
Ed.
Philadelphia, PA
,
Elsevier
,
2004
, p.
1777
1818
4.
Perkins
BA
,
Ficociello
LH
,
Ostrander
BE
, et al
:
Microalbuminuria and the risk for early progressive renal function decline in type 1 diabetes
.
J Am Soc Nephrol
2007
; 
18
:
1353
1361
5.
Krolewski
M
,
Eggers
PW
,
Warram
JH
:
Magnitude of end-stage renal disease in IDDM: a 35 year follow-up study
.
Kidney Int
1996
; 
50
:
2041
2046
6.
Nishimura
R
,
Dorman
JS
,
Bosnyak
Z
, et al
:
Incidence of ESRD and survival after renal replacement therapy in patients with type 1 diabetes: a report from the Allegheny County registry
.
Am J Kidney Dis
2003
; 
42
:
117
124
7.
Seaquist
ER
,
Goetz
FC
,
Rich
S
, et al
:
Familial clustering of diabetic kidney disease: evidence for genetic susceptibility to diabetic nephropathy
.
N Engl J Med
1989
; 
320
:
1161
1165
8.
Quinn
M
,
Angelico
MC
,
Warram
JH
, et al
:
Familial factors determine the development of diabetic nephropathy in patients with IDDM
.
Diabetologia
1996
; 
39
:
940
945
9.
Diabetes Control and Complications Trial Research Group
.
Clustering of long-term complications in families with diabetes in the Diabetes Control and Complications Trial
.
Diabetes
1997
; 
46
:
1829
1839
10.
Ng
DP
,
Krolewski
AS
:
Molecular genetic approaches for studying the etiology of diabetic nephropathy
.
Curr Mol Med
2005
; 
5
:
509
525
11.
Freedman
BI
,
Bostrom
M
,
Daeihagh
P
, et al
:
Genetic factors in diabetic nephropathy
.
Clin J Am Soc Nephrol
2007
; 
2
:
1306
1316
12.
Placha
G
,
Canani
LH
,
Warram
JH
, et al
:
Evidence for different susceptibility genes for proteinuria and ESRD in type 2 diabetes
.
Adv Chronic Kidney Dis
2005
; 
12
:
155
169
13.
Al-Kateb
H
,
Boright
AP
,
Mirea
L
, et al
:
Multiple superoxide dismutase 1/splicing factor serine alanine 15 variants are associated with the development and progression of diabetic nephropathy: the Diabetes Control and Complications Trial/Epidemiology of Diabetes Interventions and Complications Genetics study
.
Diabetes
2008
; 
57
:
218
228
14.
Mueller
PW
,
Rogus
JJ
,
Cleary
PA
, et al
:
Genetics of Kidneys in Diabetes (GoKinD) study: a genetics collection available for identifying genetic susceptibility factors for diabetic nephropathy in type 1 diabetes
.
J Am Soc Nephrol
2006
; 
17
:
1782
1790
15.
GAIN Collaborative Research Group
.
New models of collaboration in genome-wide association studies: the Genetic Association Information Network
.
Nat Genet
2007
; 
39
:
1045
1051
16.
Wanic
K
,
Placha
G
,
Dunn
J
, et al
:
Exclusion of polymorphisms in carnosinase genes (CNDP1 and CNDP2) as a cause of diabetic nephropathy in type 1 diabetes: results of large case-control and follow-up studies
.
Diabetes
2008
; 
57
:
2547
2551
17.
Diabetes Control and Complications Trial Research Group
.
The effect of intensive treatment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus
.
N Engl J Med
1993
; 
329
:
977
986
18.
Epidemiology of Diabetes Interventions and Complications Research Group
.
Epidemiology of Diabetes Interventions and Complications (EDIC): design, implementation, and preliminary results of a long-term follow-up of the Diabetes Control and Complications Trial cohort
.
Diabetes Care
1999
; 
22
:
99
111
19.
Purcell
S
,
Neale
B
,
Todd-Brown
K
, et al
:
PLINK: a tool set for whole-genome association and population-based linkage analyses
.
Am J Hum Genet
2007
; 
81
:
559
575
20.
Willer
CJ
,
Sanna
S
,
Jackson
AU
, et al
:
Newly identified loci that influence lipid concentrations and risk of coronary artery disease
.
Nat Genet
2008
; 
40
:
161
169
21.
Lasky-Su
J
,
Lyon
HN
,
Emilsson
V
, et al
:
On the replication of genetic associations: timing can be everything!
Am J Hum Genet
2008
; 
82
:
849
858
22.
Weedon
MN
,
Lango
H
,
Lindgren
CM
, et al
:
Genome-wide association analysis identifies 20 loci that influence adult height
.
Nat Genet
2008
; 
40
:
575
583
23.
Rogus
JJ
,
Warram
JH
,
Krolewski
AS
:
Genetic studies of late diabetic complications: the overlooked importance of diabetes duration before complication onset
.
Diabetes
2002
; 
51
:
1655
1662
24.
Yuan
S
,
Miller
DW
,
Barnett
GH
, et al
:
Identification and characterization of human beta 2-chimaerin: association with malignant transformation in astrocytoma
.
Cancer Res
1995
; 
55
:
3456
3461
25.
Mahoney
JA
,
Ntolosi
B
,
DaSilva
RP
, et al
:
Cloning and characterization of CPVL, a novel serine carboxypeptidase, from human macrophages
.
Genomics
2001
; 
72
:
243
251
26.
Ni
X
,
Ji
C
,
Cao
G
, et al
:
Molecular cloning and characterization of the protein 4.1O gene, a novel member of the protein 4.1 family with focal expression in ovary
.
J Hum Genet
2003
; 
48
:
101
106
27.
Hoover
KB
,
Bryant
PJ
:
The genetics of the protein 4.1 family: organizers of the membrane and cytoskeleton
.
Curr Opin Cell Biol
2000
; 
12
:
229
234
28.
Ramez
M
,
Blot-Chabaud
M
,
Cluzeaud
F
, et al
:
Distinct distribution of specific members of protein 4.1 gene family in the mouse nephron
.
Kidney Int
2003
; 
63
:
1321
1337
29.
Chishti
AH
,
Kim
AC
,
Marfatia
SM
, et al
:
The FERM domain: a unique module involved in the linkage of cytoplasmic proteins to the membrane
.
Trends Biochem Sci
1998
; 
23
:
281
282
30.
Baines
AJ
:
A FERM-adjacent (FA) region defines a subset of the 4.1 superfamily and is a potential regulator of FERM domain function
.
BMC Genomics
2006
; 
7
:
85
31.
Sheppard
K
,
Yuan
J
,
Hohn
MJ
, et al
:
From one amino acid to another: tRNA-dependent amino acid biosynthesis
.
Nucleic Acid Res
2008
; 
36
:
1813
1825
32.
Antonellis
A
,
Green
ED
:
The role of aminoacyl-tRNA synthetases in genetic diseases
.
Annu Rev Genomics Hum Genet
2008
; 
9
:
87
107
33.
Waterson
JR
,
Winter
WP
,
Schmickel
RD
:
Cysteine activation in cultured cystinotic cells: the specific activity of cysteinyl-tRNA synthetase and tRNACys and the determination of the Michaelis-Menten constants for cysteinyl-tRNA synthetase
.
J Clin Invest
1974
; 
54
:
182
187
34.
Gahl
WA
,
Thoene
JG
,
Schneider
JA
:
Cystinosis
.
N Engl J Med
2002
; 
347
:
111
121
35.
Town
M
,
Jean
G
,
Cherqui
S
, et al
:
A novel gene encoding an integral membrane protein is mutated in nephropathic cystinosis
.
Nat Genet
1998
; 
18
:
319
324
36.
Adler
S
:
Diabetic nephropathy: linking histology, cell biology, and genetics
.
Kidney Int
2004
; 
66
:
2095
2106
37.
Riad
A
,
Zhuo
JL
,
Schultheiss
HP
, et al
:
The role of the renal kallikrein-kinin system in diabetic nephropathy
.
Curr Opin Nephrol Hypertens
2007
; 
16
:
22
26
38.
Vats
KR
,
Ishwad
C
,
Singla
I
, et al
:
A locus for renal malformations including vesico-ureteric reflux on chromosome 13q33-34
.
J Am Soc Nephrol
2006
; 
17
:
1158
1167
39.
Scott
LJ
,
Mohlke
KL
,
Bonnycastle
LL
, et al
:
A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants
.
Science
2007
; 
316
:
1341
1345

Supplementary data