Administrative data are used with increasing frequency in research. However, validity of such data, including International Classification of Diseases, 9th revision, Clinical Modification (ICD-9-CM) codes (1), varies across diseases and settings (212). The ICD-9-CM coding for diabetes in youth may be especially susceptible to errors. While most diagnoses of diabetes in American youth are type 1 diabetes (13), incidence of type 2 diabetes is increasing (14). Rising prevalence of pediatric obesity (15) makes distinguishing between type 1 and type 2 diabetes at diagnosis difficult, and type 2 diabetes ICD-9-CM codes (250.X0/X2) include “unspecified” diabetes (1). Our aim was to evaluate the positive predictive value (PPV) of type 2 diabetes ICD-9-CM codes in children, adolescents, and young adults.

In a retrospective chart review, we evaluated 432 patients aged <26 years as of 31 January 2005 with at least one visit to the Endocrine/Diabetes or Obesity Programs at Children’s Hospital Boston in Boston, Massachusetts, from 1 July 2003 to 31 January 2005 and at least one type 2 diabetes ICD-9-CM code (250.X0/X2, X = 0–9) from inpatient/outpatient sites before 5 April 2005. We identified 455 patients utilizing scheduling and billing information, and excluded 23 patients without completed visits.

To contrast the accuracy of type 2 diabetes ICD-9-CM codes with type 1 diabetes ICD-9-CM codes, we reviewed charts of patients <26 years as of 31 January 2005 with at least one visit to the Endocrine/Diabetes Program from 1 July 2003 to 31 January 2005 with a type 1 diabetes ICD-9-CM code (250.X1/X3, X = 0–9) at that visit. We randomly sampled 100 of 932 patients identified utilizing scheduling and billing information and excluded 1 patient without completed visits. Children’s Hospital Boston Institutional Review Board approved the study.

Chart review

A research assistant reviewed up to three records from Endocrine/Diabetes or Obesity Programs in reverse chronological order from 31 January 2005 using an algorithm. The algorithm assigned one diagnosis in the following order of priority based on provider-documented diagnoses in the records: any type of diabetes, impaired glucose tolerance, hyperglycemia, insulin resistance, hyperinsulinism/hyperinsulinemia, obesity, or diabetes insipidus. If more than one type of diabetes or if none of these diagnoses were documented, diagnosis was deferred to reviewers who were pediatric endocrinologists blinded to the study aim. After review, patients without these diagnoses were categorized as “other.”

Statistical analysis

Results are presented as proportions with PPV defined as the proportion with a type 2 diabetes ICD-9-CM code that had a clinical diagnosis of type 2 diabetes: [PPV = true positives/(true positives + false positives)]. Proportions were compared with χ2 test (SAS version 9.0).

Among 432 patients with a type 2 diabetes ICD-9-CM code, the average age as of 31 January 2005 was 15.5 ± 4.8 years (range 1.8–25.9). Average time between first and last encounter reviewed for patients with more than one visit was 0.78 ± 0.8 years (range 0.01–6.66). Diagnoses were assigned to 283 participants (66%) by algorithm and 149 (34%) by reviewers. Results are summarized in Table 1. Sixty-nine patients had type 2 diabetes (PPV 16.0%), and most others had type 1 diabetes. PPV was higher for the type 2 diabetes ICD-9-CM codes originating from Endocrine/Diabetes or Obesity Programs (19.9 vs. 7.7%, χ2 test, P = 0.001), and patients with ICD-9-CM codes originating from other hospital sites more often had cystic fibrosis–related diabetes, steroid-induced diabetes, or “other” diagnoses.

In contrast to type 2 diabetes codes, PPV for type 1 diabetes ICD-9-CM codes was higher. Among 99 patients assigned a type 1 diabetes ICD-9-CM code, 96 had type 1 diabetes (PPV 97.0%).

Administrative data provide useful information for researchers. However, disease and coding methods threaten validity. In a large children’s hospital, PPV of type 2 diabetes ICD-9-CM codes was low, whereas type 1 diabetes codes were highly accurate.

Several issues may explain these findings. First, type 2 diabetes ICD-9-CM codes include “unspecified” diabetes. Given the rising prevalence of pediatric obesity (15), differentiating type 1 and type 2 diabetes at diagnosis may be difficult. Patients with phenotypic characteristics of type 2 diabetes may have pancreatic autoimmunity (16), and African Americans may present with nonautoimmune (idiopathic) type 1b diabetes (17). These patients might be classified as unspecified with type 2 diabetes ICD-9-CM codes until the diagnosis is clarified. Second, type 2 diabetes codes may be utilized instead of more accurate but less familiar codes for other forms of diabetes, such as steroid-induced diabetes (codes 251.8 and E932.0). Third, many patients in our study without diabetes had type 2 diabetes risk factors, such as insulin resistance, suggesting inaccurate assignment of type 2 diabetes codes during diagnostic evaluation. A similar issue has been noted with ICD-9-CM coding for acute myocardial infarction (10,11). Finally, coding methods may influence accuracy. If patients only have “diabetes” written on billing forms, coders may utilize “unspecified” type 2 diabetes codes. In contrast, ICD-10-CM codes will separate the “unspecified” category (18).

While, to our knowledge, validity of type 2 diabetes ICD-9-CM coding in youth has not been evaluated, adult studies have examined accuracy of administrative data for identification of diabetes and its complications (710). Among 23,657 Medicare beneficiaries, PPV of diabetes ICD-9-CM code 250.x was 98% (9). Similarly, among 1,976 adults, PPV of two ICD-9-CM codes for diabetes (code 250) was 94% (7). Two codes were used because using one identified many patients without diabetes (7), consistent with our findings. In these studies, however, focus was on diabetes in general, and therefore the PPV cannot be directly compared with our findings.

Several limitations should be noted. First, we evaluated PPV of one type 2 diabetes ICD-9-CM code. This likely highlighted the worst-case scenario but underscores limitations of administrative coding for distinguishing between type 1 and type 2 diabetes. Alternative approaches including additional type 2 diabetes codes or excluding type 1 diabetes codes should be explored. Second, we could not evaluate sensitivity of type 2 diabetes codes, as alternative methods of identifying type 2 diabetic patients were lacking. Third, our analysis was conducted in one hospital. As PPV is influenced by prevalence of type 2 diabetes, our findings may not generalize to locations with higher rates of type 2 diabetes in youth. Institutional coding practices may also influence outcomes. Lastly, diabetes diagnoses were based on provider assessments, which could differ from diagnoses based on laboratory or other standardized criteria. However, the algorithm and reviewer blinding guarded against bias in assignment of diagnoses with respect to ICD-9-CM codes. Overall, our findings argue for clinical corroboration of these codes before widely applying them to pediatric diabetes research.

This study was supported by the grant K01DP000089 from the Centers for Disease Control and Prevention.

Parts of this study were published in abstract form at the 66th Scientific Sessions of the American Diabetes Association, Washington, DC, 9-13 June 2006.

We thank chart reviewers Laurie Cohen, Diane Stafford, Bat-Sheva Levine, and Catherine Gordon.

1.
National Center for Health Statistics: International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM), 2006. Available from http://www.cdc.gov/nchs/about/otheract/icd9/abticd9.htm. Accessed 26 March
2006
2.
Benesch C, Witter DM Jr, Wilder AL, Duncan PW, Samsa GP, Matchar DB: Inaccuracy of the International Classification of Diseases (ICD-9-CM) in identifying the diagnosis of ischemic cerebrovascular disease.
Neurology
49
:
660
–664,
1997
3.
Clark S, Gaeta TJ, Kamarthi GS, Camargo CA: ICD-9-CM coding of emergency department visits for food and insect sting allergy.
Ann Epidemiol
16
:
696
–700,
2006
4.
Goldstein LB: Accuracy of ICD-9-CM coding for the identification of patients with acute ischemic stroke: effect of modifier codes.
Stroke
29
:
1602
–1604,
1998
5.
Bazarian JJ, Veazie P, Mookerjee S, Lerner EB: Accuracy of mild traumatic brain injury case ascertainment using ICD-9 codes.
Acad Emerg Med
13
:
31
–38,
2006
6.
Andrade SE, Gurwitz JH, Chan KA, Donahue JG, Beck A, Boles M, Buist DSM, Goodman M, LaCroix AZ, Levin TR, Platt R: Validation of diagnoses of peptic ulcers and bleeding from administrative databases: a multi-health maintenance organization study.
J Clin Epidemiol
55
:
310
–313,
2002
7.
O’Connor PJ, Rush WA, Pronk NP, Cherney LM: Identifying diabetes mellitus or heart disease among health maintenance organization members: sensitivity, specificity, predictive value, and cost of survey and database methods.
Am J Manag Care
4
:
335
–342,
1998
8.
Kern EFO, Maney M, Miller DR, Tseng C-L, Tiwari A, Rajan M, Aron D, Pogach L: Failure of ICD-9-CM codes to identify patients with comorbid chronic kidney disease in diabetes.
Health Services Research
41
:
564
–580,
2006
9.
Birman-Deych E, Waterman AD, Yan Y, Nilasena DS, Radford MJ, Gage BF: Accuracy of ICD-9-CM codes for identifying cardiovascular and stroke risk factors.
Med Care
43
:
480
–485,
2005
10.
Newton KM, Wagner EH, Ramsey SD, McCulloch D, Evans R, Sandhu N, Davis C: The use of automated data to identify complications and comorbidities of diabetes: a validation study.
J Clin Epidemiol
52
:
199
–207,
1999
11.
Rosamond WD, Chambless LE, Sorlie PD, Bell EM, Weitzman S, Smith JC, Folsom AR: Trends in the sensitivity, positive predictive value, false-positive rate, and comparability ratio of hospital discharge diagnosis codes for acute myocardial infarction in four US communities, 1987–2000.
Am J Epidemiol
160
:
1137
–1146,
2004
12.
Frohnert BK, Lussky RC, Alms MA, Mendelsohn NJ, Symonik DM, Falken MC: Validity of hospital discharge data for identifying infants with cardiac defects.
J Perinatol
25
:
737
–742,
2005
13.
Duncan GE: Prevalence of diabetes and impaired fasting glucose levels among US adolescents: National Health and Nutrition Examination Survey, 1999–2002.
Arch Pediatr Adolesc Med
160
:
523
–528,
2006
14.
Fagot-Campagna A, Pettitt DJ, Engelgau MM, Burrows NR, Geiss LS, Valdez R, Beckles GL, Saaddine J, Gregg EW, Williamson DF, Narayan KM: Type 2 diabetes among North American children and adolescents: an epidemiologic review and a public health perspective.
J Pediatr
136
:
664
–672,
2000
15.
Ogden CL, Carroll MD, Curtin LR, McDowell MA, Tabak CJ, Flegal KM: Prevalence of overweight and obesity in the United States, 1999–2004.
JAMA
295
:
1549
–1555,
2006
16.
Gilliam LK, Brooks-Worrell BM, Palmer JP, Greenbaum CJ, Pihoker C: Autoimmunity and clinical course in children with type 1, type 2, and type 1.5 diabetes.
J Autoimmun
25
:
244
–250,
2005
17.
American Diabetes Association: Diagnosis and classification of diabetes mellitus (Position Statement).
Diabetes Care
29 (Suppl. 1)
:
S43
–S48,
2006
18.
National Center for Health Statistics: International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM). Available from http://www.cdc.gov/nchs/about/otheract/icd9/abticd10.htm. Accessed 6 May
2006

A table elsewhere in this issue shows conventional and Système International (SI) units and conversion factors for many substances.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C Section 1734 solely to indicate this fact.