Close

Measuring the Information Gain of Diagnosis vs. Diagnosis Category Coding.

Abstract

Coding categories of diseases, injuries, symptoms, findings, etc. with ICD-9-CM necessarily imparts a loss of information vs. coding such entities with a terminology or ontology-a consequence of the nature of classifications. However, to our knowledge, no one has attempted to quantify this information loss or conversely, the information to be gained by coding entities as opposed to categories. We estimated a lower bound on information gain of coding with SNOMED CT instead of ICD-9-CM, as measured by Shannon's information entropy. We found that the nation could gain more than 97 megabytes of information per year by coding diagnoses vs. diagnosis categories, an increase of 10%. This increase is more than that obtained from coding ICD-9-CM at the 5(th) instead of the 3(rd) digit level. We recommend that ICD-9-CM be removed from electronic medical record (EMR) stage 2 and later meaningful use criteria.

MIDAS Network Members

Citation:

Hogan WR, Slee VN. (2010). Measuring the Information Gain of Diagnosis vs. Diagnosis Category Coding. AMIA ... Annual Symposium proceedings. AMIA Symposium, (2010)