Scientific article
OA Policy
English

From episodes of care to diagnosis codes: automatic text categorization for medico-economic encoding

Publication date2008
Abstract

We report on the design and evaluation of an original system to help assignment ICD (International Classification of Dis-ease) codes to clinical narratives. The task is defined as a multi-class multi-document classification task. We combine a set of machine learning and data-poor methods to generate a single automatic text categorizer, which returns a ranked list of ICD codes. The combined ranking system currently ob-tains a precision of 75% at high ranks and a recall of about 63% for the top twenty returned codes for a theoretical upper bound of about 79% (inter-coder agreement). The performance of the data-poor classifier is weak, whereas the use of tempo-rally-typed contents such as anamnesis and prescription free text sections results in a statistically significant improvement.

Citation (ISO format)
RUCH, Patrick et al. From episodes of care to diagnosis codes: automatic text categorization for medico-economic encoding. In: AMIA ... Annual Symposium proceedings, 2008, p. 636–640.
Main files (1)
Article (Accepted version)
accessLevelPublic
Identifiers
Journal ISSN1559-4076
568views
215downloads

Technical informations

Creation21/09/2009 12:13:00
First validation21/09/2009 12:13:00
Update time14/03/2023 15:11:45
Status update14/03/2023 15:11:45
Last indexation29/10/2024 12:00:25
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack