rmnldwg / icd

Python library for dealing with ICD codes (revisions 10 & 11)

Python Interface to the International Statistical Classification of Diseases and Related Health Problems


Content

  1. Disclaimer
  2. Installation
  3. Usage
    1. Chapters
    2. Blocks
    3. Category
    4. Exploration
  4. Motivation
  5. Roadmap

Disclaimer

⚠️ This is not an official tool from the WHO, the CDC or any other authority with respect to clinical classifications. This Python package is independently developed and maintained. It should therefore probably not be used in critical clinical applications, as it wasn't approved or cross-checked by the issuers of the classifications.


Installation

This package has not yet been pushed to PyPI, so it can't just be installed via pip. Instead, clone the repository, cd into the directory and install it locally.

git clone git@github.com:rmnldwg/icd.git
cd icd
pip install .

Usage

📖 DOCS: The full documentation is hosted here using GitHub pages.

To get started, import the package and load a codex. The ICD-10 codex can be loaded from the submodule rev10, while the clinical modification of the CDC, ICD-10-CM, is available in rev10cm:

import icd

icd10_codex = icd.rev10.get_codex(release="2019")
icd10cm_codex = icd.rev10cm.get_codex(release="2022")

Chapters

Both objects are the root nodes of their respective ICD trees. Directly under each root are the main chapters of the classification, which are accessible via a dictionary aptly named chapters. The dictionary's keys are the chapter codes and its values are the corresponding entry instances. For example

icd10_codex.chapters["IX"]

returns

ICD10Chapter(code='IX', title='Diseases of the circulatory system', revision='10')

⚠️ NOTE: There is also an attribute called chapter. But that attribute returns the current entry's chapter, which is either the entry itself, if it is a chapter, or the chapter under which the entry is grouped. This is a general pattern: The singular form (root, chapter, block) returns the grouping above the current entry, while the plural form (chapters, blocks, categories) returns a dictionary with ICD codes as keys and the child elements below as values.
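The singular/plural convention can be pictured with a small stand-in tree. This is only a sketch of the idea and not the package's implementation: the Entry class and its kind, parent and children fields are hypothetical, and only the chapter/chapters pair is modeled.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(eq=False)
class Entry:
    """Hypothetical stand-in for an ICD entry (not the package's real class)."""

    kind: str  # "root", "chapter", "block" or "category"
    code: str
    parent: Optional["Entry"] = field(default=None, repr=False)
    children: List["Entry"] = field(default_factory=list, repr=False)

    def add(self, child: "Entry") -> "Entry":
        """Attach `child` below this entry and return it."""
        child.parent = self
        self.children.append(child)
        return child

    @property
    def chapter(self) -> Optional["Entry"]:
        """Singular form: walk upwards, starting at self, to the chapter."""
        node: Optional[Entry] = self
        while node is not None and node.kind != "chapter":
            node = node.parent
        return node

    @property
    def chapters(self) -> Dict[str, "Entry"]:
        """Plural form: direct children that are chapters, keyed by code."""
        return {c.code: c for c in self.children if c.kind == "chapter"}


root = Entry("root", "ICD-10")
chapter = root.add(Entry("chapter", "II"))
block = chapter.add(Entry("block", "C00-C97"))

print(block.chapter.code)    # II
print(chapter.chapter.code)  # II
print(list(root.chapters))   # ['II']
```

In this sketch the singular property looks upwards (and may return the entry itself), while the plural property only looks one level down.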

Blocks

Next in the ICD hierarchy are blocks, for which the code attribute is a range of ICD codes, like C00-C97. The blocks of a chapter are accessible from a chapter via blocks in the same manner as chapters are accessed from the root.

icd10_codex.chapters["II"].blocks["C00-C97"]

returns

ICD10Block(code='C00-C97', title='Malignant neoplasms', revision='10')

Blocks may contain other blocks. So it is possible for a block element to have both the attributes block and blocks available. E.g. the block with the code C00-C75 is such a case:

middle_block = icd10_codex.get("C00-C75")
parent_block = middle_block.block    # this will have the code `C00-C97`
child_blocks = middle_block.blocks   # dictionary containing more blocks below

Category

Blocks themselves can have either yet more, but finer, blocks as children (reach them via the blocks attribute again) or categories containing actual diagnoses. In the latter case - you might have guessed it - they are returned in a dictionary with codes as keys called categories.

icd10_codex.chapters["XVI"].blocks["P05-P08"].categories["P07"]

returns

ICD10Category(code='P07', title='Disorders related to short gestation and low birth weight, not elsewhere classified', revision='10')
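Because blocks can nest to different depths, collecting every category below an entry needs a recursive walk. The helper below is a sketch under assumptions: iter_categories is a hypothetical name, it relies only on the blocks and categories dictionaries described above, and it treats a missing attribute as "no children of that kind" (how the real entries behave in that case is not confirmed here). Stand-in objects are used so the snippet runs without the package.

```python
from types import SimpleNamespace


def iter_categories(entry):
    """Yield every category reachable from `entry` through nested blocks."""
    # Missing attributes are treated as "no children of that kind".
    for category in getattr(entry, "categories", {}).values():
        yield category
    for block in getattr(entry, "blocks", {}).values():
        yield from iter_categories(block)


# Stand-in objects mimicking the documented attributes.
p07 = SimpleNamespace(code="P07")
inner = SimpleNamespace(code="P05-P08", categories={"P07": p07})
chapter = SimpleNamespace(code="XVI", blocks={"P05-P08": inner})

print([c.code for c in iter_categories(chapter)])  # ['P07']
```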

Exploration

Of course, one doesn't know the chapters, blocks and codes by heart, which is why there is a growing number of utilities to explore and visualize the tree of codes. First, the entire subtree of an entry can be printed up to a specified depth using the tree(maxdepth=<N>) method:

icd10_codex.chapters["XII"].tree(maxdepth=2)

prints

block L00-L08: Infections of the skin and subcutaneous tissue
├───category L00: Staphylococcal scalded skin syndrome
├───category L01: Impetigo
│   ├───category L01.0: Impetigo [any organism] [any site]
│   └───category L01.1: Impetiginization of other dermatoses
├───category L02: Cutaneous abscess, furuncle and carbuncle
│   ├───category L02.0: Cutaneous abscess, furuncle and carbuncle of face
│   ├───category L02.1: Cutaneous abscess, furuncle and carbuncle of neck
│   ├───category L02.2: Cutaneous abscess, furuncle and carbuncle of trunk
│   ├───category L02.3: Cutaneous abscess, furuncle and carbuncle of buttock
│   ├───category L02.4: Cutaneous abscess, furuncle and carbuncle of limb
│   ├───category L02.8: Cutaneous abscess, furuncle and carbuncle of other sites
│   └───category L02.9: Cutaneous abscess, furuncle and carbuncle, unspecified
├───category L03: Cellulitis
│   ├───category L03.0: Cellulitis of finger and toe
│   ├───category L03.1: Cellulitis of other parts of limb
│   ├───category L03.2: Cellulitis of face
│   ├───category L03.3: Cellulitis of trunk
│   ├───category L03.8: Cellulitis of other sites
│   └───category L03.9: Cellulitis, unspecified
├───category L04: Acute lymphadenitis
│   ├───category L04.0: Acute lymphadenitis of face, head and neck
│   ├───category L04.1: Acute lymphadenitis of trunk
│   ├───category L04.2: Acute lymphadenitis of upper limb
│   ├───category L04.3: Acute lymphadenitis of lower limb
│   ├───category L04.8: Acute lymphadenitis of other sites
│   └───category L04.9: Acute lymphadenitis, unspecified
├───category L05: Pilonidal cyst
│   ├───category L05.0: Pilonidal cyst with abscess
│   └───category L05.9: Pilonidal cyst without abscess
└───category L08: Other local infections of skin and subcutaneous tissue
    ├───category L08.0: Pyoderma
    ├───category L08.1: Erythrasma
    ├───category L08.8: Other specified local infections of skin and subcutaneous tissue
    └───category L08.9: Local infection of skin and subcutaneous tissue, unspecified
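For readers curious how such a depth-limited rendering can be produced, here is a minimal sketch. It is not the package's tree() implementation: render_tree is a hypothetical function that works on plain (label, children) tuples, and the sample data is a small excerpt of the output above.

```python
def render_tree(node, maxdepth, prefix="", depth=0):
    """Return the text lines for `node` and its descendants.

    `node` is a `(label, children)` tuple; `maxdepth` limits how many
    levels below the starting node are drawn.
    """
    label, children = node
    lines = [label] if depth == 0 else []
    if depth >= maxdepth:
        return lines
    for i, child in enumerate(children):
        is_last = i == len(children) - 1
        lines.append(prefix + ("└───" if is_last else "├───") + child[0])
        child_prefix = prefix + ("    " if is_last else "│   ")
        lines.extend(render_tree(child, maxdepth, child_prefix, depth + 1))
    return lines


# Excerpt of the block shown above, as nested (label, children) tuples.
block = (
    "block L00-L08: Infections of the skin and subcutaneous tissue",
    [
        ("category L00: Staphylococcal scalded skin syndrome", []),
        (
            "category L01: Impetigo",
            [
                ("category L01.0: Impetigo [any organism] [any site]", []),
                ("category L01.1: Impetiginization of other dermatoses", []),
            ],
        ),
    ],
)

print("\n".join(render_tree(block, maxdepth=2)))
```

Calling it with maxdepth=1 would stop after the two category lines directly below the block.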

It is also possible to search for codes or even just parts of codes using the search() method. It always returns a list of found entries.

# get category by ICD code
cat = icd10_codex.search("C32.1")[0]

# print ancestry of category
cat.ancestry()

The ancestry() method prints the ancestors of a given entry, in contrast to tree(), which prints the descendants. The above code will output this:

root ICD-10: International Statistical Classification of Diseases and Related Health Problems 10th Revision
└───chapter II: Neoplasms
    └───block C00-C97: Malignant neoplasms
        └───block C00-C75: Malignant neoplasms, stated or presumed to be primary, of specified sites, except of lymphoid, haematopoietic and related tissue
            └───block C30-C39: Malignant neoplasms of respiratory and intrathoracic organs
                └───category C32: Malignant neoplasm of larynx
                    └───category C32.1: Supraglottis
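The upward walk behind such an output can be sketched as follows. This is an illustration only, not the package's ancestry() code: the label and parent attributes and both function names are hypothetical, and the chain is shortened compared to the real output above.

```python
from types import SimpleNamespace


def ancestors(entry):
    """Return [top, ..., entry] by following `parent` links upwards."""
    chain = []
    node = entry
    while node is not None:
        chain.append(node)
        node = node.parent
    return chain[::-1]


def render_ancestry(entry):
    """Format the chain from the top-most ancestor down to `entry`."""
    lines = []
    for depth, node in enumerate(ancestors(entry)):
        if depth == 0:
            lines.append(node.label)
        else:
            lines.append("    " * (depth - 1) + "└───" + node.label)
    return lines


# Shortened stand-in chain using labels from the output above.
chapter = SimpleNamespace(label="chapter II: Neoplasms", parent=None)
block = SimpleNamespace(label="block C00-C97: Malignant neoplasms", parent=chapter)
c32 = SimpleNamespace(label="category C32: Malignant neoplasm of larynx", parent=block)
c32_1 = SimpleNamespace(label="category C32.1: Supraglottis", parent=c32)

print("\n".join(render_ancestry(c32_1)))
```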

Finally, it's possible to check if a specific code exists using the exists(code) method and return it using get(code):

icd10_codex.exists("H26.2")  # will return `True`
icd10_codex.get("H26.2")     # will return the respective category
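These three methods combine naturally into a defensive lookup that never indexes into an empty search result. The following is a sketch under assumptions: lookup is a hypothetical helper, and FakeCodex is a tiny stand-in whose substring-based search() only approximates what the real method may do. Only the method names exists, get and search come from this README.

```python
def lookup(codex, code):
    """Return the entry for `code`, else the first search hit, else None."""
    if codex.exists(code):
        return codex.get(code)
    matches = codex.search(code)
    return matches[0] if matches else None


class FakeCodex:
    """Tiny stand-in offering exists/get/search over a flat dict."""

    def __init__(self, entries):
        self._entries = entries

    def exists(self, code):
        return code in self._entries

    def get(self, code):
        return self._entries[code]

    def search(self, text):
        # Assumption: a match means the text occurs inside the code.
        return [e for c, e in self._entries.items() if text in c]


fake = FakeCodex({"H26.2": "entry H26.2", "C32.1": "entry C32.1"})

print(lookup(fake, "H26.2"))  # entry H26.2
print(lookup(fake, "C32"))    # entry C32.1
print(lookup(fake, "Z99"))    # None
```

With the real package, the same helper could be called as lookup(icd10_codex, "H26.2"), provided the methods behave as described above.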

Motivation

I recently noticed that there have been some attempts to write a Python package for dealing with ICD codes, notably the Python package icd10-cm, which is very simple and pythonic, as well as icd-codex, which was apparently the result of a hackathon and represents the ICD-10 codes as a graph.

Despite those attempts, however, no package out there serves all needs or even just deals with the latest 11th revision of ICD. One reason might be that the WHO is actually quite stingy with the raw data: if it publishes any data openly at all, it is usually some reduced table in a format that is somewhat unconventional for programmers and data scientists.

This package attempts to combine the great ideas of the previously mentioned packages while providing a more complete interface to the ICD system.


Roadmap

As one might have noticed, this package isn't the complete interface to the ICD system it strives to be. So, here's an outlook of what features are planned to be added soon:

  • Implementation of the latest revision ICD-11 (issue #7)
  • A translation from ICD-10 to ICD-11 and back (issue #8)
  • Adding modifiers to ICD-10 (issue #3)
  • Enable exporting the codex in different formats (issue #6)

Stay tuned for updates!

License:MIT License


Languages

Language:Python 100.0%