gazzola / BRAX-labeler

CheXpert NLP tool to extract observations from radiology reports.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

BRAX Labeler: Brazilian Portuguese adaptation of Chexpert Labeler

The main difference resides in the fact that NegBio doesn't work for Portuguese, so we adapt Brazillian Negex triggers to detect negation and uncertainty. In this code we use EasyNegex repository implementation of Negex.

Prerequisites

Please install following dependencies or use the Dockerized labeler (see below).

  1. Clone the EasyNegex repository to the root of this repository:
git clone https://github.com/fuchsfelipel/easyNegex
  1. Make the virtual environment:
conda env create -f environment.yml
  1. Activate the virtual environment:
conda activate chexpert-label
  1. Install NLTK data:
python -m nltk.downloader universal_tagset punkt wordnet

Usage

Place reports in a headerless, single column csv {reports_path}. Each report must be contained in quotes if (1) it contains a comma or (2) it spans multiple lines. See sample_reports.csv (with output labeled_reports.csv)for an example.

python label.py --reports_path {reports_path}

Run python label.py --help for descriptions of all of the command-line arguments.

Contributions

This repository builds upon the work of CheXpert, Negex and EasyNegex.

Citing

If you're using the BRAX labeler forked from CheXpert labeling tool, please cite BRAX dataset and CheXpert as a reference.

About

CheXpert NLP tool to extract observations from radiology reports.

License:MIT License


Languages

Language:Python 100.0%