mazurowski-lab / DBTex-baseline

This is a baseline neural network model for breast tumor detection on the BCS-DBT dataset, to accompany the DBTex challenge.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DBTex-baseline

This is an implementation of a basic Faster R-CNN model for breast tumor detection on the ultra-high resolution Duke BCS-DBT (Breast Cancer Screening - Digital Breast Tomosynthesis) dataset, to be used as a "baseline" model for the DBTex breast lesion detection challenge and benchmark.

We provide notebooks from pre-processing and dataset preparation to training and testing stages. We offer two version of the Faster-RCNN implementation.

Installation

We build our implemenation mainly based on MMDection. The installation of MMDetection can be found here.

Preparing Data

The raw (DICOM) image data and annotation/label tables can be downloaded from the BCS-DBT page on The Cancer Imaging Archive. After downloading the tables, put them in ./data+_csv.

We supply the notebook preprocess.ipynb to do data preprocessing, which can transfer the raw dicom images into png image slices and create the corresponding json files.

Model Training

For model training, use the notebook train&evaluate.ipynb.

Model Evaluation

Trained models are saved in /work_dirs/breast. If you want to evaluate our pretrained checkpoints, you can set the cfg.work_dirs to the pretrained model.

Model Testing

For testing, we offer the notebook generate_test_table.ipynb to create predictions formatted according to the DBTex guidelines.

About

This is a baseline neural network model for breast tumor detection on the BCS-DBT dataset, to accompany the DBTex challenge.

License:Apache License 2.0


Languages

Language:Python 97.7%Language:Shell 1.4%Language:Jupyter Notebook 0.8%Language:Dockerfile 0.1%Language:Batchfile 0.0%Language:Makefile 0.0%