yhygao/CBIM-Medical-Image-Segmentation

CBIM-Medical-Image-Segmentation

This repo is a PyTorch-based framework for medical image segmentation, whose goal is to provide an easy-to-use framework for academic researchers to develop and evaluate deep learning models. It provides fair evaluation and comparison of CNNs and Transformers on multiple medical image datasets.

News

Detail updates can be found in docs/change.md

Our new work, Hermes, has been released on arXiv: Training Like a Medical Resident: Universal Medical Image Segmentation via Context Prior Learning. Inspired by the training of medical residents, we explore universal medical image segmentation, whose goal is to learn from diverse medical imaging sources covering a range of clinical targets, body regions, and image modalities. Following this paradigm, we propose Hermes, a context prior learning approach that addresses the challenges related to the heterogeneity on data, modality, and annotations in the proposed universal paradigm. Code will be released at https://github.com/yhygao/universal-medical-image-segmentation.
Support PyTorch 2.0 (April 10, 2023). Detail tutorial is comming soon.
Support using GPU to conduct data augmentation for faster training
We update a revised version of MedFormer paper, including more experiments and analysis. (April 5, 2023)
Support AMOS22 dataset. (Feb. 16, 2023)
Support KiTS19 dataset. (Feb. 10, 2023)
Support PyTorch DDP and AMP training. (Dec. 19. 2022)
Add a tutorial on the usage of this repo: docs/tutorial.md (Still updating, Dec. 19. 2022)
Supports for BCV and LiTS dataset have been updated.

Features

Cover the whole process of model design, including dataset processing, model definition, model configuration, training and evaluation.
Provide SOTA models as baseline for comparison. Model definition, training and evaluation code are simple with no complex code encapsulation.
Provide models, losses, metrics, augmentation and etc. for 2D, 3D data, multiple modalities and multiple tasks.
Optimized training techniques for SOTA performance.

Supporting models

MedFormer (Official implementation)
UNet. Including 2D, 3D with different building block, e.g. double conv, Residual BasicBlock, Bottleneck, MBConv, or ConvNeXt block.
UNet++
Attention UNet
Dual Attention
TransUNet
SwinUNet
UNETR
VT-UNet
nnFormer
SwinUNETR
More models are comming soon ...

Supporting Datasets

ACDC Cardiac MRI
BCV Abdomen Organ CT
LiTS Liver and Tumor CT
KiTS19 Kidney and Tumor CT
AMOS22 Abdomen Organ CT and MR
More datasets are comming soon

Usage

We provide flexible usage. If you just want to use the models in your own framework, you can directly find the corresponding models in the model/ folder and use them in your own framework. For the definition of specific models, please refer to model/utils.py in the get_model function. The models we provide do not have complex dependencies and encapsulation. The modules used by each model are defined in its own xxx_utils.py file. For example, the model definition of UNet only depend on unet.py, unet_utils.py and conv_layers.py.

If you want to use our framework, please follow below steps.

Install requirements

Create a new virtual environment and install all dependencies by:

pip install -r requirement.txt

Data preparation

Download the origin dataset from their corresponding official website.

Enter the dataset_conversion fold and find the dataset you want to use and the corresponding dimension (2d or 3d)

Edit the src_path and tgt_path the in xxxdataset.py, where the src_path is the path to the origin dataset, and tgt_path is the target path to store the processed dataset.

Then, python xxxdataset.py

After processing is finished, put the processed dataset into dataset/ folder or use a soft link.

Configuration

Enter config/xxxdataset/ and find the model and dimension (2d or 3d) you want to use. The training details, e.g. model hyper-parameters, training epochs, learning rate, optimizer, data augmentation, etc., can be altered here. You can try your own congiguration or use the default configure, the one we used in the MedFormer paper, which should have a decent performance. The only thing to care is the data_root, make sure it points to the processed dataset directory.

Training

We can start training after the data and configuration is done. Several arguments can be parsed in the command line, see in the get_parser() function in the train.py and train_ddp.py. You need to specify the model, the dimension, the dataset, whether use pretrain weights, batch size, and the unique experiment name. Our code will find the corresponding configuration and dataset for training.

Here is an example to train 3D MedFormer with one gpu on ACDC:

python train.py --model medformer --dimension 3d --dataset acdc --batch_size 3 --unique_name acdc_3d_medformer --gpu 0

This command will start the cross validation on ACDC. The training loss and evaluation performance will be logged by tensorboard. You can find them in the log/dataset/unique_name folder. All the standard output, the configuration, and model weigths will be saved in the exp/dataset/unique_name folder. The results of cross validation will be saved in exp/dataset/unique_name/cross_validation.txt.

Besides training with a single GPU, we also provide distributed training (DDP) and automatic mixed precision (AMP) training in the train_ddp.py. The train_ddp.py is the same as train.py except it supports DDP and AMP. We recomend you to start with train.py to make sure the whole train and eval pipeline is correct, and then use train_ddp.py for faster training or larger batch size.

Example of using DDP:

python train_ddp.py --model medformer --dimension 3d --dataset acdc --batch_size 16 --unique_name acdc_3d_medformer_ddp --gpu 0,1,2,3

Example of using DDP and AMP:

python train_ddp.py --model medformer --dimension 3d --dataset acdc --batch_size 32 --unique_name acdc_3d_medformer_ddp_amp --gpu 0,1,2,3 --amp

We have not fully benchmark if AMP can speed up training, but AMP can reduce the GPU memory consumption a lot.

To Do

Add MSD dataset support.

We'll continously maintain this repo to add more SOTA models, and add more dataset support.

Performance comparison results of the supported models and dataset

Hope this repo can serves as a solid baseline for the future medical imaging model design.

Citation

If you find this repo helps, please kindly cite our paper, thanks!

@inproceedings{gao2021utnet,
  title={UTNet: a hybrid transformer architecture for medical image segmentation},
  author={Gao, Yunhe and Zhou, Mu and Metaxas, Dimitris N},
  booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
  pages={61--71},
  year={2021},
  organization={Springer}
}

@article{gao2022data,
  title={A data-scalable transformer for medical image segmentation: architecture, model efficiency, and benchmark},
  author={Gao, Yunhe and Zhou, Mu and Liu, Di and Yan, Zhennan and Zhang, Shaoting and Metaxas, Dimitris N},
  journal={arXiv preprint arXiv:2203.00131},
  year={2022}
}

yhygao / CBIM-Medical-Image-Segmentation