ContourDiff

Contour-Guided Diffusion Models for Unpaired Image-to-Image Translation

By Yuwen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee and Maciej Mazurowski

This is the code for our paper ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models, which is a novel framework that leverages domain-invariant anatomical contour representations of images to enable unpaired translation between different domains.

Why use our model?

Our method can:

Enforce precise anatomical consistency even between modelaities with severe structural biases (See example figure below)
Potentially translate images from arbitrary unseen input domains (i.e., train once, translate any)

Great thanks to Segmentation-guided Diffusion for inspiration and code backbone!

Citation

Please cite our paper if you use our code or reference our work:

@article{chen2024contourdiff,
  title={ContourDiff: Unpaired Image Translation with Contour-Guided Diffusion Models},
  author={Chen, Yuwen and Konz, Nicholas and Gu, Hanxue and Dong, Haoyu and Chen, Yaqian and Li, Lin and Lee, Jisoo and Mazurowski, Maciej A},
  journal={arXiv preprint arXiv:2403.10786},
  year={2024}
}

Getting Started

Please follow the steps below to have your own ContourDiff model!

1) Preprocess data and extract contours

To extract the contours, run command:

python preprocess.py \
  --data_directory {DATA_DIRECTORY} \
  --domain_img_folder {DOMAIN_IMG_FOLDER} \
  --domain_contour_folder {DOMAIN_CONTOUR_FOLDER} \
  --domain_meta_path {DOMAIN_META_PATH} \

where:

DATA_DIRECTORY is directory of data from multiple domains
DOMAIN_IMG_FOLDER is path to certain domain images under DATA_DIRECTORY
DOMAIN_CONTOUR_FOLDER is path to save extracted contours under DATA_DIRECTORY
DOMAIN_META_PATH is path (*.csv) to save meta information under DATA_DIRECTORY

To enable removal of non-anatomical background artifacts, use --remove_artifact.

For example, given data structure below:

DATA_DIRECTORY
├── domain_1
│   ├── images
│   │   ├── img_1.png
│   │   ├── img_2.png
│   │   └── ...
├── domain_2
│   ├── images
│   │   ├── img_1.png
│   │   ├── img_2.png
│   │   └── ...
└── ...

If extracting contours for images from domain 1, then set DOMAIN_IMG_FOLDER="domain_1/images" Then, if setting DOMAIN_CONTOUR_FOLDER="domain_1/contours" and DATA_DIRECTORY="domain_1/df_meta.csv", the outcome data structure is:

DATA_DIRECTORY
├── domain_1
│   ├── images
│   │   ├── img_1.png
│   │   ├── img_2.png
│   │   └── ...
│   ├── contours
│   │   ├── img_1.png
│   │   ├── img_2.png
│   │   └── ...
│   ├── df_meta.csv
├── domain_2
│   ├── images
│   │   ├── img_1.png
│   │   ├── img_2.png
│   │   └── ...
└── ...

To visualize the extracted contours, run contour_checker.ipynb.

2) Training Phase

To train your own ContourDiff model, run command:

python train.py \
  --input_domain {INPUT_DOMAIN} \
  --output_domain {OUTPUT_DOMAIN} \
  --data_directory {DATA_DIRECTORY} \
  --input_domain_img_folder {INPUT_DOMAIN_IMG_FOLDER} \
  --input_domain_contour_folder {INPUT_DOMAIN_CONTOUR_FOLDER} \
  --output_domain_img_folder {OUTPUT_DOMAIN_IMG_FOLDER} \
  --output_domain_contour_folder {OUTPUT_DOMAIN_CONTOUR_FOLDER} \
  --input_domain_meta_path {INPUT_DOMAIN_META_PATH} \
  --output_domain_meta_path {OUTPUT_DOMAIN_META_PATH} \
  --output_dir {OUTPUT_DIR}
  --device {DEVICE} \
  --contour_guided \

where:

INPUT_DOMAIN is the string name of the input domain (e.g. any, CT or MRI)
OUTPUT_DOMAIN is the string name of the output domain (e.g. CT or MRI)
DATA_DIRECTORY is directory of data from multiple domains
INPUT_DOMAIN_IMG_FOLDER is path to input domain images under DATA_DIRECTORY
INPUT_DOMAIN_CONTOUR_FOLDER is path to input domain contours under DATA_DIRECTORY
OUTPUT_DOMAIN_IMG_FOLDER is path to output domain images under DATA_DIRECTORY
OUTPUT_DOMAIN_CONTOUR_FOLDER is path to output domain contours under DATA_DIRECTORY
INPUT_DOMAIN_META_PATH is path (*.csv) to input domain meta file under DATA_DIRECTORY
OUTPUT_DOMAIN_META_PATH is path (*.csv) to output domain meta file under DATA_DIRECTORY
OUTPUT_DIR is absolute path to save output results, including model checkpoints and visualization samples
DEVICE is GPU device
contour_guided is flag to enable contour-guided mode for diffusion models

Notice: Input domain images and contours are used for validation in the training phase.

3) Translation Phase

To translate input domain images using your own ContourDiff model, run command:

python translation.py \
  --input_domain {INPUT_DOMAIN} \
  --output_domain {OUTPUT_DOMAIN} \
  --data_directory {DATA_DIRECTORY} \
  --input_domain_contour_folder {INPUT_DOMAIN_CONTOUR_FOLDER} \
  --input_domain_meta_path {INPUT_DOMAIN_META_PATH} \
  --num_copy {NUM_COPY} \
  --by_volume \
  --volume_specifier {VOLUME_SPECIFIER} \
  --slice_specifier {SLICE_SPECIFIER} \
  --selected_epoch {SELECTED_EPOCH} \
  --translating_folder_name {TRANSLATING_FOLDER_NAME} \
  --device {DEVICE} \
  --num_partition {NUM_PARTITION} \
  --partition {PARTITION}

where:

INPUT_DOMAIN is the string name of the input domain (e.g. any, CT or MRI)
OUTPUT_DOMAIN is the string name of the output domain (e.g. CT or MRI)
DATA_DIRECTORY is directory of data from multiple domains
INPUT_DOMAIN_CONTOUR_FOLDER is path to input domain contours under DATA_DIRECTORY
INPUT_DOMAIN_META_PATH is path (*.csv) to input domain meta file under DATA_DIRECTORY
OUTPUT_DIR is absolute path to save output results, including model checkpoints and visualization samples
NUM_COPY is the number of samples generated in each iteration
by_volume is flag to enable slice-by-slice generation within each volume
VOLUME_SPECIFIER is string of column to indicate each volume (e.g., "volume")
SLICE_SPECIFIER is string of column to indicate slice number (e.g., "slice")
SELECTED_EPOCH is epoch of the selected checkpoint to load
TRANSLATING_FOLDER_NAME is absolute path to store the tranlsated images
DEVICE is GPU device
NUM_PARTITION is total number of partition to split input domain units (either slices or volumes)
PARTITION is specified partition to translate

Notice:

VOLUME_SPECIFIER and SLICE_SPECIFIER are required to enable by_volume translation, which means the meta file should include corresponding columns.
NUM_PARTITION and PARTITION are aimed for translation in parallel.
PARTITION is within range [0, NUM_PARTITION - 1].

mazurowski-lab / ContourDiff

ContourDiff

By Yuwen Chen, Nicholas Konz, Hanxue Gu, Haoyu Dong, Yaqian Chen, Lin Li, Jisoo Lee and Maciej Mazurowski

Why use our model?

Citation

Getting Started

1) Preprocess data and extract contours

2) Training Phase

3) Translation Phase

About

Languages