mirthAI / RetinaRegNet

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RetinaRegNet

RetinaRegNet is a versatile registration model that achieves state-of-the-art performance in various retinal image registration problems. RetinaRegNet has the capacity to register images with large displacement deformation and minimal overlap. Unlike other deep learning-based registration methods, it does not require training on retinal images, making it a general approach applicable to other image registration problems. Our method is inspired by the paper titled "Emergent Correspondence from Image Diffusion", where the authors applied a latent stable diffusion model to generate robust Diffusion Features (DIFT) for finding semantic correspondences. We evaluated the performance of RetinaRegNet on three retinal image registration datasets: two public datasets, "FIRE" and "FLoRI21", and a privately owned dataset titled "LSFG".

Visual Results

For additional queries about the model and its architecture, kindly refer to our paper titled "RetinaRegNet: A Versatile Approach for Retinal Image Registration".

Requirements

The scripts were executed using a custom kernel configured with the list of packages mentioned below.

  • torch==2.0.1
  • torchvision==0.15.2
  • jax==0.4.23
  • jaxlib==0.4.23
  • accelerate==0.23.0
  • diffusers==0.20.2
  • ipympl==0.9.3
  • transformers==4.34.0
  • xformers==0.0.22
  • numpy
  • scipy
  • opencv-python
  • matplotlib
  • pyunpack
  • Note: To install the aforementioned list of packages, kindly use the command pip install -r requirements.txt

Datasets

Evaluation Steps

Evaluating our model on any of the mentioned datasets involves three simple steps, outlined below:

  • Download a copy of the dataset using the provided link.
  • Download the corresponding script for the dataset from this repository.
  • Start executing the entire script with style.

Note:

  • Before starting the model evaluation, ensure that the dataset and its corresponding script are saved in the same directory.
  • The aforementioned scripts are customized for each dataset listed here, facilitating the automatic processing of all images and their corresponding ground truth landmarks.

References

Certain code cells were from this GitHub repository, as detailed in the paper titled "Emergent Correspondence from Image Diffusion".

Citations

Kindly cite our paper as follows if you use our code.

@misc{sivaraman2024retinaregnet,
    title={RetinaRegNet: A Versatile Approach for Retinal Image Registration},
    author={Vishal Balaji Sivaraman and Muhammad Imran and Qingyue Wei and Preethika Muralidharan and Michelle R. Tamplin and Isabella M . Grumbach and Randy H. Kardon and Jui-Kai Wang and Yuyin Zhou and Wei Shao},
    year={2024},
    eprint={2404.16017},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Contact

For any questions related to the code or its execution, please feel free to contact Vishal Balaji Sivaraman at vishalbalaji.sivaraman@medicine.ufl.edu.

About

License:MIT License


Languages

Language:Jupyter Notebook 100.0%