Harvard-Ophthalmology-AI-Lab / FairSeg

[ICLR 2024] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Home Page:https://ophai.hms.harvard.edu/datasets/harvard-fairseg10k/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

FairSeg

[ICLR 24] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

by Yu Tian*, Min Shi*, Yan Luo*, Ava Kouhana, Tobias Elze, and Mengyu Wang.

Screenshot 2024-01-20 at 9 24 39 AM

Note that, the modifier word “Harvard” only indicates that our dataset is from the Department of Ophthalmology of Harvard Medical School and does not imply an endorsement, sponsorship, or assumption of responsibility by either Harvard University or Harvard Medical School as a legal identity.

Download Harvard-FairSeg Dataset

  • Our Harvard-FairSeg dataset can refer to the dataset page of our lab website via this link.

  • Alternatively, you could also use this Google Drive link to directly download our Harvard-FairSeg dataset.

  • If you cannot directly download the Harvard-FairSeg dataset, please request access in the above Google Drive link, we will make sure to grant you access within 3-5 days.

  • Please refer to each of the folders for FairSeg with SAMed and TransUNet, respectively.

  • CVer中文讲解

Dataset Description

This dataset can only be used for non-commercial research purposes. At no time, the dataset shall be used for clinical decisions or patient care. The data use license is CC BY-NC-ND 4.0.

The dataset containing 10,000 patients includes 10,000 Scanning laser ophthalmoscopy (SLO) fundus images. The disc and cup masks, patient age, gender, race, ethnicity, language, and marital status information are also included in the data. Under the folder "ReadMe", the file "data_summary.csv" provides an overview of our data.

10,000 SLO fundus images with pixel-wise disc and cup masks are in the Google Drive folder: data_00001.npz data_00002.npz ... data_10000.npz

NPZ files have the following keys:

slo_fundus: Scanning laser ophthalmoscopy (SLO) fundus image
disc_cup_mask: disc and cup masks for the corresponding SLO fundus image
age: patient's age
gender: 0 - Female, 1 - Male
race: 0 - Asian, 1 - Black, 2 - White
ethnicity: 0 - Non-Hispanic, 1 - Hispanic, -1 - Unknown
language: 0 - English, 1 - Spanish, 2 - Others, -1 - Unknown
maritalstatus: 0 - Married or Partnered, 1 - Single, 2 - Divorced, 3 - Widowed, 4 - Legally Separated, -1 - Unknown

More Fairness Datasets

  • 🍻🍻 For more fairness datasets including 2D and 3D images of three different eye diseases, please check our dataset webpage!

Acknowledgement & Citation

If you find this repository useful for your research, please consider citing our paper:

@inproceedings{tian2024fairseg,
      title={FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling}, 
      author={Yu Tian, Min Shi, Yan Luo, Ava Kouhana, Tobias Elze, Mengyu Wang},
      booktitle={International Conference on Learning Representations (ICLR)},
      year={2024},
}

About

[ICLR 2024] FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

https://ophai.hms.harvard.edu/datasets/harvard-fairseg10k/


Languages

Language:Python 99.2%Language:Shell 0.8%