Codes for coordinate generation

Question

SamitHuang opened this issue 5 years ago · comments

Could you release the code for coordinate generation? The patch size is currently fixed at 768x768. If it changes, we need to generate new coordinates, and we also need to extract and save a new set of images, which takes a lot of storage space.

Yi Li · Answer 1 · Tue Feb 12 2019 22:11:22 GMT+0800 (China Standard Time)

@SamitHuang The code for generating the coordinates is not ready yet; it's a little messy. You can refer to #14 . But I don't think you need to generate new coordinates when you change the patch size. Can you just use the current coordinates to extract a different patch size, e.g. 512x512?
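A minimal sketch of reusing one coordinate for several patch sizes. It assumes each stored coordinate is the center of its patch, which this thread does not state explicitly, and `crop_box` is a hypothetical helper, not a function from the repository:

```python
def crop_box(center_x, center_y, size):
    """Return (left, top, right, bottom) of a size x size box centered on a point.

    Assumption: the stored coordinate is the patch center.
    """
    half = size // 2
    left = center_x - half
    top = center_y - half
    return left, top, left + size, top + size


# The same coordinate yields boxes for two different patch sizes.
print(crop_box(1000, 2000, 768))  # (616, 1616, 1384, 2384)
print(crop_box(1000, 2000, 512))  # (744, 1744, 1256, 2256)
```

If the coordinates are stored as top-left corners instead, the offset by `half` would have to be dropped.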

Samit · Answer 2 · Tue Feb 12 2019 22:28:18 GMT+0800 (China Standard Time)

But if the patch size is larger than 768, the tumor region has to be re-checked. Since representative sampling is important for model performance, I think this part of the code is critical for re-implementation. Thanks.

Usama Baig · Answer 3 · Wed Feb 13 2019 03:03:08 GMT+0800 (China Standard Time)

Google labeled a patch as tumor even if only one of its pixels is in a tumor region, so I think we can do the same.
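The "any tumor pixel" rule described above could be sketched as follows. This assumes a binary annotation mask as a NumPy array indexed `[y, x]` at the same resolution as the patch coordinates; `label_any_pixel` is a hypothetical name used only for illustration:

```python
import numpy as np


def label_any_pixel(mask, left, top, size):
    """Return 1 if any pixel of the patch lies inside the tumor mask, else 0."""
    patch = mask[top:top + size, left:left + size]
    return int(patch.any())


# Toy 8x8 mask with a single tumor pixel at (x=5, y=5).
mask = np.zeros((8, 8), dtype=bool)
mask[5, 5] = True
print(label_any_pixel(mask, 4, 4, 4))  # 1, the patch covers the tumor pixel
print(label_any_pixel(mask, 0, 0, 4))  # 0, no tumor pixel inside
```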

Yi Li · Answer 4 · Wed Feb 13 2019 11:53:56 GMT+0800 (China Standard Time)

@SamitHuang @usamabaig1 There are two concepts here: the first is the grid size, e.g. 768, and the other is the patch size within the grid, e.g. 256. Each patch within the grid is modeled through the CNN and CRF. We obtained the tumor/normal label of each patch within the grid by checking the center coordinate of that patch against the ground truth annotation mask. Therefore, for a grid of size 768 made up of 3x3 patches of size 256, the label is also a 3x3 array of 1/0. If you just want to randomly sample tumor or normal coordinates, the code is pretty much already there in the tissue mask code, as noted in #14 . But the performance won't be great without hard negative mining, and the code for hard negative mining is quite painful.
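A minimal sketch of the center-based labeling described above. It is not the repository's code: it assumes the grid coordinate is the grid center, that the annotation mask is a binary NumPy array indexed `[y, x]` at the same resolution as the coordinates, and that the whole grid lies inside the mask. `grid_labels` is a hypothetical name:

```python
import numpy as np


def grid_labels(mask, center_x, center_y, patch_size=256, grid_dim=3):
    """Label each patch in a grid_dim x grid_dim grid by the mask value at its center."""
    grid_size = patch_size * grid_dim
    left = center_x - grid_size // 2
    top = center_y - grid_size // 2
    labels = np.zeros((grid_dim, grid_dim), dtype=np.int64)
    for row in range(grid_dim):
        for col in range(grid_dim):
            # Center pixel of the patch at (row, col) within the grid.
            cx = left + col * patch_size + patch_size // 2
            cy = top + row * patch_size + patch_size // 2
            labels[row, col] = int(mask[cy, cx])
    return labels


# Toy mask whose right half (x >= 512) is tumor.
mask = np.zeros((1024, 1024), dtype=bool)
mask[:, 512:] = True
print(grid_labels(mask, 512, 512))
# [[0 1 1]
#  [0 1 1]
#  [0 1 1]]
```

Changing `patch_size` or `grid_dim` only changes which center pixels are looked up, so the same grid coordinate can be relabeled without resampling.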

Samit · Answer 5 · Wed Feb 13 2019 14:23:02 GMT+0800 (China Standard Time)

Thanks, I understand the concepts. Can you describe the hard negative mining procedure? Do you exhaustively apply a previously trained CNN to each training WSI to find all false negative tissue patches, then add them to the training set?

Yi Li · Answer 6 · Fri Feb 15 2019 06:41:12 GMT+0800 (China Standard Time)

@SamitHuang Yes, I first trained a CNN on purely randomly sampled normal/tumor patches. I then applied this CNN to each tumor WSI in the training set to find all false negative tissue patches, and added them to the training set.
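The mining step in this two-stage procedure could be sketched roughly as below. This is an illustration under assumptions, not the author's implementation: it presumes the first-stage CNN has already produced a tumor probability for every tissue patch, and the function name, the 0.5 threshold, and the array layout are all hypothetical. It collects every patch the first-stage model gets wrong; filter on `labels` beforehand to keep only one type of error.

```python
import numpy as np


def mine_misclassified(probs, labels, coords, threshold=0.5):
    """Return coordinates of patches whose thresholded prediction disagrees with the label.

    probs:  tumor probability per patch from the first-stage model
    labels: ground truth per patch (1 = tumor, 0 = normal)
    coords: (x, y) coordinate per patch
    """
    probs = np.asarray(probs)
    labels = np.asarray(labels)
    coords = np.asarray(coords)
    preds = (probs >= threshold).astype(int)
    wrong = preds != labels
    return coords[wrong]


probs = [0.9, 0.2, 0.7, 0.1]
labels = [1, 1, 0, 0]
coords = [[0, 0], [256, 0], [0, 256], [256, 256]]
hard = mine_misclassified(probs, labels, coords)
print(hard.tolist())  # [[256, 0], [0, 256]]
```

The returned coordinates would then be appended to the randomly sampled training coordinates before retraining.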