Project Details

Question

Project Details

Rainydu184 opened this issue a year ago · comments

Hi, your work is very fascinating！
I would like to replicate your work on TCGA-Lung datasets. I noticed that there are many empty .md files in the repo. Are there any pending contents to be written? /dataset/README.md /dataset_csv/README.md
Could you please provide a brief explanation of how the features extracted through CLAM should be applied in your repo and what format the cross-validation files should be in?

Rainydu184 commented 10 months ago

Thanks!

Zihao Wu · Answer 1 · Sat Aug 19 2023 16:39:21 GMT+0800 (China Standard Time)

Hi, README.md file in /dataset and /dataset_csv just that I forgot to delete, dataloader.py file is placed in /dataset, and the .csv files related to dataset spliting are stored in /dataset_csv (I have uploaded a csv file of camelyon16 dataset as a reference).

Once you get all patch features within each WSI and store in pt file, then you could run M2_update_MIL_classifier.py and set feat_dir as the directory that save patch features, then you will get t0_primary_attn.pkl which stores attention scores of all patches in WSI, then run E_pseudo_labeling.py to generate pseudo labels and then M1_update_feat_encoder.py to optimize patch feature encoder, extract new patch features by updated feature encoder, finally M2 again to iterate.

Rainydu184 · Answer 2 · Wed Aug 23 2023 15:00:45 GMT+0800 (China Standard Time)

HI!

I think I have obtained the correct training through M2_update_MIL_classifier.py. But I got t0_primary.pth and t1_primary_attn.pkl, instead of t0_primary_attn.pkl.

I found that the E_pseudo_labeling.py file only contains the processing for "camelyon". so I made some simple modifications to line80 and line134.

if args.dset == 'camelyon' or args.dset == 'tcga_lung':

When running the E_pseudo_labeling.py, I encountered strange AUC, ACC values, and warnings.

For MIL tasks, there is a difference between TCGA_Lung and Camelyon.

In Camelyon, "normal" samples are negative cases, and "tumor" samples are positive cases, resulting in only two types of patches (negative or positive). However, in TCGA_Lung, there are no negative samples, and LUAD and LUSC are different categories of positive samples, resulting in three types of patches: negative (normal tissue in WSI), LUAD, and LUSC.

For TCGA_Lung, there should be some differences in operations compared to Camelyon. Could you please provide some suggestions?

Zihao Wu · Answer 3 · Wed Aug 23 2023 21:31:25 GMT+0800 (China Standard Time)

Hi, it is fine when you get t0_primary.pth and t1_primary attn.pkl, because t0_primary stores the model weights at t0, and t1_primary_attn stores the attention weights for pseudo labeling at t1 time.
Indeed, you noticed an important point, there is a difference between Camelyon and TCGA Lung. Because there are no "normal" samples in TCGA Lung. Therefore, we add a category called "background" in patch classification branch for typing task liked TCGA Lung. During pseudo labeling, the higher attention weights patches will be assigned to the WSI class, and the smaller will be assigned to "background".
But the WSI classification branch has no "background" category, so the patch classifier will have one more class than the WSI classifier. As you can see in the framework, one more node for the patch classifier.

Rainydu184 · Answer 4 · Wed Aug 23 2023 21:36:20 GMT+0800 (China Standard Time)

Thanks for your reply!
Is the AUC score normal? Just need to ignore warning messages?

Zihao Wu · Answer 5 · Wed Aug 23 2023 21:51:09 GMT+0800 (China Standard Time)

It seems that AUC score is abnormal, you need to debug further.

Rainydu184 · Answer 6 · Tue Sep 12 2023 16:40:57 GMT+0800 (China Standard Time)

Hi, @Zero-We :
As mentioned in your article, the allocation of pseudo-labels to patches is crucial.

I noticed that the allocation of pseudo-labels requires both the MIL Classifier and Patch Classifier. While I can easily understand how the MIL Classifier can be initialized by classifying the entire image, I'm curious about how to initialize the Patch Classifier. This will have an impact on the initial allocation of pseudo-labels.

Using random initialization may be problematic.

Zihao Wu · Answer 7 · Tue Sep 12 2023 17:09:02 GMT+0800 (China Standard Time)

Yes, you're right, pseudo labeling is important.

Actually, we just randomly initialized the Patch Classifier, because the initial pseudo-label assignment does not consider the scores obtained by Patch Classifier, but only attention weights. It means that all classifier scores are set to 1 at initial round (as you can see at Line 60-62 in E_pseudo_labeling.py). So the initialization of Patch Classifier has no effect on the initial pseudo-label assignment.

Rainydu184 · Answer 8 · Tue Sep 12 2023 17:17:28 GMT+0800 (China Standard Time)

Get it! Thanks for your reply.

Rainydu184 · Answer 9 · Thu Sep 14 2023 16:25:01 GMT+0800 (China Standard Time)

Hi Zero-We!
I'm sorry to disturb you again, but I found that the ACC for WSI classification are still poor when following the instructions in the readme.

I think I should utilize the updated weights to re-extract features between the execution of M1_update_feat_encoder.py and M2_update_MIL_classifier.py. Is that correct?

I noticed that your function "extract_feature_clean" seems to be able to perform this task. Is its functionality complete? What is the difference between using it and initializing CLAM's ResNet50 with the new weights for feature extraction?

Zihao Wu · Answer 10 · Thu Sep 14 2023 16:54:46 GMT+0800 (China Standard Time)

Yes, you should re-extract features before running "M2_update_MIL_classifier.py" again, then repeat the EM iteration.

"extract_feature_clean.py" is used to re-extract features, there is no difference between them. But I recommend using "extract_feature_clean.py", which can load patch images directly instead of extract patch from WSI every time.

Rainydu184 · Answer 11 · Fri Sep 15 2023 17:07:50 GMT+0800 (China Standard Time)

Hi @Zero-We :
Thank you very much for your patient explanation. I have successfully achieved performance very similar to that described in your article after 4 iterations on Camelyon16 dataset.

Additionally, I have identified the reason for the unsuccessful results on TCGA_lung dataset. It seems that your "E_pseudo_labeling.py" is specifically designed for pseudo-label classification on Camelyon16. Could you please share the code that is applicable to multi-class classification?

Zihao Wu · Answer 12 · Wed Sep 20 2023 10:03:33 GMT+0800 (China Standard Time)

I have updated 'E_pseudo_labeling.py'.

Rainydu184 · Answer 13 · Wed Sep 20 2023 11:09:28 GMT+0800 (China Standard Time)

Thanks for everything you've done！
I am also trying to implement it, and I found that your Multi-Class Attention here seems to be designed also only for binary classification.
According to your article, it should have a separate path_attention_head for each class. If I simply modify the number of final classes here, it can only calculate the confidence based on the attn value of the predicted class, rather than the attn value of its actual label.

Is my understanding correct? There is a significant difference between comelyon16's binary classification and other multi-instance multi-class problems.

Zihao Wu · Answer 14 · Wed Sep 20 2023 11:38:14 GMT+0800 (China Standard Time)

You have to replace with 'n_classes=n_classes' in this line for multi-class classification.

Rainydu184 · Answer 15 · Wed Sep 20 2023 11:41:38 GMT+0800 (China Standard Time)

Thanks for your prompt response.

Rainydu184 · Answer 16 · Thu Sep 21 2023 10:26:29 GMT+0800 (China Standard Time)

Hi @Zero-We :

You have to replace with 'n_classes=n_classes' in this line for multi-class classification.

I tried this modification, but it resulted in an error.
It seems that the dimension of atten(A_path) has been changed to [n_classes, num_patch], and then further transformed into [1, n_classes*num_patch].

I made some modifications to the original code as follows, so that atten can be correctly computed with feature(wsi_trans).

Then the error was propagated to the loss calculation, and I obtained the following error. The shapes of the returned results logit, y_prob, and Y_hat have been changed to [2, 2], [2, 2], and [2, 1], respectively. I think the correct shapes should be [1, 2], [1, 2], and [1, 1], respectively.

The correct model should be consistent with Fig.3 in your article. How can I implement it? Could you please provide some suggestions?

Zihao Wu · Answer 17 · Thu Sep 21 2023 11:03:15 GMT+0800 (China Standard Time)

please refer to 'models/cls_model_multi.py'