[Question] The accuracy issue in reproducing the nndetection algorithm

pengwuke opened this issue 5 months ago

Thank you very much for your work and contribution. We ran into problems reproducing the nnDetection results on the LUNA16 dataset. We first evaluated a single fold and found that the accuracy differed from what we expected. We then ran the algorithm on all folds and ensembled the predictions, but the discrepancy remained. Could we have missed a step that would explain this? Any suggestions would be greatly appreciated.

Michael Baumgartner · Answer 1 · Tue Jan 02 2024 18:19:23 GMT+0800 (China Standard Time)

Dear @pengwuke ,

I guess you are referring to reproducing the numbers in the paper.

The plots you show here are produced by nndet_eval, which (1) is based on bounding box IoU and (2) does not include ignore locations. To get LUNA-compliant results, you need to use the official LUNA evaluation script provided by the challenge organisers (as noted in the project README of nnDet ;) ). That script (1) uses a center-point criterion (the predicted center point must lie within the radius of the lesion) and (2) includes a large list of locations that are not counted as false positive predictions (please refer to their publication for more information on this). Point (2) in particular boosts performance at low numbers of false positives and will allow you to reproduce the numbers from our paper.
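
To make the difference between the two matching criteria concrete, here is a minimal sketch in plain Python. It is neither the nndet_eval code nor the official LUNA evaluation script: all function names, the box layout `(z1, y1, x1, z2, y2, x2)`, the example coordinates and the ignore-location format `(center, radius)` are assumptions made purely for illustration.

```python
import math


def box_iou_3d(a, b):
    """IoU of two axis-aligned 3D boxes given as (z1, y1, x1, z2, y2, x2)."""
    inter = 1.0
    for i in range(3):
        lo = max(a[i], b[i])
        hi = min(a[i + 3], b[i + 3])
        if hi <= lo:  # no overlap along this axis
            return 0.0
        inter *= hi - lo
    vol_a = math.prod(a[i + 3] - a[i] for i in range(3))
    vol_b = math.prod(b[i + 3] - b[i] for i in range(3))
    return inter / (vol_a + vol_b - inter)


def box_center(box):
    """Center point of a box given as (z1, y1, x1, z2, y2, x2)."""
    return tuple((box[i] + box[i + 3]) / 2 for i in range(3))


def center_hit(pred_box, center, radius):
    """True if the predicted box center lies within `radius` of `center`."""
    return math.dist(box_center(pred_box), center) <= radius


def classify_center_based(pred_box, nodule, ignore_locations):
    """Label one prediction as 'tp', 'ignored' or 'fp' (hypothetical helper).

    `nodule` and every entry of `ignore_locations` are (center, radius) pairs.
    Predictions that hit an ignore location count neither as TP nor as FP.
    """
    if center_hit(pred_box, *nodule):
        return "tp"
    if any(center_hit(pred_box, c, r) for c, r in ignore_locations):
        return "ignored"
    return "fp"


# Illustrative data only (not taken from LUNA16).
gt_box = (0, 0, 0, 10, 10, 10)
nodule = ((5.0, 5.0, 5.0), 5.0)           # annotated center and radius
ignore = [((22.0, 22.0, 23.0), 3.0)]      # one hypothetical ignore location

small_pred = (4, 4, 4, 8, 8, 8)           # small box inside the nodule
stray_pred = (20, 20, 20, 24, 24, 24)     # prediction near the ignore location

iou_small = box_iou_3d(gt_box, small_pred)                    # 0.064
label_small = classify_center_based(small_pred, nodule, ignore)
label_stray = classify_center_based(stray_pred, nodule, ignore)
label_stray_no_ignore = classify_center_based(stray_pred, nodule, [])
```

In this toy setup the small prediction has an IoU of only 0.064 with the ground-truth box, so an IoU-based matcher with a threshold above that value would treat it as a miss, while the center-point criterion counts it as a hit. The stray prediction is a false positive when no ignore list is used but is dropped from the count when the ignore location is included, which is the effect that matters most at low false-positive rates.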

Best,
Michael

github-actions · Answer 2 · Fri Feb 02 2024 08:52:38 GMT+0800 (China Standard Time)

This issue is stale because it has been open for 30 days with no activity.

github-actions · Answer 3 · Sat Feb 17 2024 08:51:29 GMT+0800 (China Standard Time)

This issue was closed because it has been inactive for 14 days since being marked as stale.