diegovalsesia / XMFnet

Code for "Cross-modal Learning for Image-Guided Point Cloud Shape Completion" (NeurIPS 2022)

About the evaluation results

SeaBird-Go opened this issue

Hi, thanks for your work.

I wonder what the test_list.txt referenced in this line of eval.py is. Is it the same as test_list2.txt?

Also, which data split did you use to obtain the evaluation results in Table 1 and Table 2 of your paper?

commented

Hi! The test_list.txt is the same as the test_list2.txt that I have uploaded in the dataset folder. It corresponds to the official splits we used to get the results reported in the paper. I have modified eval.py accordingly. Thanks for your comment!
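For anyone else landing here, a minimal sketch of how the split file can be wired into eval.py. `ViPCDataLoader` is the dataset class from this repo, but the import path and constructor arguments below are assumptions, so check dataset.py for the real signature.

```python
# Hedged sketch: handing the official split file to the dataloader.
# The constructor arguments shown here are assumptions, not the repo's exact API.
from torch.utils.data import DataLoader
from dataset import ViPCDataLoader  # repo's dataset module (assumed import path)

test_dataset = ViPCDataLoader(
    "dataset/test_list2.txt",            # official test split (same content as test_list.txt)
    data_path="/path/to/ShapeNetViPC",   # assumed dataset root
    status="test",
    category="plane",                    # category-specific evaluation (assumed argument)
)
test_loader = DataLoader(test_dataset, batch_size=32, shuffle=False)
```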

@Emanuele97x Sorry to comment on this closed issue, but I still have some questions about the evaluation results. I downloaded the pretrained models you provided and found that they are category-specific. When I evaluate each of them on its own category, I obtain results similar to those in your paper.

So I wonder whether XMFNet needs to be trained separately for each category to reproduce the results you presented. I am also training XMFNet on all categories, but it is too slow, so I cannot draw any conclusions yet.

commented

Hi @SeaBird-1995, the models are category-specific, like those presented in ViPC. The ShapeNet-ViPC dataset is huge, so training a single model on all categories could still work; however, it takes a lot of time.
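As a rough illustration of the two training regimes (not the repo's actual CLI), one could filter the split file by ShapeNet synset id; the file name and the one-entry-per-line "synset_id/model_id" format below are assumptions.

```python
# Hypothetical helper (not part of the repo): build a category-specific or
# all-category entry list from a split file.
PLANE_SYNSET = "02691156"  # ShapeNet synset id for 'airplane'

def filter_split(split_file, synset_id=None):
    """Return split entries, optionally restricted to a single category."""
    with open(split_file) as f:
        entries = [line.strip() for line in f if line.strip()]
    if synset_id is None:
        return entries                      # single model trained on all categories
    return [e for e in entries if e.startswith(synset_id)]

train_plane = filter_split("dataset/train_list.txt", PLANE_SYNSET)  # category-specific run
train_all   = filter_split("dataset/train_list.txt")                # all-category run
```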

@Emanuele97x, thanks for your clarification. So the evaluation results in your paper are all obtained from category-specific models?

Since I want to compare against your results, I need to confirm this.

@Emanuele97x, thanks for your work.
I want to ask a question: the viewpoint of the input image strongly affects the evaluation metrics, yet when you compute the metrics, the selection of the view image in the ViPCDataLoader is not fixed. In that case, are the quantitative results still comparable?

commented

Dear @zhanghm1995,
Thanks for your interest in this work. During evaluation the views are selected at random, so the metrics are averaged over all possible views (good and bad ones). The results in the paper are produced in this way to be consistent with previous works (e.g. ViPC).

If you use a different view-selection procedure, the results will no longer be comparable.
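To make the protocol concrete, here is a hedged sketch of what per-sample random view selection amounts to; the 24-view count and the file naming are assumptions about the ShapeNet-ViPC renderings.

```python
import random

NUM_VIEWS = 24  # assumed number of rendered views per model in ShapeNet-ViPC

def pick_random_view(rng=random):
    """Draw an independent view index for every test sample, so the metrics
    average over good and bad viewpoints alike."""
    return rng.randrange(NUM_VIEWS)

view_idx = pick_random_view()        # e.g. 0..23
image_name = f"{view_idx:02d}.png"   # assumed rendering file naming
```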

@Emanuele97x, thanks for your reply. So in the current setting the evaluation contains some randomness.
What I mean is that the view for each test sample could be randomly selected in advance and then fixed; the evaluation results would then be more directly comparable.
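One possible way to implement this suggestion (a hypothetical helper, not part of the repo) is to draw the view index once per test entry with a fixed seed and reuse that assignment across runs.

```python
import json
import random

def freeze_views(split_file, num_views=24, seed=0, out_file="fixed_test_views.json"):
    """Draw one view index per test entry with a fixed seed and save the
    assignment, so every evaluation run scores exactly the same images."""
    rng = random.Random(seed)
    with open(split_file) as f:
        entries = [line.strip() for line in f if line.strip()]
    assignment = {entry: rng.randrange(num_views) for entry in entries}
    with open(out_file, "w") as f:
        json.dump(assignment, f, indent=2)
    return assignment

# Freeze views once for the official split; the dataloader would then read
# fixed_test_views.json instead of drawing a random view per sample.
views = freeze_views("dataset/test_list2.txt")
```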

Anyway, thanks for your clarification, it helps a lot.

commented

Yes, that could be a possibility. By the way, the error bars on those values are very small: even with multiple random view selections, the results are very similar for the same model!

I got it, thanks a lot.