rishikksh20 / ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer


Evaluation results

ed-fish opened this issue · comments

Hi,

Thanks for your work on a PyTorch version of the paper - much appreciated!

How does this implementation compare to the results in the original paper? Specifically on the Moments in Time dataset.

Thanks,

Ed

commented

I am also interested in this topic.
If anyone could give me more information about the model parameters that might help me fix the problem, I would be grateful, because training with the default parameters always overfits.

Thanks.
Marco

I ran the model on a very small dataset (51 classes with 20 video clips per class) and the result is very strange: it always outputs the same prediction. I wonder if it would get better if I loaded pre-trained weights. I would appreciate any tips.

Thanks,
Dylan
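A quick way to narrow down the "always the same prediction" symptom is to count how many distinct classes the model actually emits over a loader. Here is a small sketch (the model/loader here are generic placeholders, not names from this repo): if it returns 1, the model has collapsed to a single output, which usually points to a too-high learning rate, an unshuffled loader, or labels that don't line up with the inputs, rather than the architecture itself.

```python
import torch

def prediction_diversity(model, loader, device="cpu"):
    """Count how many distinct classes the model predicts over a loader.

    A return value of 1 means the model has collapsed to a constant
    output; with random weights or healthy training you should see
    several distinct classes.
    """
    model.eval()
    seen = set()
    with torch.no_grad():
        for videos, _ in loader:
            logits = model(videos.to(device))
            seen.update(logits.argmax(dim=-1).tolist())
    return len(seen)
```

Running this before and after a few epochs tells you whether training is collapsing the outputs or whether they were degenerate from the start.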


I am facing the same situation as you. I still don't have any idea about it. Waiting for a reply from the authors.


I wonder whether the problem results from the code or from my too-small dataset.

I tried it with nearly 2000 videos and ran different numbers of epochs, but the accuracy never exceeds 21.09%. The strange thing is that it's the same for most of the runs; the figures don't change.

Your dataset is too small. You can try running ViViT with ViT's weights loaded for both the temporal and spatial parts.
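The usual way to do this kind of initialisation is to copy over every pretrained tensor whose name and shape match, and leave the video-specific parameters (tubelet embedding, temporal positions) randomly initialised. This is a generic sketch of that idea, not the exact key names of this repo or of any particular ViT checkpoint:

```python
import torch

def load_matching_weights(model, pretrained_state):
    """Copy every pretrained tensor whose key and shape match the model.

    Keys that exist only in the video model (e.g. temporal blocks with
    different names, tubelet embeddings) are simply skipped and stay
    randomly initialised. Returns the list of keys that were loaded.
    """
    own_state = model.state_dict()
    filtered = {
        k: v for k, v in pretrained_state.items()
        if k in own_state and v.shape == own_state[k].shape
    }
    # strict=False tolerates the keys we deliberately left out.
    model.load_state_dict(filtered, strict=False)
    return sorted(filtered)
```

If the ViT checkpoint's key names don't line up with the ViViT module names, you would first remap the prefixes (e.g. pointing the same ViT block weights at both the spatial and the temporal transformer, as the paper describes) before calling this.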

@DylanTao94 Can you share how I can do that?

Sorry mate, I'm not allowed to share my code. You can follow the steps in the ViViT paper.

Yes, this model works fine. I've tested it on a dataset of 50k videos.

@seandatasci I think I might be doing something wrong with the code. Can you help me out? My code is here

I have the same problem, and I wonder whether you have resolved it. My acc/AUC results are lower than 50%, and my dataset size is also 2000. Thank you.

Inspired by the author's implementation of ViViT, we have reimplemented TimeSformer and ViViT and released pretrained model weights on Kinetics-600, which can be found here

The model isn't learning. I trained it on 2 classes of the UCF101 dataset, with the Adam optimizer and CrossEntropyLoss.
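Before blaming the dataset, one sanity check worth running with exactly this setup (Adam + CrossEntropyLoss) is to see whether the model can overfit a single small batch. A correctly wired model should drive the loss on one fixed batch close to zero; if it can't, the bug is in the code (shapes, label alignment, learning rate), not in the dataset size. A sketch, where the 3e-4 default learning rate is my assumption rather than a value from this repo:

```python
import torch
import torch.nn as nn

def overfit_one_batch(model, videos, labels, lr=3e-4, steps=200):
    """Try to drive the loss on one fixed batch toward zero.

    Uses the same Adam + CrossEntropyLoss setup as the comment above.
    Returns the final loss; a value that stays near log(num_classes)
    means the model is not learning at all on even this trivial task.
    """
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    model.train()
    loss = None
    for _ in range(steps):
        opt.zero_grad()
        loss = loss_fn(model(videos), labels)
        loss.backward()
        opt.step()
    return loss.item()
```

With 2 classes, an untrained model sits near log(2) ≈ 0.69 loss and 50% accuracy, so a flat ~50% curve on this check points to gradients not flowing (wrong reshape, detached tensors) or an unsuitable learning rate.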