Try out Embedding models and evaluate clustering

Question

Try out Embedding models and evaluate clustering

dennyabrain opened this issue 6 days ago · comments

Try out ResNet, CLIP, ViT, VideoMAE (or something you like) and use tsne (or other approaches) to evaluate clustering visually. You can do this on a jupyter notebook and show results. Use an publicly available dataset. Evaluate if any of these models can be fine tuned

Snehil Shah commented 3 days ago

Hi

Aatman Vaidya · Answer 1 · Fri Jun 14 2024 18:31:06 GMT+0800 (China Standard Time)

CLiP can give us vector embeddings of an image/video
one other dimensionality reduction method to look at could be UMAP, fingerprinting

Aatman Vaidya · Answer 2 · Fri Jun 14 2024 19:18:17 GMT+0800 (China Standard Time)

some data sources to look at

Aatman Vaidya · Answer 3 · Mon Jun 17 2024 18:17:21 GMT+0800 (China Standard Time)

create a mixed dataset of 150-200 datasets
Run Feluda Video Operator on a video dataset, reduce dimensions using t-SNE and do a visual plot - This will act as a baseline for us
Embeddings models - CliP, VideoMAE
Visual display it using t-SNE

Aatman Vaidya · Answer 4 · Mon Jun 17 2024 20:05:46 GMT+0800 (China Standard Time)

@Snehil-Shah was wondering if this could be worth exploring - Video2Vec - the approach is very old, so mostly ResNet should also perform better, but just putting it out there