MrHuangAm's repositories
pytorch-softdtw-cuda
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba
MIT000
RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
BSD-3-Clause000
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
MIT000
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
MIT000
Apache-2.0000
MIT000
X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
MIT000
vigilant-funicular
Exchange of wisdom
000