MrHuangAm's repositories
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Language: Jupyter Notebook | License: MIT
Language: Python | License: MIT
pytorch-softdtw-cuda
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba
Language: Python | License: MIT
RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
Language: Python | License: BSD-3-Clause
Language: Python | License: Apache-2.0
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python | License: MIT
vigilant-funicular
Exchange of wisdom
X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
Language: Python | License: MIT