Beast code in Giters

MrHuangAm's starred repositories

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookMIT54400

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonApache-2.0264000

Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)

Language:PythonMIT4600

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Language:PythonApache-2.08800

Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training

Language:PythonNOASSERTION12200

公众号「宫水三叶的刷题日记」刷穿 LeetCode 系列文章源码

Apache-2.0719800

【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge

Language:Python1500

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

Language:PythonMIT11800

https://layer6ai-labs.github.io/xpool/

Language:Python10700

Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval (CVPR 2024 Highlight)"

Language:Python3200

ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model

Language:PythonBSD-3-Clause1100

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookMIT2324900

Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!

Language:Python3766000