MrHuangAm's starred repositories
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
LogicStack-LeetCode
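Source code for the "Grinding Through LeetCode" article series from the WeChat official account 「宫水三叶的刷题日记」 (Gong Shui San Ye's Problem-Solving Diary)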
CVPR23-LOVEU-AQTC
[CVPRW'23] First Place Solution to the CVPR'2023 AQTC Challenge
T-MASS-text-video-retrieval
Official implementation of "Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval" (CVPR 2024 Highlight)
RTQ-MM2023
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!