Beast code in Giters

wenjiajia123's repositories

Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos

Language:PythonMIT100

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

000

000

CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning

Language:Python000

[ICCV 2021 (Oral Presentation)] Dual-Camera Super-Resolution with Aligned Attention Modules (RefSR)

Language:Python000

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause000

000

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language:PythonMIT000

Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language:PythonNOASSERTION000

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonBSD-3-Clause000

[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Language:Python000

UnLoc: A Unified Framework for Video Localization Tasks

Apache-2.0000

[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

000

[EMNLP'22] Weakly-Supervised Temporal Article Grounding

000

YouwikiHow dataset for weakly-supervised article grounding

000