wenjiajia123's repositories

GVL

Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

cpl

CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning

Language:PythonStargazers:0Issues:0Issues:0

DCSR

[ICCV 2021 (Oral Presentation)] Dual-Camera Super-Resolution with Aligned Attention Modules (RefSR)

Language:PythonStargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

QD-DETR

Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

UniVTG

[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Language:PythonStargazers:0Issues:0Issues:0

UnLoc

UnLoc: A Unified Framework for Video Localization Tasks

License:Apache-2.0Stargazers:0Issues:0Issues:0

UVCOM

[CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection

Stargazers:0Issues:0Issues:0

WSAG

[EMNLP'22] Weakly-Supervised Temporal Article Grounding

Stargazers:0Issues:0Issues:0

YouwikiHow

YouwikiHow dataset for weakly-supervised article grounding

Stargazers:0Issues:0Issues:0