liwei's repositories

DeCap

ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning

Language:Jupyter NotebookStargazers:118Issues:2Issues:9

TOPA

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

Language:PythonLicense:MITStargazers:13Issues:3Issues:1

MCL

(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning

Stargazers:10Issues:0Issues:0