Weikai Huang (weikaih04)

Location: Seattle, Washington, United States

Home Page: weikaih04.github.io

Weikai Huang's repositories

shen2347.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript · License: MIT · Stargazers: 1 · Issues: 0

Ask-Anything

[CVPR 2024][VideoChatGPT] ChatGPT with video understanding, plus support for many more LMs such as miniGPT4, StableLM, and MOSS.

Language: Python · License: MIT · Stargazers: 0 · Issues: 0

Chat-UniVi

[CVPR 2024🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

LLaVA-finetune

Fine-tune LLaVA for instructverse

License: Apache-2.0 · Stargazers: 0 · Issues: 0

PandaGPT

[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All

License: Apache-2.0 · Stargazers: 0 · Issues: 0

Video-ChatGPT

"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python · License: CC-BY-4.0 · Stargazers: 0 · Issues: 0

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python · License: BSD-3-Clause · Stargazers: 0 · Issues: 0

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Language: Python · License: Apache-2.0 · Stargazers: 0 · Issues: 0