Guowei Xu's repositories
Language:C++MIT000
Chat-UniVi
[CVPR 2024 Highlightš„] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Language:PythonApache-2.0000
Language:HTML000
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:PythonApache-2.0000
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Language:PythonBSD-3-Clause000
SciCode
A benchmark that challenges language models to code solutions for scientific problems
Language:PythonApache-2.0000
ShareGPT4Video
An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Language:Python000
Language:PythonMIT000
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:PythonApache-2.0000