Guowei Xu (XuGW-Kevin)

XuGW-Kevin

Geek Repo

Company:Tsinghua University

Github PK Tool:Github PK Tool

Guowei Xu's repositories

DrM

DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements in sample efficiency and asymptotic performance across diverse domains.

Language:PythonLicense:MITStargazers:51Issues:2Issues:3
Language:C++License:MITStargazers:0Issues:0Issues:0

Chat-UniVi

[CVPR 2024 HighlightšŸ”„] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

SciCode

A benchmark that challenges language models to code solutions for scientific problems

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ShareGPT4Video

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0