2132660698's repositories

Stargazers:0Issues:0Issues:0

autogen

Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

Chinese-LLaVA

支持中英文双语视觉-文本对话的开源可商用多模态模型。

License:Apache-2.0Stargazers:0Issues:0Issues:0

CLIP-VG

CLIP for Visual Grounding

License:Apache-2.0Stargazers:0Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model

Stargazers:0Issues:0Issues:0

COMM

Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

License:MITStargazers:0Issues:0Issues:0

EasyMocap

Make human motion capture easier.

License:NOASSERTIONStargazers:0Issues:0Issues:0

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"

License:MITStargazers:0Issues:0Issues:0

freemocap

Free Motion Capture for Everyone 💀✨

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

GPT-4V-Act

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Hallucination-Correction-for-MLLMs

✨✨The first work to correct hallucination in multimodal large language models.

Stargazers:0Issues:0Issues:0

human-motion-capture

collect papers about human motion capture

Stargazers:0Issues:0Issues:0

Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Lion

Lion: Kindling Vision Intelligence within Large Language Models

Stargazers:0Issues:0Issues:0

LRV-Instruction

Aligning Large Multi-Modal Model with Robust Instruction Tuning

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

MoE_demo

A MoE_demo using pytorch

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Pink

Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

PVIT

Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

SEEChat

Multimodal chatbot with computer vision capabilities integrated

License:Apache-2.0Stargazers:0Issues:0Issues:0

t2motion

Official implementation of Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions. (ICCV2023)

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0