2132660698's repositories
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Chinese-LLaVA
An open-source, commercially usable multimodal model supporting bilingual (Chinese-English) vision-text dialogue.
CLIP-VG
CLIP for Visual Grounding
CogVLM
A state-of-the-art open visual language model.
COMM
PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"
EasyMocap
Make human motion capture easier.
Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
freemocap
Free Motion Capture for Everyone 💀✨
GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
Hallucination-Correction-for-MLLMs
✨✨The first work to correct hallucination in multimodal large language models.
human-motion-capture
collect papers about human motion capture
Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
Lion
Lion: Kindling Vision Intelligence within Large Language Models
LRV-Instruction
Aligning Large Multi-Modal Model with Robust Instruction Tuning
MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
MoE_demo
A Mixture-of-Experts (MoE) demo implemented in PyTorch.
Pink
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
PVIT
Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
SEEChat
A multimodal chatbot with integrated computer vision capabilities.
t2motion
Official implementation of Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions. (ICCV2023)