thomas-yanxin

Company:@X-D-Lab @ColugoMum

Location:Beijing, China

Home Page:https://thomas-yanxin.github.io/

Twitter:@thomas_yanxin


Organizations
ColugoMum
X-D-Lab

thomas-yanxin's starred repositories

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 25709 | Issues: 286 | Issues: 892

mlx

MLX: An array framework for Apple silicon

SillyTavern

LLM Frontend for Power Users.

Language: JavaScript | License: AGPL-3.0 | Stargazers: 6146 | Issues: 51 | Issues: 1228

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 3945 | Issues: 54 | Issues: 145

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language: Python | License: GPL-3.0 | Stargazers: 3763 | Issues: 37 | Issues: 175

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model with performance approaching GPT-4V

Language: Jupyter Notebook | License: MIT | Stargazers: 2334 | Issues: 31 | Issues: 149

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language: Python | License: Apache-2.0 | Stargazers: 2098 | Issues: 32 | Issues: 73

fish-speech

Brand new TTS solution

Language: Python | License: BSD-3-Clause | Stargazers: 1792 | Issues: 32 | Issues: 134

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language: Python | License: MIT | Stargazers: 1046 | Issues: 23 | Issues: 21

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 986 | Issues: 26 | Issues: 68

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language: Python | License: NOASSERTION | Stargazers: 888 | Issues: 64 | Issues: 117

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language: Python | License: Apache-2.0 | Stargazers: 647 | Issues: 7 | Issues: 32

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language: Python | License: NOASSERTION | Stargazers: 598 | Issues: 3 | Issues: 53

Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 40+ HF models, and 20+ benchmarks

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | Stargazers: 420 | Issues: 0 | Issues: 0

databonsai

clean & curate your data with LLMs.

Language: Python | License: MIT | Stargazers: 406 | Issues: 2 | Issues: 2

RoleLLM-public

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

FuseAI

FuseLLM & FuseChat Project

Language: Python | Stargazers: 339 | Issues: 0 | Issues: 0

mergoo

A library for easily merging multiple LLM experts and efficiently training the merged LLM.

Language: Python | License: LGPL-3.0 | Stargazers: 310 | Issues: 5 | Issues: 8

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language: Python | License: MIT | Stargazers: 252 | Issues: 26 | Issues: 12

LL3DA

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Language: Python | License: MIT | Stargazers: 167 | Issues: 3 | Issues: 15

tc4d

TC4D: Trajectory-Conditioned Text-to-4D Generation

Language: Python | License: Apache-2.0 | Stargazers: 134 | Issues: 5 | Issues: 4

ShieldLM

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Language: Python | License: MIT | Stargazers: 71 | Issues: 4 | Issues: 7

multilingual-safety-for-LLMs

[ICLR 2024] Data for "Multilingual Jailbreak Challenges in Large Language Models"

License: MIT | Stargazers: 42 | Issues: 7 | Issues: 0

UltraLink

An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Language: Python | License: MIT | Stargazers: 13 | Issues: 7 | Issues: 0