thomas-yanxin

Company: @X-D-Lab @ColugoMum

Location: Beijing, China

Home Page: https://thomas-yanxin.github.io/

Twitter: @thomas_yanxin

Organizations
ColugoMum
X-D-Lab

thomas-yanxin's starred repositories

aria2

aria2 is a lightweight, multi-protocol, multi-source, cross-platform download utility operated from the command line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.

Language: C++ | License: GPL-2.0 | Stargazers: 33691 | Issues: 731 | Issues: 1819

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 24922 | Issues: 280 | Issues: 831

mlx

MLX: An array framework for Apple silicon

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Language: Python | License: MIT | Stargazers: 10256 | Issues: 151 | Issues: 152

kornia

Geometric Computer Vision Library for Spatial AI

Language: Python | License: Apache-2.0 | Stargazers: 9446 | Issues: 129 | Issues: 888

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 5964 | Issues: 46 | Issues: 554

mlx-examples

Examples in the MLX framework

Language: Python | License: MIT | Stargazers: 5049 | Issues: 55 | Issues: 349

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 3842 | Issues: 51 | Issues: 134

edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Language: Python | License: GPL-3.0 | Stargazers: 3714 | Issues: 36 | Issues: 172

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language: Python | License: Apache-2.0 | Stargazers: 1940 | Issues: 31 | Issues: 70

agentscope

Start building LLM-empowered multi-agent applications in an easier way.

Language: Python | License: Apache-2.0 | Stargazers: 1450 | Issues: 16 | Issues: 52

minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Language: Python | License: Apache-2.0 | Stargazers: 1039 | Issues: 15 | Issues: 61

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 968 | Issues: 29 | Issues: 56

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 724 | Issues: 8 | Issues: 18

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language: Python | License: Apache-2.0 | Stargazers: 641 | Issues: 7 | Issues: 32

Awesome-LLMs-Datasets

A summary of existing representative LLM text datasets.

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), with support for GPT-4v, Gemini, QwenVLPlus, 40+ HF models, and 20+ benchmarks

databonsai

Clean & curate your data with LLMs.

Language: Python | License: MIT | Stargazers: 402 | Issues: 2 | Issues: 2

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | Stargazers: 385 | Issues: 0 | Issues: 0

FuseAI

FuseLLM & FuseChat Project

Language: Python | Stargazers: 334 | Issues: 0 | Issues: 0

mergoo

A library for easily merging multiple LLM experts and efficiently training the merged LLM.

Language: Python | License: LGPL-3.0 | Stargazers: 298 | Issues: 0 | Issues: 0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language: Python | License: MIT | Stargazers: 250 | Issues: 26 | Issues: 9

LL3DA

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Language: Python | License: MIT | Stargazers: 164 | Issues: 3 | Issues: 15

tc4d

TC4D: Trajectory-Conditioned Text-to-4D Generation

Language: Python | License: Apache-2.0 | Stargazers: 131 | Issues: 5 | Issues: 4

Parameter-Efficient-MoE

Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks

Language: Python | License: Apache-2.0 | Stargazers: 109 | Issues: 0 | Issues: 0

ShieldLM

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

Language: Python | License: MIT | Stargazers: 66 | Issues: 4 | Issues: 7

UltraLink

An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Language: Python | License: MIT | Stargazers: 13 | Issues: 7 | Issues: 0