Yezhiqiu's starred repositories

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | Stargazers: 45306 | Issues: 338 | Issues: 3718

MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Language: Python | License: MIT | Stargazers: 43578 | Issues: 895 | Issues: 626

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 32453 | Issues: 197 | Issues: 1188

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 30737 | Issues: 195 | Issues: 4779

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 16982 | Issues: 105 | Issues: 1104

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language: Python | License: NOASSERTION | Stargazers: 13607 | Issues: 115 | Issues: 923

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 13049 | Issues: 93 | Issues: 16

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 11829 | Issues: 99 | Issues: 506

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10752 | Issues: 138 | Issues: 347

InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | License: Apache-2.0 | Stargazers: 10741 | Issues: 125 | Issues: 217

open_clip

An open source implementation of CLIP.

Language: Python | License: NOASSERTION | Stargazers: 9778 | Issues: 77 | Issues: 465

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

llama-fs

A self-organizing file system with llama 3

Language: Jupyter Notebook | License: MIT | Stargazers: 4786 | Issues: 34 | Issues: 44
Language: Python | License: Apache-2.0 | Stargazers: 4058 | Issues: 52 | Issues: 117

HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language: Python | License: NOASSERTION | Stargazers: 3275 | Issues: 40 | Issues: 163

awesome-hand-pose-estimation

Awesome work on hand pose estimation/tracking

EVA

EVA Series: Visual Representation Fantasies from BAAI

Language: Python | License: MIT | Stargazers: 2209 | Issues: 30 | Issues: 157
Language: Python | License: NOASSERTION | Stargazers: 1906 | Issues: 91 | Issues: 38

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language: Jupyter Notebook | Stargazers: 1559 | Issues: 21 | Issues: 36

awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

License: MIT | Stargazers: 1425 | Issues: 65 | Issues: 0

PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Language: Python | License: Apache-2.0 | Stargazers: 1251 | Issues: 37 | Issues: 48

Gaussian-Head-Avatar

[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"

Language: Python | License: NOASSERTION | Stargazers: 742 | Issues: 60 | Issues: 39

Uni-TTS

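This project aims to unify the way different speech synthesis engines are used. It supports adapters for multiple speech synthesis engines and can be used directly as a module or started as a backend service.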

Language: Python | License: MIT | Stargazers: 621 | Issues: 8 | Issues: 31

AnyV2V

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"

Language: Jupyter Notebook | License: MIT | Stargazers: 449 | Issues: 17 | Issues: 10

HairFastGAN

Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"

Language: Python | License: MIT | Stargazers: 405 | Issues: 9 | Issues: 17

VideoSwap

Code for [CVPR 2024] VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 235 | Issues: 11 | Issues: 11

simpleHand

This is the project page for the paper "A Simple Baseline for Efficient Hand Mesh Reconstruction" (CVPR 2024)

Language: Python | License: MIT | Stargazers: 55 | Issues: 3 | Issues: 11

GaussianHair

A novel explicit hair representation. It enables comprehensive modeling of hair geometry and appearance from images, fostering innovative illumination effects and dynamic animation capabilities.