Chen Yang (chenyangMl)

chenyangMl

Geek Repo

Location:beijing

Twitter:@cyang8050

Github PK Tool:Github PK Tool

Chen Yang's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:136769Issues:1057Issues:7550

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49208Issues:561Issues:202

FFmpeg

Mirror of https://git.ffmpeg.org/ffmpeg.git

Language:CLicense:NOASSERTIONStargazers:44030Issues:1438Issues:0

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:43385Issues:343Issues:2579

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32272Issues:273Issues:1068

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29833Issues:190Issues:982

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:28298Issues:168Issues:416

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:27829Issues:188Issues:4393

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24710Issues:209Issues:208

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22330Issues:219Issues:125

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18381Issues:158Issues:1416

ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10580Issues:122Issues:207

xmake

🔥 A cross-platform build utility based on Lua

Language:LuaLicense:Apache-2.0Stargazers:9608Issues:141Issues:3097

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8809Issues:82Issues:36

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7049Issues:61Issues:178

glog

C++ implementation of the Google logging module

Language:C++License:BSD-3-ClauseStargazers:6933Issues:261Issues:570

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5574Issues:85Issues:130

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonLicense:BSD-3-ClauseStargazers:3108Issues:60Issues:91

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:1932Issues:29Issues:78

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1601Issues:20Issues:44

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:1287Issues:25Issues:62

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonLicense:MITStargazers:715Issues:34Issues:46

agents

Build real-time multimodal AI applications 🤖🎙️📹

Language:PythonLicense:Apache-2.0Stargazers:709Issues:25Issues:92

AnyGPT

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

llama2.c-zh

支持中文场景的的小语言模型 llama2.c-zh

keyword-spot

端到端语音唤醒工具箱,从模型训练到模型推理。

Language:PythonLicense:MITStargazers:55Issues:0Issues:0