mitayuming's repositories
alpaca-lora
Instruct-tune LLaMA on consumer hardware
ChatGLM-6B
ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model
ChatGLM-webui
A WebUI for ChatGLM-6B
ComfyUI
A powerful and modular stable diffusion GUI with a graph/nodes interface.
GLM
GLM (General Language Model)
llama
Inference code for LLaMA models
lora-scripts
LoRA training scripts use kohya-ss's trainer, for diffusion model.
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
so-vits-svc
SoftVC VITS Singing Voice Conversion
so-vits-svc-fork
so-vits-svc fork with REALTIME support (voice changer) and greatly improved interface.
stable-diffusion
A latent text-to-image diffusion model
stable-diffusion-webui
Stable Diffusion web UI
stable-diffusion-webui-colab
stable diffusion webui colab
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
termux-packages
A package build system for Termux.
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
VITS-Paimon
Implementation of the VITS model using Genshin Impact datasets
vocal-remover
Vocal Remover using Deep Neural Networks
whisper-vits-japanese
Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)