ennotsubasa's repositories
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
audio-slicer
A simple GUI application that slices audio with silence detection
audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023
Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
bert_language
TensorFlow code and pre-trained models for BERT
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
ChatGLM2-Voice-Cloning
Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | 和喜欢的角色沉浸式对话吧:ChatGLM2+声音克隆+视频对话
clap
A full featured, fast Command Line Argument Parser for Rust
CLAP_audio_language
Contrastive Language-Audio Pretraining
fanqiang
Network
fish-speech
Brand new TTS solution
h2ogpt
Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports LLaMa2, llama.cpp, and more. Demo: https://gpt.h2o.ai/
magenta
Magenta: Music and Art Generation with Machine Intelligence
musicgen_trainer
simple trainer for musicgen/audiocraft
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
so-vits-svc
SoftVC VITS Singing Voice Conversion
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!