w-okada's starred repositories
whisper.cpp
Port of OpenAI's Whisper model in C/C++
so-vits-svc
SoftVC VITS Singing Voice Conversion
Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
voice-changer
リアルタイムボイスチェンジャー Realtime Voice Changer
faster-whisper
Faster Whisper transcription with CTranslate2
Bert-VITS2
vits2 backbone with multilingual-bert
star-history
The missing star history graph of GitHub repos - https://star-history.com
prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Real-Time-Latent-Consistency-Model
App showcasing multiple real-time diffusion models pipelines with Diffusers
zotero-chatgpt
ChatGPT plugin for Zotero
ConsistencyVC-voive-conversion
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
MOT-Tracking-by-Detection-Pipeline
Tracking-by-Detection形式のMOT(Multi Object Tracking)について、 DetectionとTrackingの処理を分離して寄せ集めたフレームワーク(Tracking-by-Detection method MOT(Multi Object Tracking) is a framework that separates the processing of Detection and Tracking.)
whisper-onnx-cpu
ONNX implementation of Whisper. PyTorch free.
rnnoise_python
python wrapper for rnnoise library
MB-iSTFT-VITS-44100-Ja
44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transformです。
wav_splitter
RVCで音声学習をするための便利スクリプト集
libsamplerate-js
Resample audio in node or browser using a web assembly port of libsamplerate.
crowdhuman_hollywoodhead_yolo_convert
YOLOv7 training. Generates a head-only dataset in YOLO format. The labels included in the CrowdHuman dataset are Head and FullBody, but ignore FullBody.
wasm-audio-resampler
Soxr Audio resampler built to WebAssembly for usage in NodeJS or web browser