w-okada's starred repositories

whisper.cpp

Port of OpenAI's Whisper model in C/C++

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:23914Issues:173Issues:130

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:18830Issues:151Issues:1358

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonLicense:NOASSERTIONStargazers:14619Issues:103Issues:909

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonLicense:MITStargazers:8628Issues:111Issues:538

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:6795Issues:48Issues:0

star-history

The missing star history graph of GitHub repos - https://star-history.com

Language:TypeScriptLicense:MITStargazers:5807Issues:25Issues:98

prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions

Language:PythonLicense:Apache-2.0Stargazers:1817Issues:25Issues:167

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language:PythonLicense:MITStargazers:1575Issues:17Issues:58

naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Language:PythonLicense:MITStargazers:1194Issues:56Issues:28

Real-Time-Latent-Consistency-Model

App showcasing multiple real-time diffusion models pipelines with Diffusers

Language:PythonLicense:Apache-2.0Stargazers:816Issues:21Issues:35

rvc-webui

liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project

Language:PythonLicense:MITStargazers:461Issues:14Issues:67
Language:PythonLicense:MITStargazers:343Issues:9Issues:13

zotero-chatgpt

ChatGPT plugin for Zotero

Language:JavaScriptLicense:AGPL-3.0Stargazers:186Issues:7Issues:15

ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Language:PythonLicense:MITStargazers:114Issues:9Issues:26
Language:PythonLicense:MITStargazers:70Issues:5Issues:3

MOT-Tracking-by-Detection-Pipeline

Tracking-by-Detection形式のMOT(Multi Object Tracking)について、 DetectionとTrackingの処理を分離して寄せ集めたフレームワーク(Tracking-by-Detection method MOT(Multi Object Tracking) is a framework that separates the processing of Detection and Tracking.)

Language:PythonLicense:MITStargazers:62Issues:4Issues:3

whisper-onnx-cpu

ONNX implementation of Whisper. PyTorch free.

Language:PythonLicense:MITStargazers:54Issues:4Issues:0

rnnoise_python

python wrapper for rnnoise library

Language:PythonLicense:BSD-3-ClauseStargazers:44Issues:1Issues:3
Language:PythonStargazers:42Issues:1Issues:0

Latopia

Speech AI training and inference tools

Language:PythonLicense:MITStargazers:37Issues:3Issues:0

MB-iSTFT-VITS-44100-Ja

44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transformです。

Language:PythonLicense:Apache-2.0Stargazers:33Issues:2Issues:3

wav_splitter

RVCで音声学習をするための便利スクリプト集

Language:PythonStargazers:27Issues:1Issues:0

libsamplerate-js

Resample audio in node or browser using a web assembly port of libsamplerate.

Language:JavaScriptLicense:NOASSERTIONStargazers:26Issues:1Issues:11

crowdhuman_hollywoodhead_yolo_convert

YOLOv7 training. Generates a head-only dataset in YOLO format. The labels included in the CrowdHuman dataset are Head and FullBody, but ignore FullBody.

Language:PythonLicense:GPL-3.0Stargazers:26Issues:2Issues:0

wasm-audio-resampler

Soxr Audio resampler built to WebAssembly for usage in NodeJS or web browser

Language:TypeScriptLicense:MITStargazers:16Issues:3Issues:2

RVC-WebUI

Localized fork

Language:PythonLicense:MITStargazers:1Issues:0Issues:0