effusiveperiscope's repositories
so-vits-svc
so-vits-svc
ControllableTalkNet
A web app that lets you play around with TalkNet models
PPPDataset
Various tools for manipulating Pony Preservation Project related data
so-vits-svc-4.1
SoftVC VITS Singing Voice Conversion
audio_psd_matcher
python cli utility for matching psds of audio files
ddsp-singing-vocoders
Customized implementation of SawSing
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
dora
Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of experiments without losing your sanity.
HaySway
Alternate UI for Hay Say
hifi_plusplus
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
Retrieval-based-Voice-Conversion-WebUI
Use less than 10 minutes vocal to fast train an any2one voice conversion model!
simple-llama-finetuner
Simple UI for LLaMA Model Finetuning
xtts-repl
Quick REPL for XTTS