effusiveperiscope's repositories
so-vits-svc
so-vits-svc
ControllableTalkNet
A web app that lets you play around with TalkNet models
PPPDataset
Various tools for manipulating Pony Preservation Project related data
so-vits-svc-4.1
SoftVC VITS Singing Voice Conversion
arpaji
Translates romaji into ARPAbet (poorly)
arpaji_talknet
arpaji, but for TalkNet
audio_psd_matcher
Python CLI utility for matching the PSDs (power spectral densities) of audio files
ddsp-singing-vocoders
Customized implementation of SawSing
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
dora
Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of experiments without losing your sanity.
HaySway
Alternate UI for Hay Say
hifi_plusplus
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (ICASSP 2023)
MATE403_Project_1
Cubic infinite potential well
NeMo
NeMo: a toolkit for conversational AI
ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
repple
Quick and dirty fdisk-like REPLs
Retrieval-based-Voice-Conversion-WebUI
Quickly train an any-to-one voice conversion model with less than 10 minutes of vocal data!
simple-llama-finetuner
Simple UI for LLaMA Model Finetuning
xtts-repl
Quick REPL for XTTS