Cookie's repositories
pag-tacotron2
[NOT-in-Progress] PyTorch implementation of "Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis"
podcast_rss_feeds
List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.
pngnw_bert
Unofficial PyTorch implementation of PnG BERT with some changes
VocoderComparisons
Train/test a variety of open source vocoders using the same input features and dataset. Then infer together for easy side-by-side comparisons.
paint-with-words-sd
Unofficial Implementation of Paint-with-words, method from eDiffi that let you generate image from text-labeled segmentation map.
fimfic_quote_attribution
[On Hiatus] Label FimFiction stories for AI Audiodrama generation
DiffSVC_inference_only
Contains inference code for DiffSVC unofficial reimplementation
Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices
derpy-score-predictor
Pytorch - Predict the quality of a derpibooru image given its tags and datetime.
podcast_wds
PyTorch Code to stream Webdataset Format Podcasts Dataset
2023-minimap
Collaborative /r/place 2023 template userscript
batch-whisper
Batch Support for OpenAI Whisper
cambrinary
A linux terminal online dictionary, based on cambridge dictionary: https://dictionary.cambridge.org
DeepLearningExamples
Deep Learning Examples
encodec_fast
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
h2p-parser
Heteronym to Phoneme Parser
PyPonyPixel
Pony Python bot for r/place 2023
stable-diffusion
A latent text-to-image diffusion model
stable-diffusion-webui
Stable Diffusion web UI
stable-diffusion-webui-depthmap-script
High Resolution Depth Maps for Stable Diffusion WebUI
uberduck-ml-dev
ML models for Uberduck
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)