Kyle Kastner's repositories
raw_voice_cleanup
Examples of cleaning up raw voices
hmm_tts_build
a direct repository for building and using a "simple" tts
ai-generated-pokemon-rudalle
Python script to preprocess images of all Pokémon to finetune ruDALL-E
deep-image-prior
Image restoration with neural networks but without learning.
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
esgd
ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.
glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis model
IIRNet
Direct design of biquad filter cascades with deep learning by sampling random polynomials.
ndr
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
Neural-HMM
Neural HMMs are all you need (for high-quality attention-free TTS)
simple-equivariant-gnn
A short and easy PyTorch implementation of E(n) Equivariant Graph Neural Networks
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
v-diffusion-pytorch
v objective diffusion inference code for PyTorch.
variable-length-piano-expansion
The official implementation of Variable-Length Piano Infilling (VLI).
VQ-Diffusion
Official implementation of VQ-Diffusion