Kyle Kastner's repositories
raw_voice_cleanup
Examples of cleaning up raw voices
hmm_tts_build
a direct repository for building and using a "simple" tts
ai-generated-pokemon-rudalle
Python script to preprocess images of all Pokémon to finetune ruDALL-E
controllable-ncas
Code for "Goal-Guided Neural Cellular Automata: Learning to Control Self-Organising Systems"
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
deep-image-prior
Image restoration with neural networks but without learning.
descent
Toy library for neural networks in Rust using Vulkan compute shaders
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
esgd
ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
ocotillo
Performant and accurate speech recognition built on Pytorch
RWKV-LM
RWKV v2 is a RNN with transformer-level performance. It can be directly trained like a GPT transformer (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
simple-equivariant-gnn
A short and easy PyTorch implementation of E(n) Equivariant Graph Neural Networks
SynchronousGoExplore
A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
v-diffusion-pytorch
v objective diffusion inference code for PyTorch.