kastnerkyle

Kyle Kastner's repositories

raw_voice_cleanup

Examples of cleaning up raw voices

Language:PythonBSD-3-Clause18 40

pytorch-text-vae

Language:Python16 30

hmm_tts_build

a direct repository for building and using a "simple" tts

Language:Shell2 10

ai-generated-pokemon-rudalle

Python script to preprocess images of all Pokémon to finetune ruDALL-E

Language:PythonMIT010

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookMIT010

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Language:PythonNOASSERTION010

deep-image-prior

Image restoration with neural networks but without learning.

Language:Jupyter NotebookNOASSERTION010

deform-conv-np

Language:PythonBSD-3-Clause000

descent

Toy library for neural networks in Rust using Vulkan compute shaders

Language:RustMIT010

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonMIT010

esgd

ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.

MIT000

evojax

Language:PythonApache-2.0010

glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model

Language:PythonMIT010

IIRNet

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Apache-2.0000

interspeech2022_human-evaluation

Language:HTML000

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Language:PythonNOASSERTION010

MOSA-Net-Cross-Domain

Language:Python010

ndr

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

MIT000

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

Language:Jupyter NotebookMIT010

no7_singing

CC0-1.0010

ParticleFlow_Exp

Language:Julia010

simple-equivariant-gnn

A short and easy PyTorch implementation of E(n) Equivariant Graph Neural Networks

Language:PythonApache-2.0010

Star-DGT

Code for application of star-DGT in Compressed Sensing and Speech Denoising

Language:MATLAB010

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

MIT000

umss

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Language:PythonApache-2.0010

v-diffusion-pytorch

v objective diffusion inference code for PyTorch.

Language:PythonMIT000

variable-length-piano-expansion

The official implementation of Variable-Length Piano Infilling (VLI).

Language:PythonGPL-3.0010

variational-diffwave

Language:PythonApache-2.0010

VQ-Diffusion

Official implementation of VQ-Diffusion

Language:PythonMIT010

VQ-Diffusion-1

Language:PythonMIT010