Patrick von Platen's repositories
Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
pytorch_diffusion
PyTorch reimplementation of Diffusion Models
k-diffusion
Karras et al. (2022) diffusion models for PyTorch
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
data2vec_vision
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
datasets-1
🤗 Fast, efficient, open-access datasets and evaluation metrics for Natural Language Processing and more in PyTorch, TensorFlow, NumPy and Pandas
ddim
Denoising Diffusion Implicit Models
diffusion
Denoising Diffusion Probabilistic Models
huggingface_hub
All the open source things related to the Hugging Face Hub.
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
markup2im
Diffusion-based markup-to-image generation
PNDM
The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (ICLR 2022) and a generic framework for DDIM-like models
pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
sample-generator
Tools to train a generative model on arbitrary audio samples
score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
Symphonia
Pure Rust multimedia format demuxing, tag reading, and audio decoding library
VQ-Diffusion
Official implementation of VQ-Diffusion