patrickvonplaten

Patrick von Platen's repositories

Wav2Vec2_PyCTCDecode

Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode

Language:Python107 5 4

AdvancedAutomaticSpeechRecognition

Language:Jupyter Notebook1001

metaseq

Repo for external large-scale work

Language:PythonMIT900

t5-mtf-to-hf-converter

Language:Python7 30

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

Language:PythonApache-2.0400

blog

Public repo for HF blog posts

Language:Jupyter Notebook300

pytorch_diffusion

PyTorch reimplementation of Diffusion Models

Language:Python200

TRexGameRL

Language:JavaScript2041

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language:PythonMIT100

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonNOASSERTION100

seq2seq-speech

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Language:Jupyter Notebook100

data2vec_vision

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonMIT000

datasets-1

🤗 Fast, efficient, open-access datasets and evaluation metrics for Natural Language Processing and more in PyTorch, TensorFlow, NumPy and Pandas

Language:PythonApache-2.0000

ddim

Denoising Diffusion Implicit Models

Language:PythonMIT000

diffusion

Denoising Diffusion Probabilistic Models

Language:Python000

huggingface_hub

All the open source things related to the Hugging Face Hub.

Language:PythonApache-2.0000

icefall

Language:PythonNOASSERTION000

imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Language:PythonMIT000

kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Language:ShellNOASSERTION000

karlo

Language:PythonNOASSERTION000

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookMIT000

longt5

Language:PythonApache-2.0000

markup2im

Diffusion-based markup-to-image generation

Language:PythonMIT000

PNDM

The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (ICLR 2022) and a generic framework for DDIM-like models

Language:PythonApache-2.0000

pyctcdecode

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Language:PythonApache-2.0000

sample-generator

Tools to train a generative model on arbitrary audio samples

Language:Jupyter NotebookMIT000

score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Language:Jupyter NotebookApache-2.0000

Symphonia

Pure Rust multimedia format demuxing, tag reading, and audio decoding library

Language:RustMPL-2.0000

Versatile-Diffusion

Language:PythonMIT000

VQ-Diffusion

Official implementation of VQ-Diffusion

Language:PythonMIT000