w00zie

followers

following

stars

Florence, IT

https://w00zie.github.io/

Giovanni's starred repositories

ect

Consistency Models Made Easy

Language:Python18700

a-unet

A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.

Language:PythonMIT7500

CLAP

Contrastive Language-Audio Pretraining

Language:PythonCC0-1.0130700

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2049900

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonMIT108400

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonApache-2.01213600

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonMIT233600

eindex

Multidimensional indexing for tensors

Language:Jupyter Notebook10700

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.02481500

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonMIT189700

msprior

Language:Python15500

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.013110900

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonMIT337700

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6651200

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookMIT241700

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookMIT986200

nn-zero-to-hero

Neural Networks: Zero to Hero

Language:Jupyter NotebookMIT1138300

creative_ml

Creative Machine Learning course and notebook tutorials in JAX, PyTorch and Numpy

Language:Jupyter NotebookGPL-3.021100

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT3005400

pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Language:PythonMIT61500

flow_synthesizer

Universal audio synthesizer control learning with normalizing flows

Language:MaxMIT13300

oobleck

open soundstream-ish VAE codecs for downstream neural audio synthesis

Language:PythonMIT10800

opt_einsum

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Language:PythonMIT83800

torchinfo

View model summaries in PyTorch!

Language:PythonMIT246200

panel

Panel: The powerful data exploration & web app framework for Python

Language:PythonBSD-3-Clause457000

acids_transforms

A bunch of scriptable audio transforms based on the torchaudio backend

Language:PythonGPL-3.0500

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonApache-2.0219300

gdown

Google Drive Public File Downloader when Curl/Wget Fails

Language:PythonMIT413600

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonMIT311200

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT3586100