Giovanni (w00zie)

w00zie

Geek Repo

Location:Florence, IT

Home Page:https://w00zie.github.io/

Github PK Tool:Github PK Tool

Giovanni's starred repositories

ect

Consistency Models Made Easy

Language:PythonStargazers:187Issues:0Issues:0

a-unet

A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.

Language:PythonLicense:MITStargazers:75Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1307Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20499Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1084Issues:0Issues:0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:12136Issues:0Issues:0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonLicense:MITStargazers:2336Issues:0Issues:0

eindex

Multidimensional indexing for tensors

Language:Jupyter NotebookStargazers:107Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24815Issues:0Issues:0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonLicense:MITStargazers:1897Issues:0Issues:0
Language:PythonStargazers:155Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:131109Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3377Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66512Issues:0Issues:0

uvadlc_notebooks

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Language:Jupyter NotebookLicense:MITStargazers:2417Issues:0Issues:0

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookLicense:MITStargazers:9862Issues:0Issues:0

nn-zero-to-hero

Neural Networks: Zero to Hero

Language:Jupyter NotebookLicense:MITStargazers:11383Issues:0Issues:0

creative_ml

Creative Machine Learning course and notebook tutorials in JAX, PyTorch and Numpy

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:211Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30054Issues:0Issues:0

pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Language:PythonLicense:MITStargazers:615Issues:0Issues:0

flow_synthesizer

Universal audio synthesizer control learning with normalizing flows

Language:MaxLicense:MITStargazers:133Issues:0Issues:0

oobleck

open soundstream-ish VAE codecs for downstream neural audio synthesis

Language:PythonLicense:MITStargazers:108Issues:0Issues:0

opt_einsum

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Language:PythonLicense:MITStargazers:838Issues:0Issues:0

torchinfo

View model summaries in PyTorch!

Language:PythonLicense:MITStargazers:2462Issues:0Issues:0

panel

Panel: The powerful data exploration & web app framework for Python

Language:PythonLicense:BSD-3-ClauseStargazers:4570Issues:0Issues:0

acids_transforms

A bunch of scriptable audio transforms based on the torchaudio backend

Language:PythonLicense:GPL-3.0Stargazers:5Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2193Issues:0Issues:0

gdown

Google Drive Public File Downloader when Curl/Wget Fails

Language:PythonLicense:MITStargazers:4136Issues:0Issues:0

musiclm-pytorch

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Language:PythonLicense:MITStargazers:3112Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35861Issues:0Issues:0