Leonardo Pepino (mrpep)

Company: Laboratorio de Inteligencia Artificial Aplicada - UBA

Location: Argentina

Home Page: https://mrpep.github.io/myblog/

Twitter: @neuralsound

Organizations
habla-liaa
neurasound

Leonardo Pepino's starred repositories

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 30621 | Issues: 312 | Issues: 885

ffmpeg-python

Python bindings for FFmpeg - with complex filtering support

Language: Python | License: Apache-2.0 | Stargazers: 9641 | Issues: 114 | Issues: 697

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | Stargazers: 7883 | Issues: 150 | Issues: 530

pedalboard

🎛 🔊 A Python library for audio.

Language: C++ | License: GPL-3.0 | Stargazers: 4970 | Issues: 58 | Issues: 172

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language: Python | License: MIT | Stargazers: 4345 | Issues: 52 | Issues: 199

OpenPrompt

An Open-Source Framework for Prompt-Learning.

Language: Python | License: Apache-2.0 | Stargazers: 4231 | Issues: 42 | Issues: 254

textdistance

📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

Language: Python | License: MIT | Stargazers: 3326 | Issues: 65 | Issues: 0

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language: Python | License: Apache-2.0 | Stargazers: 2577 | Issues: 31 | Issues: 162

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language: Python | License: MIT | Stargazers: 1825 | Issues: 32 | Issues: 160

FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Language: Python | License: MIT | Stargazers: 1685 | Issues: 27 | Issues: 211

omnizart

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

Language: Python | License: MIT | Stargazers: 1593 | Issues: 25 | Issues: 75

ESC-50

ESC-50: Dataset for Environmental Sound Classification

Language: Python | License: NOASSERTION | Stargazers: 1302 | Issues: 31 | Issues: 11

RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Language: Python | License: NOASSERTION | Stargazers: 1244 | Issues: 41 | Issues: 165

performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

Language: Python | License: MIT | Stargazers: 1067 | Issues: 17 | Issues: 84

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 1056 | Issues: 18 | Issues: 131

cleanthesis

Clean Thesis is a clean, simple, and elegant LaTeX style (or template) for thesis documents.

auraloss

Collection of audio-focused loss functions in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 685 | Issues: 18 | Issues: 35

convit

Code for the Convolutional Vision Transformer (ConViT)

Language: Python | License: Apache-2.0 | Stargazers: 456 | Issues: 17 | Issues: 19

GST-Tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Language: Python | License: MIT | Stargazers: 353 | Issues: 14 | Issues: 17

lyrebird-wav2clip

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Language: Python | License: MIT | Stargazers: 318 | Issues: 11 | Issues: 13

soundata

Python library for downloading, loading & working with sound datasets

Language: Python | License: BSD-3-Clause | Stargazers: 289 | Issues: 10 | Issues: 75

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Language: Python | License: Apache-2.0 | Stargazers: 197 | Issues: 10 | Issues: 10

cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Language: Python | License: MIT | Stargazers: 182 | Issues: 22 | Issues: 14

Catch-A-Waveform

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Language: Python | License: NOASSERTION | Stargazers: 181 | Issues: 5 | Issues: 7

ltspice-guitar-pedals

A collection of LTSpice simulation files for popular guitar effects. :guitar: :electron: :musical_note: :chart_with_upwards_trend: Pull requests welcome :smiley:

Language: AGS Script | Stargazers: 105 | Issues: 9 | Issues: 0

PyTorch-Raspberry-Pi-64-OS

PyTorch installation wheels for Raspberry Pi 64 OS

simple-speaker-embedding

A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 79 | Issues: 2 | Issues: 6

spiceAmp

Non-realtime, highly realistic software guitar processor. Works with *.wav files as input and output. It uses ngspice for electric circuit simulation and an FFT convolver with an Impulse Response *.wav file for cabinet simulation.

Language: C++ | License: GPL-3.0 | Stargazers: 30 | Issues: 5 | Issues: 0

faseAlign

Command line tool for forced-alignment of Spanish speech data

Language: Python | License: MIT | Stargazers: 12 | Issues: 5 | Issues: 6

CLEAR-dataset-generation

Generation code for the CLEAR dataset (Compositional Language and Elementary Acoustic Reasoning)

Language: Python | License: NOASSERTION | Stargazers: 3 | Issues: 0 | Issues: 0