Leonardo Pepino (mrpep)

mrpep

Geek Repo

Company:Laboratorio de Inteligencia Artificial Aplicada - UBA

Location:Argentina

Home Page:https://mrpep.github.io/myblog/

Twitter:@neuralsound

Github PK Tool:Github PK Tool


Organizations
habla-liaa
neurasound

Leonardo Pepino's starred repositories

VisionMamba

Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory when performing batch inference to extract features on high-res images

Language:PythonLicense:MITStargazers:292Issues:0Issues:0

qedr

Quantitative evaluation of disentangled representations

Language:Jupyter NotebookLicense:MITStargazers:59Issues:0Issues:0

conformal-predictions-from-scratch

Various Conformal Prediction methods implemented from scratch in pure NumPy for an educational purpose.

Language:Jupyter NotebookStargazers:173Issues:0Issues:0

frechet-audio-distance

A lightweight library for Frechet Audio Distance calculation.

Language:PythonLicense:MITStargazers:212Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11304Issues:0Issues:0

transformer-contributions

Measuring the Mixing of Contextual Information in the Transformer

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:23Issues:0Issues:0

Fermat-distance

We propose a density-based estimator for weighted geodesic distances suitable for data lying on a manifold of lower dimension than ambient space and sampled from a possibly nonuniform distribution

Language:PythonStargazers:14Issues:0Issues:0

vulkan

The ultimate Python binding for Vulkan API

Language:C++License:Apache-2.0Stargazers:489Issues:0Issues:0

wir2wav

a simple tool for the conversion of .wir impulse response files into standard PCM .wav files

Language:PythonLicense:MITStargazers:38Issues:0Issues:0

plla-tisvs

Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

Lyrics-to-Audio-Alignment

Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structures and then uses hidden markov models to obtain alignment within segments. The final alignment is concatenation of time stamps of lyrics within the segments for each song.

Language:PythonStargazers:85Issues:0Issues:0

msaf

Music Structure Analysis Framework

Language:PythonLicense:MITStargazers:477Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:11688Issues:0Issues:0

IPET

Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS

Language:PythonLicense:MITStargazers:10Issues:0Issues:0

m2d

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:53Issues:0Issues:0

TorchPQ

Approximate nearest neighbor search with product quantization on GPU in pytorch and cuda

Language:CudaLicense:MITStargazers:204Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38253Issues:0Issues:0

libri-light

dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.

Language:PythonLicense:MITStargazers:459Issues:0Issues:0

layerwise-analysis

Layer-wise analysis of self-supervised pre-trained speech representations

Language:PythonStargazers:82Issues:0Issues:0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonLicense:MITStargazers:3285Issues:0Issues:0

xmanager

A platform for managing machine learning experiments

Language:PythonLicense:Apache-2.0Stargazers:803Issues:0Issues:0

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language:PythonLicense:BSD-3-ClauseStargazers:351Issues:0Issues:0

rockstar

The Rockstar programming language specification

Language:JavaScriptLicense:MITStargazers:6868Issues:0Issues:0

pytorch-dann

A PyTorch implementation for Unsupervised Domain Adaptation by Backpropagation

Language:Jupyter NotebookLicense:MITStargazers:143Issues:0Issues:0

listening-test

An open source platform for browser based speech and audio subjective quality tests.

Language:TypeScriptLicense:MITStargazers:30Issues:0Issues:0

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language:PythonLicense:Apache-2.0Stargazers:2365Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33288Issues:0Issues:0

FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Language:PythonLicense:MITStargazers:515Issues:0Issues:0

NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Language:PythonLicense:MITStargazers:611Issues:0Issues:0

rotary-embedding-torch

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Language:PythonLicense:MITStargazers:453Issues:0Issues:0