jeonchangbin49's starred repositories

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language: Python | License: MIT | Stargazers: 1073 | Issues: 0

s4

Structured state space sequence models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2311 | Issues: 0

llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 285 | Issues: 0

ddc_onset

Music onset detector from Dance Dance Convolution packaged as a lightweight PyTorch module

Language: Python | License: MIT | Stargazers: 31 | Issues: 0
Language: Python | License: MIT | Stargazers: 45 | Issues: 0

MusicLDM

The latent diffusion model for text-to-music generation.

Language: Python | License: NOASSERTION | Stargazers: 145 | Issues: 0

BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Language: Python | License: MIT | Stargazers: 370 | Issues: 0

RemFx

General Purpose Audio Effect Removal

Language: Python | License: Apache-2.0 | Stargazers: 89 | Issues: 0

DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

Language: C++ | License: GPL-3.0 | Stargazers: 870 | Issues: 0

sigsep-mus-db

Python parser and tools for MUSDB18 Music Separation Dataset

Language: Python | License: MIT | Stargazers: 159 | Issues: 0

AudioSep

Official implementation of "Separate Anything You Describe"

Language: Python | License: MIT | Stargazers: 1538 | Issues: 0

AudioLDM2

Text-to-Audio/Music Generation

Language: Python | License: NOASSERTION | Stargazers: 2184 | Issues: 0

finding-tori

Finding Tori: Self-supervised Learning for Analyzing Korean Folk Song (ISMIR 2023)

Language: Python | Stargazers: 10 | Issues: 0

cqt-pytorch

An invertible and differentiable implementation of the Constant-Q Transform (CQT).

Language: Python | License: MIT | Stargazers: 51 | Issues: 0

perceiver-io

A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training

Language: Python | License: Apache-2.0 | Stargazers: 424 | Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20423 | Issues: 0

moises-db

Moises Source Separation Public Dataset

Language: Python | Stargazers: 98 | Issues: 0

dr14_t.meter

Compute the DR14 of a given audio file according to the procedure described by the Pleasurize Music Foundation

Language: Python | License: GPL-3.0 | Stargazers: 123 | Issues: 0

sigsep-mus-eval

museval - source separation evaluation tools for python

Language: Python | License: MIT | Stargazers: 194 | Issues: 0

ai-audio-startups

Community list of startups working with AI in audio and music technology

License: Apache-2.0 | Stargazers: 1509 | Issues: 0

lp-music-caps

LP-MusicCaps: LLM-Based Pseudo Music Captioning (ISMIR 2023)

Language: Python | Stargazers: 263 | Issues: 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34323 | Issues: 0

WavJourney

WavJourney: Compositional Audio Creation with LLMs

Language: Python | License: NOASSERTION | Stargazers: 512 | Issues: 0

CQT_pytorch

Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters

Language: Jupyter Notebook | Stargazers: 28 | Issues: 0

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language: Python | License: MIT | Stargazers: 1895 | Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language: Python | License: MIT | Stargazers: 2303 | Issues: 0
Language: Python | Stargazers: 8 | Issues: 0

sdx23

Sound Demixing Challenge 2023

Language: Python | License: MIT | Stargazers: 69 | Issues: 0

gomin

GOMIN: Gaudio Open Mel-spectrogram Inversion Network

Language: Python | License: MIT | Stargazers: 108 | Issues: 0

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language: Python | License: Apache-2.0 | Stargazers: 3212 | Issues: 0