Beast code in Giters

melodyless's repositories

arranger

An AI for Automatic Instrumentation

MIT000

audio2midi

Language:Python010

AudioFile

A simple C++ library for reading and writing audio files.

Language:C++GPL-3.0010

audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

MIT000

automatic_melody_harmonization

melody harmoniztion using orderless NADE, chord balancing and blocked Gibbs sampling

Language:Python010

Catch-A-Waveform

Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)

Language:PythonNOASSERTION010

charsiu

Charsiu: A neural phonetic aligner.

Language:Jupyter NotebookMIT000

clpcnet

Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)

Language:PythonNOASSERTION000

ConvNeXt

Code release for ConvNeXt model

Language:PythonMIT010

DeepAFx-ST

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

Language:PythonNOASSERTION000

deepperformer

Deep Performer: Score-to-audio music performance synthesis

000

denoising-historical-recordings

A two-stage U-Net for high-fidelity denoising of historical recordings

MIT000

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

CC-BY-4.0000

e2e_lfmmi

E2E system with LF-MMI; word N-gram for Mandarin

Language:Python000

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Language:PythonApache-2.0000

mctx

Monte Carlo tree search in JAX

Language:PythonApache-2.0000

MelSpecVAE

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

MIT000

MidiTok

A convenient MIDI tokenizer for Deep Learning networks, with multiple encoding strategies

MIT000

MixCycle

000

MuseMorphose

PyTorch implementation of MuseMorphose, a Transformer-based model for music style transfer.

Language:PythonMIT010

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

Language:Jupyter NotebookMIT000

RapidASR

A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.

Language:C++NOASSERTION000

RAVE

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Language:PythonMIT000

RAVE-audition

VST/AU Plugin for Auditioning RAVE Models in Real-time

GPL-3.0000

rVAD

Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

NOASSERTION000

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language:Python000

steerable-nafx

Steerable discovery of neural audio effects

Language:Jupyter NotebookApache-2.0010

Supervised-Learning-for-Multi-Zone-Sound-Field-Reproduction-under-Harsh-Environmental-Conditions

This repository provides the source code that was used to create the data for the paper "Supervised Learning for Multi Zone Sound Field Reproduction under Realistic Conditions".

Language:MATLAB010

wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Language:C++NOASSERTION000

you-only-hear-once

Language:Jupyter NotebookMIT000