amanteur

Zvuk

Bishkek, Kyrgyzstan

Amantur Amatov's starred repositories

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook · License: NOASSERTION · 730000

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook · License: MIT · 9025100

audioFlux

A library for audio and music analysis and feature extraction.

Language: C · License: MIT · 211000

RTFS-Net

Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted at ICLR 2024

Language: Python · License: MIT · 3300

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language: Python · License: MIT · 72400

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.

Language: Python · License: Apache-2.0 · 110000

ssspy

A Python toolkit for sound source separation.

Language: Python · License: Apache-2.0 · 12000

flashy

Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fit. It handles checkpointing, logging, distributed training, compatibility with Dora, and more!

Language: Python · License: MIT · 9600

Triton-Puzzles

Puzzles for learning Triton

Language: Jupyter Notebook · License: Apache-2.0 · 90300

music-text-representation-pp

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]

Language: Python · 1700

DTTNet-Pytorch

An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation

Language: Python · License: Apache-2.0 · 7000

aasist

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Language: Python · License: MIT · 15200

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python · License: BSD-3-Clause · 214100

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language: Python · License: MIT · 25800

top_audio_id

Repository for audio identification with topological fingerprints

Language: Jupyter Notebook · License: MIT · 500

friture

Real-time audio visualizations (spectrum, spectrogram, etc.)

Language: Python · License: GPL-3.0 · 88800

SingFake

Official Repository for "SingFake: Singing Voice Deepfake Detection"

Language: JavaScript · License: MIT · 4500

AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Language: Python · License: MIT · 17800

espnet

End-to-End Speech Processing Toolkit

Language: Python · License: Apache-2.0 · 818600

harmonixset

The Harmonix Set: Beats, Downbeats, and Structural Annotations for Pop Music

Language: Jupyter Notebook · License: MIT · 14300

genmusic_demo_list

A list of demo websites for automatic music generation research

Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Language: Python · License: Apache-2.0 · 13900

SCNet-PyTorch

Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"

Language: Python · License: MIT · 4100

yet-another-lightning-hydra-template

Flexible and scalable template based on PyTorch Lightning + Hydra. Efficient workflow and reproducibility for rapid ML experiments.

Language: Python · 18200

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language: Python · License: MIT · 26900

audioset_tagging_cnn

Language: Python · License: MIT · 129900

hydra

Hydra is a framework for elegantly configuring complex applications

Language: Python · License: MIT · 849000

wavmark

AI-based Audio Watermarking Tool

Language: Python · License: MIT · 19900

wav2tok

Codebase for the ICLR 2023 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval"

Language: Python · License: NOASSERTION · 3000

neural-audio-fp

Language: Python · License: MIT · 17400