Amantur Amatov (amanteur)

amanteur

Geek Repo

Company:Zvuk

Location:Bishkek, Kyrgyzstan

Github PK Tool:Github PK Tool

Amantur Amatov's starred repositories

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7300Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:90251Issues:0Issues:0

audioFlux

A library for audio and music analysis, feature extraction.

Language:CLicense:MITStargazers:2110Issues:0Issues:0

RTFS-Net

Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024

Language:PythonLicense:MITStargazers:33Issues:0Issues:0

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonLicense:MITStargazers:724Issues:0Issues:0

lightning-thunder

Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.

Language:PythonLicense:Apache-2.0Stargazers:1100Issues:0Issues:0

ssspy

A Python toolkit for sound source separation.

Language:PythonLicense:Apache-2.0Stargazers:120Issues:0Issues:0

flashy

Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!

Language:PythonLicense:MITStargazers:96Issues:0Issues:0

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:903Issues:0Issues:0

music-text-representation-pp

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]

Language:PythonStargazers:17Issues:0Issues:0

DTTNet-Pytorch

An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation

Language:PythonLicense:Apache-2.0Stargazers:70Issues:0Issues:0

aasist

Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"

Language:PythonLicense:MITStargazers:152Issues:0Issues:0

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:2141Issues:0Issues:0

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonLicense:MITStargazers:258Issues:0Issues:0

top_audio_id

Repository for audio identification with topological fingerprints

Language:Jupyter NotebookLicense:MITStargazers:5Issues:0Issues:0

friture

Real-time audio visualizations (spectrum, spectrogram, etc.)

Language:PythonLicense:GPL-3.0Stargazers:888Issues:0Issues:0

SingFake

Official Repository for "SingFake: Singing Voice Deepfake Detection"

Language:JavaScriptLicense:MITStargazers:45Issues:0Issues:0

AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Language:PythonLicense:MITStargazers:178Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8186Issues:0Issues:0

harmonixset

The Harmonix Set: Beats, Downbeats, and Structural Annotations for Pop Music

Language:Jupyter NotebookLicense:MITStargazers:143Issues:0Issues:0

genmusic_demo_list

a list of demo websites for automatic music generation research

Stargazers:588Issues:0Issues:0

Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:139Issues:0Issues:0

SCNet-PyTorch

Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"

Language:PythonLicense:MITStargazers:41Issues:0Issues:0

yet-another-lightning-hydra-template

Flexible and scalable template based on PyTorch Lightning + Hydra. Efficient workflow and reproducibility for rapid ML experiments.

Language:PythonStargazers:182Issues:0Issues:0

MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Language:PythonLicense:MITStargazers:269Issues:0Issues:0
Language:PythonLicense:MITStargazers:1299Issues:0Issues:0

hydra

Hydra is a framework for elegantly configuring complex applications

Language:PythonLicense:MITStargazers:8490Issues:0Issues:0

wavmark

AI-based Audio Watermarking Tool

Language:PythonLicense:MITStargazers:199Issues:0Issues:0

wav2tok

Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"

Language:PythonLicense:NOASSERTIONStargazers:30Issues:0Issues:0
Language:PythonLicense:MITStargazers:174Issues:0Issues:0