Amantur Amatov's starred repositories
whisper-medusa
Whisper with Medusa heads
RustPython
A Python Interpreter written in Rust
SepReformer
Official repository of SepReformer for speech separation
awesome-music
Awesome Music Projects
project-NN-Pytorch-scripts
see README
CoverHunter
Official PyTorch implementation of CoverHunter
whisper-finetune
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
query-bandit
Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems
hearinganythinganywhere
Hearing Anything Anywhere Code Release
encodecmae
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
streamlit-audio-recorder
Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)
Audio-Mamba-AuM
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
speechbrain
A PyTorch-based Speech Toolkit
instruct-MusicGen
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
images-that-sound
Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions
ThunderKittens
Tile primitives for speedy kernels