Amantur Amatov (amanteur)

Company: Zvuk

Location: Bishkek, Kyrgyzstan

Amantur Amatov's starred repositories

whisper-medusa

Whisper with Medusa heads

Language: Python | License: MIT | Stargazers: 455 | Issues: 0

RustPython

A Python Interpreter written in Rust

Language: Rust | License: MIT | Stargazers: 18435 | Issues: 0

mira

The MiRA (Music Replication Assessment) tool is a model-independent, open evaluation method that uses four diverse audio music similarity metrics to assess exact replication of training-set data.

Language: Python | License: AGPL-3.0 | Stargazers: 20 | Issues: 0

stemgen

Examples for ICASSP2024 paper "StemGen: A music generation model that listens"

License: MIT | Stargazers: 33 | Issues: 0

SepReformer

Official repository of SepReformer for speech separation

Language: Python | Stargazers: 51 | Issues: 0
Language: Python | License: NOASSERTION | Stargazers: 125 | Issues: 0

awesome-music

Awesome Music Projects

Stargazers: 1795 | Issues: 0

mamba.py

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Language: Python | License: MIT | Stargazers: 841 | Issues: 0
Language: Python | License: BSD-3-Clause | Stargazers: 317 | Issues: 0

Dasheng

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Language: Python | License: Apache-2.0 | Stargazers: 22 | Issues: 0

CoMoSVC

CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone

Language: Python | License: MIT | Stargazers: 117 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 17 | Issues: 0

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language: Rust | License: NOASSERTION | Stargazers: 28432 | Issues: 0

bandit-v2

Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"

Language: Python | License: Apache-2.0 | Stargazers: 14 | Issues: 0

CoverHunter

Official PyTorch implementation of CoverHunter

Language: Python | Stargazers: 23 | Issues: 0

whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.

Language: Python | License: MIT | Stargazers: 205 | Issues: 0

query-bandit

Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems

Language: Jupyter Notebook | License: MIT | Stargazers: 20 | Issues: 0

hearinganythinganywhere

Hearing Anything Anywhere Code Release

Language: Jupyter Notebook | Stargazers: 19 | Issues: 0

encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Language: Python | Stargazers: 78 | Issues: 0

SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.

Language: Python | License: MIT | Stargazers: 96 | Issues: 0

streamlit-audio-recorder

Record audio from the user's microphone in apps that are deployed to the web (via the browser Media API; React-based Streamlit custom component).

Language: TypeScript | License: MIT | Stargazers: 401 | Issues: 0

ssamba

The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Language: Python | License: BSD-3-Clause | Stargazers: 85 | Issues: 0

Audio-Mamba-AuM

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Language: Python | Stargazers: 72 | Issues: 0

soundata

Python library for downloading, loading & working with sound datasets

Language: Python | License: BSD-3-Clause | Stargazers: 307 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8351 | Issues: 0

instruct-MusicGen

The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning".

Language: Python | License: Apache-2.0 | Stargazers: 54 | Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0 | Stargazers: 1525 | Issues: 0

MusicGPT

Generate music based on natural language prompts using LLMs running locally

Language: Rust | License: MIT | Stargazers: 533 | Issues: 0

images-that-sound

Official repo for "Images that Sound": special spectrograms, generated by diffusion models, that can be viewed as images and played as sound.

Language: Python | License: MIT | Stargazers: 202 | Issues: 0

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | Stargazers: 1431 | Issues: 0