Beast code in Giters

mamezy's starred repositories

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2038500

2021-ISMIR-MSS-Challenge-CWS-PResUNet

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Language:Python11300

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonNOASSERTION235900

song-solver

A Python application that allows users to sing in front of their laptop's microphone, processes the recording using the Whisper API, and then leverages a Large Language Model (LLM) to recognize the song.

Language:Python3700

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language:PythonApache-2.0319600

nnsvs

Neural network-based singing voice synthesis library for research

Language:PythonMIT67400

python-speech-recognition-course

Python Speech Recognition Course

Language:Python13800

lecture_dtw_notebook

Language:Jupyter Notebook7000

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonMIT801500

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT2997400

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6577300

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.04612200