mamezy

mamezy's starred repositories

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20385 | Issues: 0

2021-ISMIR-MSS-Challenge-CWS-PResUNet

Music source separation: training, evaluation, and inference pipelines, plus the pretrained models we used for the 2021 ISMIR MDX Challenge.

Language: Python | Stargazers: 113 | Issues: 0

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language: Python | License: NOASSERTION | Stargazers: 2359 | Issues: 0

song-solver

A Python application that allows users to sing in front of their laptop's microphone, processes the recording using the Whisper API, and then leverages a Large Language Model (LLM) to recognize the song.

Language: Python | Stargazers: 37 | Issues: 0

basic-pitch

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Language: Python | License: Apache-2.0 | Stargazers: 3196 | Issues: 0

nnsvs

Neural network-based singing voice synthesis library for research

Language: Python | License: MIT | Stargazers: 674 | Issues: 0

python-speech-recognition-course

Python Speech Recognition Course

Language: Python | Stargazers: 138 | Issues: 0
Language: Jupyter Notebook | Stargazers: 70 | Issues: 0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language: Python | License: MIT | Stargazers: 8015 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 29974 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 65773 | Issues: 0

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 46122 | Issues: 0

audio-transcription-bot

Audio Transcription WhatsApp Bot using Whisper

Language: Python | License: MIT | Stargazers: 40 | Issues: 0

BasicAutoTranscriptionRepo

Basics of Pitch Estimation and Automatic Music Transcription

Language: Jupyter Notebook | Stargazers: 51 | Issues: 0

manim

Animation engine for explanatory math videos

Language: Python | License: MIT | Stargazers: 61258 | Issues: 0

crepe

CREPE: A Convolutional REpresentation for Pitch Estimation, pre-trained model (ICASSP 2018)

Language: Python | License: MIT | Stargazers: 1091 | Issues: 0

mt3

MT3: Multi-Task Multitrack Music Transcription

Language: Python | License: Apache-2.0 | Stargazers: 1383 | Issues: 0

essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings

Language: C++ | License: AGPL-3.0 | Stargazers: 2779 | Issues: 0
Language: Jupyter Notebook | Stargazers: 68 | Issues: 0

dtw-python

Python port of R's Comprehensive Dynamic Time Warp algorithms package

Language: Python | License: GPL-3.0 | Stargazers: 268 | Issues: 0