DSP-8192's starred repositories

U-Net-in-PyTorch

This is an implementation of the U-Net model from the paper, U-Net: Convolutional Networks for Biomedical Image Segmentation.

Language:Jupyter NotebookStargazers:20Issues:0Issues:0

MIDI-BERT

This is the official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

Language:PythonLicense:MITStargazers:176Issues:0Issues:0

polyffusion

Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls

Language:PythonLicense:MITStargazers:68Issues:0Issues:0

symbolic-music-diffusion

Symbolic Music Generation with Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:204Issues:0Issues:0

tango

A family of diffusion models for text-to-audio generation.

Language:PythonLicense:NOASSERTIONStargazers:967Issues:0Issues:0

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License:NOASSERTIONStargazers:25972Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24279Issues:0Issues:0

openvino-plugins-ai-audacity

A set of AI-enabled effects, generators, and analyzers for Audacity®.

Language:C++License:GPL-3.0Stargazers:794Issues:0Issues:0

noisy-student-emotion-training

Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging

Language:PythonStargazers:11Issues:0Issues:0

essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings

Language:C++License:AGPL-3.0Stargazers:2773Issues:0Issues:0

FakeDonaldTrump

Fake Trump's faces generated by GAN, study only

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

multif0-estimation-polyvocals

Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"

Language:PythonLicense:MITStargazers:52Issues:0Issues:0

seld-dcase2020

Baseline method for sound event localization task of DCASE 2020 challenge

Language:PythonLicense:NOASSERTIONStargazers:52Issues:0Issues:0

seld-dcase2019

Benchmark for sound event localization task of DCASE 2019 challenge

Language:PythonLicense:NOASSERTIONStargazers:70Issues:0Issues:0

global-cs-application.github.io

欧港新CS留学项目指北

Language:HTMLLicense:NOASSERTIONStargazers:599Issues:0Issues:0

seld-dcase2023

Baseline method for sound event localization task of DCASE 2023 challenge

Language:PythonStargazers:40Issues:0Issues:0

sed-crnn

Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection winning method.

Language:PythonLicense:NOASSERTIONStargazers:184Issues:0Issues:0

Stereographic-Projection-of-Otto

通过球极投影的方式得到 otto 的多种形态

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

OttoKeyboard

使用Arduino制作的音乐(?)键盘。Related video: bilibili.com/BV1R8411o7kC

Language:CLicense:MITStargazers:38Issues:0Issues:0

SynthV_plugin_auto_loudness_by_vibrato

Auto editing loudness by vibrato (notes properties & vibrato env.) Currently only support overwrite mode.

Language:JavaScriptStargazers:7Issues:0Issues:0

synthesizer-v-r2-docs

非官方的 Synthesizer V R2 文档存储仓库

Language:JavaScriptStargazers:15Issues:0Issues:0

real-voice

Scripts for working with a real voice in Synthesizer V Studio Pro

Language:LuaLicense:MITStargazers:81Issues:0Issues:0

HUOZI

ottohzys

Language:PythonStargazers:100Issues:0Issues:0

OpenUtau

Open singing synthesis platform / Open source UTAU successor

Language:C#License:MITStargazers:1940Issues:0Issues:0

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:16808Issues:0Issues:0