bongjun

Tame the Web MIDI API. Send and receive MIDI messages with ease. Control instruments with user-friendly functions (playNote, sendPitchBend, etc.). React to MIDI input with simple event listeners (noteon, pitchbend, controlchange, etc.).

Language: JavaScript | License: Apache-2.0 | 153400

s4

Structured state space sequence models

Language: Jupyter Notebook | License: Apache-2.0 | 238700

ismir-2021-tutorial-case-studies

Code for the ISMIR 2021 tutorial "Programming MIR Baselines from Scratch: Three Case Studies"

Language: Jupyter Notebook | 2900

torchsynth

A GPU-optional modular synthesizer in PyTorch, 16200x faster than realtime, for audio ML researchers.

Language: Python | License: Apache-2.0 | 32600

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python | License: Apache-2.0 | 1166100

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | 3165400

evidential-deep-learning

Learn fast, scalable, and calibrated measures of uncertainty using neural networks!

Language: Python | License: Apache-2.0 | 42900

deepbayes-2019

Practical assignments of the Deep|Bayes summer school 2019

Language: Jupyter Notebook | 82700

expVAE

Visually Explainable VAE

Language: Python | 6100

DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch

Language: Python | License: MIT | 555900

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Language: Python | License: NOASSERTION | 1077800

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | 572500

dominate

Dominate is a Python library for creating and manipulating HTML documents using an elegant DOM API. It allows you to write HTML pages in pure Python very concisely, which eliminates the need to learn another template language and lets you take advantage of the more powerful features of Python.

Language: Python | License: LGPL-3.0 | 170000

meyda

Audio feature extraction for JavaScript.

Language: TypeScript | License: MIT | 145500

tensor-sensor

The goal of this library is to generate more helpful exception messages for matrix algebra expressions for numpy, pytorch, jax, tensorflow, keras, fastai.

Language: Jupyter Notebook | License: MIT | 77100

ml-surveys

📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, etc.

License: MIT | 281600

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language: Python | License: CC-BY-4.0 | 107500

uncertainty-toolbox

Uncertainty Toolbox: a Python toolbox for predictive uncertainty quantification, calibration, metrics, and visualization

Language: Python | License: MIT | 180000

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model working on the raw waveform that runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly to the raw waveform, which further improve model performance and generalization.

Language: Python | License: NOASSERTION | 164800

Bongjun Kim's starred repositories

paow

encodec

whisper

abjad

handcalcs

intro_dgm

Working-with-the-Web-Audio-API

webmidi

s4