yearnyeen ho's starred repositories
ICASSP-2024-BEAFX-using-DDSP
GitHub repository for the paper accepted at ICASSP 2024: Blind estimation of audio effects using an auto-encoder approach and differentiable signal processing
Rank-N-Contrast
[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression
DiffusionRet
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
Hybrid-Net
Real-time audio source separation, plus generation of lyrics, chords, and beats.
Awesome-GFlowNets
A curated list of resources about generative flow networks (GFlowNets).
neuromancer
PyTorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.
MT3-pytorch
Unofficial implementation of MT3: Multi-Task Multitrack Music Transcription (Google Research, 2022) in PyTorch
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
log-wmse-audio-quality
logWMSE, an audio quality metric with support for digital silence targets. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.
PartialLabelingCSL
Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources
voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Conditional_Diffusion_MNIST
Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.