yearnyeen ho's starred repositories

ICASSP-2024-BEAFX-using-DDSP

GitHub repository for the paper accepted at ICASSP 2024: Blind estimation of audio effects using an auto-encoder approach and differentiable signal processing

Language: Jupyter Notebook · Stargazers: 10 · Issues: 0

Rank-N-Contrast

[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression

Language: Python · Stargazers: 71 · Issues: 0

mini_edm

Minimal implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on CIFAR-10 and MNIST

Language: Python · Stargazers: 30 · Issues: 0

ect

Consistency Models Made Easy

Language: Python · Stargazers: 172 · Issues: 0

DiffusionRet

[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Language: Python · License: Apache-2.0 · Stargazers: 106 · Issues: 0

MWAFM

Multi-Scale Attention for Audio Question Answering

Language: Python · Stargazers: 24 · Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python · License: Apache-2.0 · Stargazers: 7370 · Issues: 0

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language: Python · License: NOASSERTION · Stargazers: 425 · Issues: 0

Hybrid-Net

Real-time audio source separation, with generation of lyrics, chords, and beats.

Language: Python · Stargazers: 644 · Issues: 0

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language: Go · License: MIT · Stargazers: 79444 · Issues: 0

gflownet

Generative Flow Networks - GFlowNet

Language: Python · License: Apache-2.0 · Stargazers: 137 · Issues: 0

Awesome-GFlowNets

A curated list of resources about generative flow networks (GFlowNets).

License: MIT · Stargazers: 360 · Issues: 0

neuromancer

PyTorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.

Language: Python · License: NOASSERTION · Stargazers: 825 · Issues: 0

MT3-pytorch

Unofficial implementation of MT3: Multi-Task Multitrack Music Transcription (Google Research, 2022) in PyTorch

Language: Python · Stargazers: 12 · Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python · License: Apache-2.0 · Stargazers: 20803 · Issues: 0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language: Python · License: MIT · Stargazers: 1764 · Issues: 0

log-wmse-audio-quality

logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.

Language: Python · License: Apache-2.0 · Stargazers: 31 · Issues: 0
Language: Python · Stargazers: 124 · Issues: 0
Language: Python · License: MIT · Stargazers: 78 · Issues: 0
Language: Python · Stargazers: 37 · Issues: 0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python · License: MIT · Stargazers: 2175 · Issues: 0
Language: Python · License: MIT · Stargazers: 3984 · Issues: 0

PartialLabelingCSL

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Language: Python · License: MIT · Stargazers: 127 · Issues: 0

FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Language: Python · License: NOASSERTION · Stargazers: 744 · Issues: 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 · Stargazers: 16251 · Issues: 0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers: 1629 · Issues: 0

Conditional_Diffusion_MNIST

Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.

Language: Python · License: MIT · Stargazers: 586 · Issues: 0
Language: Python · Stargazers: 8 · Issues: 0

fld

Repository for our paper: FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning, Proceedings of the 12th International Conference on Learning Representations (ICLR)

Language: Python · License: NOASSERTION · Stargazers: 215 · Issues: 0