Beast code in Giters

yearnyeen ho's starred repositories

ICASSP-2024-BEAFX-using-DDSP

Github repository for the paper accepted in ICASSP 2024 : Blind estimation of audio effects using an auto-encoder approach and differentiable signal processing

Language:Jupyter Notebook1000

Rank-N-Contrast

[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression

Language:Python7100

mini_edm

Minimum implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on cifar10 and mnist

Language:Python3000

ect

Consistency Models Made Easy

Language:Python17200

DiffusionRet

[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Language:PythonApache-2.010600

MWAFM

Multi-Scale Attention for Audio Question Answering

Language:Python2400

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0737000

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language:PythonNOASSERTION42500

Hybrid-Net

Real-time audio source separation, generate lyrics, chords, beat.

Language:Python64400

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language:GoMIT7944400

gflownet

Generative Flow Networks - GFlowNet

Language:PythonApache-2.013700

Awesome-GFlowNets

A curated list of resources about generative flow networks (GFlowNets).

MIT36000

neuromancer

Pytorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.

Language:PythonNOASSERTION82500

MT3-pytorch

Unofficial implementation of MT3: Multi-Task Multitrack Music Transcription (Google Research, 2022) in pytorch

Language:Python1200

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02080300

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonMIT176400

log-wmse-audio-quality

logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.

Language:PythonApache-2.03100

AudioEditingCode

Language:Python12400

rfwave

Language:PythonMIT7800

HCL

Language:Python3700

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT217500

transformer-debugger

Language:PythonMIT398400

PartialLabelingCSL

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

Language:PythonMIT12700

FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention

Language:PythonNOASSERTION74400

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.01625100

RAG-Survey

157700

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

162900

Conditional_Diffusion_MNIST

Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.

Language:PythonMIT58600

Joint-beat-and-downbeat-estimation

Language:Python800

fld

Repository for our paper: FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning, Proceedings of the 12th International Conference on Learning Representations (ICLR)

Language:PythonNOASSERTION21500