Beast code in Giters

The MiRA (Music Replication Assessment) tool is a model-independent, open evaluation method that uses four diverse audio music similarity metrics to assess exact replication of training-set data.

Language: Python | AGPL-3.0 | 2000

contriever

Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning

Language: Python | NOASSERTION | 64900

rectified-flow-pytorch

Implementation of rectified flow and some of its follow-up research / improvements in PyTorch

Language: Python | MIT | 11900

ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Language: Python | MIT | 93500

AudioTime

Language: Python | 2200

GAMA

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Language: Python | Apache-2.0 | 5900

ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

Language: Python | NOASSERTION | 11500

notebooks

Notebooks using the Hugging Face libraries 🤗

Language: Jupyter Notebook | Apache-2.0 | 352400

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language: Jupyter Notebook | Apache-2.0 | 35800

PianoMotion10M

Code release for PianoMotion10M

Language: Python | Apache-2.0 | 4800

LLM101n

LLM101n: Let's build a Storyteller

2717200

CompA

Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

Language: Python | 1100

FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Language: Python | MIT | 6700

LLM-Codec

The open source code for LLM-Codec

Language: Python | 10400

soundctm

PyTorch implementation of SoundCTM

Language: Python | MIT | 6900

m2d

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Language: Jupyter Notebook | NOASSERTION | 6100

SparsePrimingRepresentations

Public repo to document some SPR stuff

MIT | 71300

big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Language: Jupyter Notebook | Apache-2.0 | 213900

CV-VAE

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language: Jupyter Notebook | 19600

Synchformer

Efficient synchronization from sparse cues

Language: Python | MIT | 2200