yearnyeen ho's starred repositories
generative-models
Generative Models by Stability AI
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
benchmark_VAE
Unifying Variational Autoencoder (VAE) implementations in PyTorch (NeurIPS 2022)
score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
visualization-curriculum
A data visualization curriculum of interactive notebooks.
SparsePrimingRepresentations
Public repo to document some SPR stuff
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
SpeechTokenizer
Code for the SpeechTokenizer presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
images-that-sound
Official repo for "Images that Sound": spectrograms generated by diffusion models that can be viewed as images and played as sound
MIDI-LLM-tokenizer
Tools for converting .mid files into text for training large language models
When-in-Rome
A meta-corpus of, and code library for, the functional harmonic analysis of music
micro-musicgen
a new family of super small music generation models focusing on experimental music and latent space exploration capabilities
musical-word-embedding
Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]
Synchformer
Efficient synchronization from sparse cues
efficient-speech-codec
A lightweight efficient audio codec in 30MB with 30~170x compression ratio. Supports 16kHz mono speech audio.