Eric Lam's starred repositories
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
llm-foundry
LLM training code for Databricks foundation models
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
instruction-tuned-sd
Code for instruction-tuning Stable Diffusion.
DESED_task
Domestic environment sound event detection task
MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
clotho-dataset
Python code for handling the Clotho dataset.
HumanMotionQA
Motion Question Answering via Modular Motion Programs
audio-captioning
Audio captioning - DCASE challenge 2023 task 6a
Interactive-Summarization
The official repo of our research work "Interactive Editing for Text Summarization".
SuperCLUEgkzw
SuperCLUE automated machine grading system for Gaokao (Chinese college entrance exam) essays