Rongjiehuang's starred repositories
ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
llama-recipes
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
stable-audio-tools
Generative models for conditional audio generation
consistencydecoder
Consistency Distilled Diff VAE
gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
Persona-Dialogue-Generation
The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"
lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.