wm901115nwpu's starred repositories
generative-models
Generative Models by Stability AI
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
text-generation-inference
Large Language Model Text Generation Inference
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
alignment-handbook
Robust recipes to align language models with human and AI preferences
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Qwen-Agent
Agent framework and applications built upon Qwen1.5, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
tensorrtllm_backend
The Triton TensorRT-LLM Backend
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
ring-flash-attention
Ring attention implementation with flash attention
LLM_MultiAgents_Survey_Papers
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
self-speculative-decoding
Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
flash-linear-rnn
Implementations of various linear RNN layers using pytorch and triton