崔文耀's starred repositories
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Awesome-state-space-models
Collection of papers on state-space models
streamlit-echarts
A Streamlit component to render ECharts.
lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
accelerated-scan
Accelerated First Order Parallel Associative Scan
LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"
hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
mamba-mini
An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivation. It is probably the code which is the most close to selective_scan_cuda in mamba.
resonance_rope
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.