Ridger Zhu's starred repositories
matmulfreellm
Implementation for MatMul-free LM.
finetune-fuyu
example showing finetuning of fuyu
efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
LinearAttentionArena
Here we will test various linear attention designs.
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Vision-RWKV
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!