ridgerchu

followers

following

stars

University of California, Santa Cruz

https://ruijie-zhu.github.io

Ridger Zhu's starred repositories

BAdam

Language:PythonApache-2.018500

SpectraSuite

2700

triton

Development repository for the Triton language and compiler

Language:C++MIT1234100

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonApache-2.0283800

DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Language:PythonNOASSERTION52800

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:PythonApache-2.0258000

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonNOASSERTION2215300

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language:GoMIT8659300

finetune-fuyu

example showing finetuning of fuyu

Language:Python1000

SAD

End-to-End Autonomous Driving with Spiking Neural Networks

Language:PythonApache-2.03800

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language:PythonMIT28000

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonMIT377300

LinearAttentionArena

Here we will test various linear attention designs.

Language:PythonApache-2.05300

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型，实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonApache-2.0313300

PFA

Official repository for paper: Tensor Decomposition Based Attention Module for Spiking Neural Networks

Language:Python700

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT2291800

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonMIT109400

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language:PythonApache-2.0125900

rwkv

RWKV model implementation

Language:PythonMIT3800

HGRN

[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Sequence Modeling

Language:Python5900

CANConv

Content-Adaptive Non-Local Convolution for Remote Sensing Pansharpening

Language:PythonGPL-3.03100

hgru-pytorch

Language:Python2400

SFOD

This is the official implementation of the 'SFOD: Spiking Fusion Object Detection'.

Language:PythonMIT1800

Vision-RWKV

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Language:PythonApache-2.031100

ripunzip

Language:RustNOASSERTION14400

MS-ResNet

Advancing Spiking Neural Networks towards Deep Residual Learning

Language:Python3500

ST-P3

[ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.

Language:PythonApache-2.029800

bullet

bullet: A Zero-Shot / Few-Shot Learning, LLM Based, text classification framework

Language:Jupyter NotebookApache-2.0500

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0251400

RWKV-infctx-trainer

RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!

Language:Jupyter NotebookApache-2.012900