radarFudan

Shida Wang's repositories

Awesome-state-space-models

Collection of papers on state-space models

428 12 1

mamba-minimal-jax

Language:Python25 2 1

mamba

Language:PythonApache-2.01500

Curse-of-memory

Curse-of-memory phenomenon of RNNs in sequence modelling

Language:Jupyter Notebook12 20

Mamba_State_Space_Model_Paper_List

Paper list for State-Space-Model and it's Applications

MIT400

profiling-cuda-in-torch

Language:Python300

radarFudan.github.io

Language:HTML3 10

StableSSM

Language:Python3 10

annotated-mamba

Annotated version of the Mamba paper

Language:Jupyter NotebookMIT200

flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Language:C++Apache-2.0200

S5

Language:PythonMIT200

attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Language:PythonMIT100

causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Language:CudaBSD-3-Clause100

EffHDC

Language:Python100

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause100

google-research

Google Research

Language:Jupyter NotebookApache-2.0100

lightning-hydra-template

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Language:Python100

radarFudan

1 10

RWKV-CUDA

The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )

Language:Cuda100

SSM_examples

Language:Jupyter Notebook100

t5-pegasus-pytorch

Language:Python100

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0100

triton

Development repository for the Triton language and compiler

Language:PythonMIT100

gateloop-transformer

Implementation of GateLoop Transformer in Pytorch and Jax

Language:PythonMIT000

hasee

Language:Dockerfile010

in-context-operator-networks

ICON for in-context operator learning

Language:PythonMIT000

LongMamba

Some preliminary explorations of Mamba's context scaling.

000

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0000

s4

Structured state space sequence models

Language:Jupyter NotebookApache-2.0000

S5_StableSSM

Language:Python010