Shida Wang (radarFudan)

Company: NUS

Location: Singapore

Home Page: https://radarfudan.github.io

Twitter: @SanderWangSD

Shida Wang's repositories

Awesome-state-space-models

Collection of papers on state-space models

Language: Python, License: Apache-2.0, Stargazers: 15, Issues: 0, Issues: 0

Curse-of-memory

Curse-of-memory phenomenon of RNNs in sequence modelling

Language: Jupyter Notebook, Stargazers: 12, Issues: 2, Issues: 0

Mamba_State_Space_Model_Paper_List

Paper list for State-Space-Model and its Applications

License: MIT, Stargazers: 4, Issues: 0, Issues: 0
Language: Python, Stargazers: 3, Issues: 0, Issues: 0
Language: Python, Stargazers: 3, Issues: 1, Issues: 0

annotated-mamba

Annotated version of the Mamba paper

Language: Jupyter Notebook, License: MIT, Stargazers: 2, Issues: 0, Issues: 0

flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Language: C++, License: Apache-2.0, Stargazers: 2, Issues: 0, Issues: 0
Language: Python, License: MIT, Stargazers: 2, Issues: 0, Issues: 0

attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Language: Python, License: MIT, Stargazers: 1, Issues: 0, Issues: 0

causal-conv1d

Causal depthwise conv1d in CUDA, with a PyTorch interface

Language: Cuda, License: BSD-3-Clause, Stargazers: 1, Issues: 0, Issues: 0
Language: Python, Stargazers: 1, Issues: 0, Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python, License: BSD-3-Clause, Stargazers: 1, Issues: 0, Issues: 0

google-research

Google Research

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 1, Issues: 0, Issues: 0

lightning-hydra-template

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Language: Python, Stargazers: 1, Issues: 0, Issues: 0

RWKV-CUDA

The CUDA version of the RWKV language model (https://github.com/BlinkDL/RWKV-LM)

Language: Cuda, Stargazers: 1, Issues: 0, Issues: 0
Language: Jupyter Notebook, Stargazers: 1, Issues: 0, Issues: 0
Language: Python, Stargazers: 1, Issues: 0, Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 0, Issues: 0

triton

Development repository for the Triton language and compiler

Language: Python, License: MIT, Stargazers: 1, Issues: 0, Issues: 0

gateloop-transformer

Implementation of GateLoop Transformer in PyTorch and JAX

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0
Language: Dockerfile, Stargazers: 0, Issues: 1, Issues: 0

in-context-operator-networks

ICON for in-context operator learning

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

LongMamba

Some preliminary explorations of Mamba's context scaling.

Stargazers: 0, Issues: 0, Issues: 0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

s4

Structured state space sequence models

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0
Language: Python, Stargazers: 0, Issues: 1, Issues: 0