Repositories under the linear-attention topic:
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
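The "trains in parallel like a transformer, runs as a constant-space RNN with no kv-cache" property described above is the defining trait of linear-attention models. Below is a minimal illustrative sketch (not RWKV's actual formulation) of generic causal linear attention in PyTorch: the same computation written once as a parallel cumulative sum and once as a recurrence with a fixed-size state. The feature map, shapes, and function names are assumptions for illustration only.

```python
# Minimal sketch of causal linear attention, assuming a simple positive
# feature map phi(x) = elu(x) + 1. Not RWKV itself; names are illustrative.
import torch
import torch.nn.functional as F

def phi(x):
    return F.elu(x) + 1  # keeps features positive so the denominator is safe

def linear_attention_parallel(q, k, v):
    # q, k, v: (T, d). Parallel "transformer-style" form via cumulative sums.
    q, k = phi(q), phi(k)
    kv = torch.cumsum(k.unsqueeze(-1) * v.unsqueeze(-2), dim=0)  # (T, d, d_v)
    z = torch.cumsum(k, dim=0)                                   # (T, d)
    num = torch.einsum('td,tde->te', q, kv)
    den = (q * z).sum(-1, keepdim=True).clamp(min=1e-6)
    return num / den

def linear_attention_recurrent(q, k, v):
    # Same computation as a recurrence: fixed-size state, no kv-cache.
    q, k = phi(q), phi(k)
    d, d_v = q.shape[-1], v.shape[-1]
    S = torch.zeros(d, d_v)  # running sum of outer products k_t v_t^T
    z = torch.zeros(d)       # running sum of k_t
    outs = []
    for t in range(q.shape[0]):
        S = S + k[t].unsqueeze(-1) * v[t].unsqueeze(0)
        z = z + k[t]
        outs.append((q[t] @ S) / (q[t] @ z).clamp(min=1e-6))
    return torch.stack(outs)

T, d = 8, 16
q, k, v = torch.randn(T, d), torch.randn(T, d), torch.randn(T, d)
assert torch.allclose(linear_attention_parallel(q, k, v),
                      linear_attention_recurrent(q, k, v), atol=1e-4)
```

RWKV-7 itself uses a more elaborate gated state-update rule, but the parallel/recurrent equivalence sketched here is the general mechanism behind the "linear time, constant space" claim.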
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
[NeurIPS 2024] Official code of "LION: Linear Group RNN for 3D Object Detection in Point Clouds"
Explorations into the recently proposed Taylor Series Linear Attention
Implementation of Agent Attention in PyTorch
Semantic segmentation of remote sensing images
[NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting
CUDA implementation of autoregressive linear attention, with all the latest research findings
Official implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS 2024 Oral)
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
[ICML 2024] Official implementation of "LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions."
RWKV Wiki website (archived; please visit the official wiki)
LEAP: Linear Explainable Attention in Parallel for causal language modeling with O(1) path length and O(1) inference
🔍 Enhance your workflow with Houtini LM, an MCP server that offloads code analysis and documentation tasks to LM Studio, streamlining your development process.
Taming Transformers for High-Resolution Image Synthesis
Independent and reproducible benchmarking of linear attention models
Pure PyTorch implementations of popular linear attention models