Songlin Yang's repositories
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
flash-linear-rnn
Implementations of various linear RNN layers using PyTorch and Triton
disco-pointer
Official implementation of the ACL 2023 paper "Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span Selection"
FlagAttention
A collection of memory-efficient attention operators implemented in the Triton language.
nanokitchen
Parallel Associative Scan for Language Models
streaming-llm
Efficient Streaming Language Models with Attention Sinks
sustcsonglin.github.io
Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress
sustcsonglin_old.github.io
Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress
Academic-project-page-template
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
hyena-dna
Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
m2
Monarch Mixer
s5-pytorch
PyTorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)
SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
stack-attention
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
state-spaces
Sequence Modeling with Structured State Spaces