sustcsonglin

followers

following

stars

MIT

Cambridge

https://sustcsonglin.github.io/

Songlin Yang's repositories

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonMIT396 14 1

TN-PCFG

source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conference "Neural Bilexicalized PCFG Induction"

Language:Python40 6 7

flash-linear-rnn

Implementations of various linear RNN layers using pytorch and triton

Language:Python33 2 1

mamba-triton

Language:Python33 1 1

gated_linear_attention_layer

Language:Python28 40

disco-pointer

Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span Selection

Language:Python1300

TN-LCFRS

Official Implementation of ACL2023: Unsupervised Discontinuous Constituency Parsing with Mildly Context-Sensitive Grammars

Language:Python900

FlagAttention

A collection of memory efficient attention operators implemented in the Triton language.

Language:PythonNOASSERTION200

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Apache-2.0200

cuda-playground

Language:Cuda100

mamba.py

An efficient Mamba implementation in PyTorch and MLX.

100

nanokitchen

Parallel Associative Scan for Language Models

Apache-2.0100

safari

Convolutions for Sequence Modeling

Language:AssemblyApache-2.0100

stk

Apache-2.0100

streaming-llm

Efficient Streaming Language Models with Attention Sinks

MIT100

sustcsonglin.github.io

:page_facing_up: Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress

Language:HTMLMIT1 10

sustcsonglin_old.github.io

:page_facing_up: Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress

Language:HTML100

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Apache-2.0100

zoology

Understand and test language model architectures on synthetic tasks.

Language:PythonApache-2.0100

Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

000

BeamTreeRecursiveCells

MIT000

cutlass-kernels

MIT000

hyena-dna

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

Language:AssemblyApache-2.0000

m2

Monarch Mixer

000

mamba

Apache-2.0000

S5

MIT000

s5-pytorch

Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)

MPL-2.0000

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

000

stack-attention

Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"

000

state-spaces

Sequence Modeling with Structured State Spaces

Language:Jupyter NotebookApache-2.0000