Songlin Yang (sustcsonglin)

sustcsonglin

Geek Repo

Company:MIT

Location:Cambridge

Home Page:https://sustcsonglin.github.io/

Twitter:@SonglinYang4

Github PK Tool:Github PK Tool

Songlin Yang's repositories

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:396Issues:14Issues:1

TN-PCFG

source code of NAACL2021 "PCFGs Can Do Better: Inducing Probabilistic Context-Free Grammars with Many Symbols“ and ACL2021 main conference "Neural Bilexicalized PCFG Induction"

flash-linear-rnn

Implementations of various linear RNN layers using pytorch and triton

disco-pointer

Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span Selection

Language:PythonStargazers:13Issues:0Issues:0

TN-LCFRS

Official Implementation of ACL2023: Unsupervised Discontinuous Constituency Parsing with Mildly Context-Sensitive Grammars

Language:PythonStargazers:9Issues:0Issues:0

FlagAttention

A collection of memory efficient attention operators implemented in the Triton language.

Language:PythonLicense:NOASSERTIONStargazers:2Issues:0Issues:0

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

License:Apache-2.0Stargazers:2Issues:0Issues:0
Language:CudaStargazers:1Issues:0Issues:0

mamba.py

An efficient Mamba implementation in PyTorch and MLX.

Stargazers:1Issues:0Issues:0

nanokitchen

Parallel Associative Scan for Language Models

License:Apache-2.0Stargazers:1Issues:0Issues:0

safari

Convolutions for Sequence Modeling

Language:AssemblyLicense:Apache-2.0Stargazers:1Issues:0Issues:0
License:Apache-2.0Stargazers:1Issues:0Issues:0

streaming-llm

Efficient Streaming Language Models with Attention Sinks

License:MITStargazers:1Issues:0Issues:0

sustcsonglin.github.io

:page_facing_up: Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress

Language:HTMLLicense:MITStargazers:1Issues:1Issues:0

sustcsonglin_old.github.io

:page_facing_up: Elegant & friendly homepage (bio, tech portfolio, resume, doc...) template with Markdown and VuePress

Language:HTMLStargazers:1Issues:0Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

License:Apache-2.0Stargazers:1Issues:0Issues:0

zoology

Understand and test language model architectures on synthetic tasks.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

hyena-dna

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

Language:AssemblyLicense:Apache-2.0Stargazers:0Issues:0Issues:0

m2

Monarch Mixer

Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

s5-pytorch

Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)

License:MPL-2.0Stargazers:0Issues:0Issues:0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Stargazers:0Issues:0Issues:0

stack-attention

Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"

Stargazers:0Issues:0Issues:0

state-spaces

Sequence Modeling with Structured State Spaces

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0