66RING

Company: Chaotic Futurism; @SJTU-IPADS

Home Page: https://66ring.github.io/


Organizations
ChaosDaily
LosersDelight

66RING's repositories

tiny-flash-attention

Flash attention tutorial written in Python, Triton, CUDA, and CUTLASS

dotfiles

My dotfiles

Language: Shell | Stargazers: 17 | Issues: 3 | Issues: 0

LongShortTokenDecoding

Long short token decoding: a 4x speedup for long-context LLMs. A hundred lines of core code. Open source for learning.

Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0

ring-attention-pytorch

A tiny ring attention implementation for learning purposes

scripts

some scripts

66RING.github.io

https://66ring.github.io/

Language: HTML | Stargazers: 2 | Issues: 3 | Issues: 0

Counting-Stars-Local

Counting-Stars scripts for evaluating local LLMs.

Language: Python | Stargazers: 2 | Issues: 0 | Issues: 0

Notes

My notes

pytorch-cuda-binding-tutorial

Tutorial for building a custom CUDA and C function for torch

Language: Python | License: MIT | Stargazers: 1 | Issues: 3 | Issues: 0
Language: Rust | Stargazers: 1 | Issues: 2 | Issues: 0

15445-bootcamp

A basic introduction to coding in modern C++.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

bufferline.nvim

A snazzy bufferline for Neovim

Language: Lua | License: Unlicense | Stargazers: 0 | Issues: 1 | Issues: 0

clash-verge

A Clash GUI based on tauri. Supports Windows, macOS and Linux.

Language: TypeScript | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

ContinuousBatching

A demo of continuous batching, which is simpler than you think.

Stargazers: 0 | Issues: 0 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

LightSeq

Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

llama-playground

Play with LLaMA

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

LLMTest_NeedleInAHaystack-Local

Run Needle In A Haystack with a local LLM. Check the Makefile.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

minitorch

The full minitorch student suite.

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

PoSE

Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

st

my st

Language: C | License: MIT | Stargazers: 0 | Issues: 3 | Issues: 0

ThunderKittens

Tile primitives for speedy kernels

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

zephyr-nvim

Customized nvimdev/zephyr-nvim

Language: Lua | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0