Chaofan Lin (SiriusNEO)

SiriusNEO

Geek Repo

Company:Shanghai Jiao Tong University

Location:Shanghai, China

Home Page:chaofanlin.com

Twitter:@siriusneox

Github PK Tool:Github PK Tool

Chaofan Lin's starred repositories

cs-self-learning

计算机自学指南

Language:HTMLLicense:MITStargazers:52894Issues:312Issues:173

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:23351Issues:190Issues:196

downkyi

哔哩下载姬downkyi,哔哩哔哩网站视频下载工具,支持批量下载,支持8K、HDR、杜比视界,提供工具箱(音视频提取、去水印等)。

Language:C#License:GPL-3.0Stargazers:19992Issues:141Issues:1053

candle

Minimalist ML framework for Rust

Language:RustLicense:Apache-2.0Stargazers:14595Issues:146Issues:622

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:14563Issues:129Issues:597

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++License:MITStargazers:7666Issues:75Issues:151

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3404Issues:33Issues:1089

SJTUThesis

上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template

Language:TeXLicense:Apache-2.0Stargazers:3265Issues:54Issues:484

Awesome-GPTs

Curated list of awesome GPTs 👍.

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Language:PythonLicense:Apache-2.0Stargazers:2864Issues:30Issues:286

Checkpoint

Fast and simple homebrew save manager for 3DS and Switch.

Language:C++License:GPL-3.0Stargazers:2529Issues:135Issues:428

GodMode9

GodMode9 Explorer - A full access file browser for the Nintendo 3DS console :godmode:

Language:CLicense:GPL-3.0Stargazers:2082Issues:117Issues:647

CUDA-Learn-Notes

🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Language:CudaLicense:GPL-3.0Stargazers:891Issues:10Issues:5

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:873Issues:7Issues:8

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language:CudaLicense:Apache-2.0Stargazers:761Issues:12Issues:15

pokeyellow

Disassembly of Pokemon Yellow

pokegold

Disassembly of Pokémon Gold/Silver

Awesome-CUDA

This is a list of useful libraries and resources for CUDA development.

LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

pygmtools

A Python Graph Matching Toolkit.

Language:PythonLicense:NOASSERTIONStargazers:279Issues:4Issues:20

mirage

A multi-level tensor algebra superoptimizer

Language:C++License:Apache-2.0Stargazers:268Issues:10Issues:17

3DSident

PSPident clone for 3DS

Language:CLicense:ZlibStargazers:265Issues:24Issues:24

Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Language:PythonLicense:MITStargazers:214Issues:11Issues:18

DistServe

Disaggregated serving system for Large Language Models (LLMs).

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:188Issues:4Issues:15

vidur

A large-scale simulation framework for LLM inference

Language:PythonLicense:MITStargazers:136Issues:6Issues:11

Quest

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

ParrotServe

[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable

Language:PythonLicense:MITStargazers:66Issues:4Issues:2

preble

Stateful LLM Serving

Language:PythonLicense:Apache-2.0Stargazers:16Issues:1Issues:7