Leo Tian (beimingxinghai)

beimingxinghai

Geek Repo

Location:China

Github PK Tool:Github PK Tool

Leo Tian's starred repositories

pytest-xprocess

pytest external process plugin

Language:PythonLicense:MITStargazers:97Issues:0Issues:0

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:3657Issues:0Issues:0

mem0

The memory layer for Personalized AI

Language:PythonLicense:Apache-2.0Stargazers:18401Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1057Issues:0Issues:0

RouteLLM

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Language:PythonLicense:Apache-2.0Stargazers:2457Issues:0Issues:0

circuitbreaker

Python "Circuit Breaker" implementation

Language:PythonLicense:NOASSERTIONStargazers:443Issues:0Issues:0

pybreaker

Python implementation of the Circuit Breaker pattern.

Language:PythonLicense:BSD-3-ClauseStargazers:503Issues:0Issues:0

punica

Serving multiple LoRA finetuned LLM as one

Language:PythonLicense:Apache-2.0Stargazers:906Issues:0Issues:0

Docker_training_with_DockerMe

The tools and sample needed to learn the Docker

Language:HTMLLicense:Apache-2.0Stargazers:484Issues:0Issues:0

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language:PythonLicense:Apache-2.0Stargazers:4368Issues:0Issues:0

dcgm-exporter

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

Language:GoLicense:Apache-2.0Stargazers:773Issues:0Issues:0

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language:PythonLicense:MITStargazers:1513Issues:0Issues:0
Language:CudaLicense:MITStargazers:38Issues:0Issues:0

python_ebook

收集了一些Python相关资料

Language:HTMLStargazers:494Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3605Issues:0Issues:0

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language:CudaLicense:Apache-2.0Stargazers:907Issues:0Issues:0

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Language:PythonLicense:Apache-2.0Stargazers:488Issues:0Issues:0

kiss-translator

A simple, open source bilingual translation extension & Greasemonkey script (一个简约、开源的 双语对照翻译扩展 & 油猴脚本)

Language:JavaScriptLicense:GPL-3.0Stargazers:2533Issues:0Issues:0

kernel_tuner

Kernel Tuner

Language:PythonLicense:Apache-2.0Stargazers:262Issues:0Issues:0

pytest-xdist

pytest plugin for distributed testing and loop-on-failures testing modes.

Language:PythonLicense:MITStargazers:1414Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:3170Issues:0Issues:0

CSView

CSView是一个互联网面试知识学习和汇总项目,包括面试高频算法、系统设计、计算机网络、操作系统、C++、Java、golang、MySQL、Redis、K8s、消息队列等常见面试题。

Language:TypeScriptStargazers:428Issues:0Issues:0

k8s-books

2021年最新linux运维面试题,k8s面试题,kubernetes面试题,Linux运维面试题,K8s视频教程,Docker面试题,kubernetes视频,等资料收集分享

Stargazers:30Issues:0Issues:0

k8s_awesome_document

【2021年新鲜出炉】K8s(Kubernetes)的工程师资料合辑,书籍推荐,面试题,精选文章,开源项目,PPT,视频,大厂资料

Stargazers:1334Issues:0Issues:0

llmperf

LLMPerf is a library for validating and benchmarking LLMs

Language:PythonLicense:Apache-2.0Stargazers:503Issues:0Issues:0
License:Apache-2.0Stargazers:408Issues:0Issues:0

LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Language:PythonStargazers:442Issues:0Issues:0

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonLicense:MITStargazers:3318Issues:0Issues:0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:17966Issues:0Issues:0

lsd

The next gen ls command

Language:RustLicense:Apache-2.0Stargazers:12924Issues:0Issues:0