HanHui (inkinworld)

Location: Hangzhou

Home Page: www.chenhanhui.com

HanHui's starred repositories

AI-RecommenderSystem

This repository attempts to collect some of the classic algorithm models in the recommender systems field

Language: Jupyter Notebook | Stargazers: 1730 | Issues: 0

cpufp

A CPU tool for benchmarking peak floating-point performance

Language: Assembly | License: GPL-3.0 | Stargazers: 499 | Issues: 0

DeepCTR

Easy-to-use, modular, and extensible package of deep-learning-based CTR models.

Language: Python | License: Apache-2.0 | Stargazers: 7564 | Issues: 0

RecLearn

Recommender Learning with TensorFlow 2.x

Language: Python | License: MIT | Stargazers: 1857 | Issues: 0

text-embeddings-inference

A blazing-fast inference solution for text embedding models

Language: Rust | License: Apache-2.0 | Stargazers: 2779 | Issues: 0

MatrixSlow

A simple deep learning framework in pure Python, intended for learning deep learning

Language: TypeScript | Stargazers: 426 | Issues: 0

annotated-transformer

An annotated implementation of the Transformer paper.

Language: Jupyter Notebook | License: MIT | Stargazers: 5695 | Issues: 0

marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Language: Python | License: Apache-2.0 | Stargazers: 604 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 29440 | Issues: 0

Language: C++ | Stargazers: 44 | Issues: 0

KIVI

KIVI: A Tuning-Free Asymmetric 2-bit Quantization for KV Cache

Language: Python | License: MIT | Stargazers: 238 | Issues: 0

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language: Python | License: Apache-2.0 | Stargazers: 736 | Issues: 0

lo

💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...)

Language: Go | License: MIT | Stargazers: 17820 | Issues: 0

OpenHands

🙌 OpenHands: Code Less, Make More

Language: Python | License: MIT | Stargazers: 33353 | Issues: 0

CUDA-Programming-Guide-in-Chinese

A Chinese translation of the CUDA programming guide

Stargazers: 1245 | Issues: 0

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language: Python | License: Apache-2.0 | Stargazers: 5852 | Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 4551 | Issues: 0

Recommender-System

A survey of recommender systems

Stargazers: 458 | Issues: 0

map

Road book, route planning, AMap (Gaode Maps) API examples, and map information, built with Vue 3, TypeScript, and Vite

Language: Vue | Stargazers: 95 | Issues: 0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 9160 | Issues: 0

flog

🎩 A fake log generator for common log formats

Language: Go | License: MIT | Stargazers: 1113 | Issues: 0

how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Language: Cuda | Stargazers: 1555 | Issues: 0

Awesome-LLM-Inference

📖 A curated list of Awesome LLM Inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.

License: GPL-3.0 | Stargazers: 2748 | Issues: 0

The-Art-of-Linear-Algebra-zh-CN

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"; Chinese edition of "The Art of Linear Algebra". PRs welcome.

Language: PostScript | License: CC0-1.0 | Stargazers: 4485 | Issues: 0

llm-viz

3D Visualization of a GPT-style LLM

Language: TypeScript | Stargazers: 3962 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 85 | Issues: 0

CodeGeeX2

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Language: Python | License: Apache-2.0 | Stargazers: 7636 | Issues: 0

bcc

BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more

Language: C | License: Apache-2.0 | Stargazers: 20506 | Issues: 0

cherry-markdown

✨ A Markdown Editor

Language: JavaScript | License: NOASSERTION | Stargazers: 3551 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8979 | Issues: 0