Chen Jianming's repositories

hpc

Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )

Language:C++License:Apache-2.0Stargazers:61Issues:2Issues:0

deeplearning-paper-notes

Reading notes on deep learning papers---深度学习论文阅读笔记 (2013-2018)

Language:HTMLLicense:MITStargazers:39Issues:4Issues:0

dlex-cnn

DIY - A deep learning framework

Language:C++License:MITStargazers:9Issues:2Issues:0

patterns

A collection of architectural patterns and design patterns.

Language:C++License:Apache-2.0Stargazers:4Issues:3Issues:0

ai-infra-notes

Reading notes on the open source code of AI infrastructure (sglang, llm, cutlass, hpc, etc.)

Stargazers:2Issues:0Issues:0

ecas

ECAS is a library for edge AI computing acceleration.

Language:C++License:MITStargazers:2Issues:2Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:1Issues:0Issues:0

lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

mxnet

A fork of apache/incubator-mxnet.

Language:C++License:Apache-2.0Stargazers:1Issues:2Issues:0

pocket-ai

A Portable Toolkit for deploying Edge AI and HPC (opencl, vulkan, simd, task scheduling)

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0
Stargazers:0Issues:2Issues:0

cpy

Notes on calling each other between C and python.

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

tensorflow

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

License:Apache-2.0Stargazers:0Issues:0Issues:0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tflite_micro

Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0