Beast code in Giters

JiCheng's repositories

QLLM

A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.

Language:PythonApache-2.0107 9 11

XbitOps

[X] bit GEMV/DQ support for quantized LLM

Language:CudaApache-2.0200

12306cpp

12306 自动订票工具c++实现版

Language:C++Apache-2.01 30

onnxKapok

An AOT compiler for onnx model, for accelerating transformers on Mobile/Server/GPUs. One Line of code, 30% faster at most on ARM/INTEL CPU

Language:PythonApache-2.01 20

vllm-backup

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.01 10

winograd_study

a easy understand python implementation

Language:PythonApache-2.01 30

AiLearning

AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP

Language:PythonGPL-3.0000

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

020

deeplearningbook-chinese

Deep Learning Book Chinese Translation

Language:TeX030

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookApache-2.0010

machine-learning-cheat-sheet

Classical equations and diagrams in machine learning

Language:TeX030

mapreduce

C++ MapReduce Library for efficient multi-threading on single-machine

Language:C++030

neural-networks-and-deep-learning

Code samples for my book "Neural Networks and Deep Learning"

Language:Python030

onnx

Open standard for machine learning interoperability

Language:C++Apache-2.0010

EAGLE

EAGLE: Lossless Acceleration of LLM Decoding by Feature Extrapolation

Language:PythonApache-2.0010

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Language:C++MIT010

onnxruntime-extensions

The pre- and post processing library for ONNX Runtime

Language:PythonMIT000

opensift_vs2013

Language:C030

string-splitting

String splitting benchmarks

Language:C++020

Tensor2TensorPermute

Language:C++MIT000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonApache-2.0020

XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

Language:CNOASSERTION000