JiCheng (wejoncy)

wejoncy

Geek Repo

Github PK Tool:Github PK Tool

JiCheng's repositories

QLLM

A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.

Language:PythonLicense:Apache-2.0Stargazers:107Issues:9Issues:11

XbitOps

[X] bit GEMV/DQ support for quantized LLM

Language:CudaLicense:Apache-2.0Stargazers:2Issues:0Issues:0

12306cpp

12306 自动订票工具c++实现版

Language:C++License:Apache-2.0Stargazers:1Issues:3Issues:0

onnxKapok

An AOT compiler for onnx model, for accelerating transformers on Mobile/Server/GPUs. One Line of code, 30% faster at most on ARM/INTEL CPU

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0

vllm-backup

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

winograd_study

a easy understand python implementation

Language:PythonLicense:Apache-2.0Stargazers:1Issues:3Issues:0

AiLearning

AiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers:0Issues:2Issues:0

deeplearningbook-chinese

Deep Learning Book Chinese Translation

Language:TeXStargazers:0Issues:3Issues:0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

machine-learning-cheat-sheet

Classical equations and diagrams in machine learning

Language:TeXStargazers:0Issues:3Issues:0

mapreduce

C++ MapReduce Library for efficient multi-threading on single-machine

Language:C++Stargazers:0Issues:3Issues:0

neural-networks-and-deep-learning

Code samples for my book "Neural Networks and Deep Learning"

Language:PythonStargazers:0Issues:3Issues:0

onnx

Open standard for machine learning interoperability

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

EAGLE

EAGLE: Lossless Acceleration of LLM Decoding by Feature Extrapolation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Language:C++License:MITStargazers:0Issues:1Issues:0

onnxruntime-extensions

The pre- and post processing library for ONNX Runtime

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:CStargazers:0Issues:3Issues:0

string-splitting

String splitting benchmarks

Language:C++Stargazers:0Issues:2Issues:0
Language:C++License:MITStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

XNNPACK

High-efficiency floating-point neural network inference operators for mobile, server, and Web

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0