Luchang Li's repositories

export_llama_to_onnx

export llama to onnx

Language:PythonLicense:MITStargazers:73Issues:1Issues:14

onnxsim_large_model

simplify >2GB large onnx model

Language:PythonLicense:MITStargazers:34Issues:1Issues:8

BFGS-Optimization-for-curve-fitting

use BFGS optimization algorithm to solve problems like curve fitting

Language:C++Stargazers:17Issues:0Issues:0

android_ndk_examples

android_ndk_examples

License:MITStargazers:0Issues:2Issues:0

CppTemplateTutorial

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

decoupleQ

A quantization algorithm for LLM

License:Apache-2.0Stargazers:0Issues:0Issues:0

DeepLearningExamples

Deep Learning Examples

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

EAGLE

EAGLE: Lossless Acceleration of LLM Decoding by Feature Extrapolation

License:Apache-2.0Stargazers:0Issues:0Issues:0

EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

flash-attention-minimal

Flash Attention in ~100 lines of CUDA (forward pass only)

License:Apache-2.0Stargazers:0Issues:0Issues:0

gemmlowp

Low-precision matrix multiplication

License:Apache-2.0Stargazers:0Issues:0Issues:0

HashingDeepLearning

Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"

Language:C++License:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

kernel_tuner

Kernel Tuner

License:Apache-2.0Stargazers:0Issues:0Issues:0

llm-awq

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

License:MITStargazers:0Issues:0Issues:0

modelbox

为AI应用的开发者提供一套统一的高性能、易用的编程框架,快速基于AI全栈服务、开发跨端边云的AI行业应用。

License:Apache-2.0Stargazers:0Issues:0Issues:0

models-1

A collection of pre-trained, state-of-the-art models in the ONNX format

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

MVision

机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶

Stargazers:0Issues:0Issues:0

OpenCL-examples-1

Simple OpenCL examples for exploiting GPU computing

Stargazers:0Issues:0Issues:0

pytorch-cifar

95.47% on CIFAR10 with PyTorch

License:MITStargazers:0Issues:0Issues:0

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

License:MITStargazers:0Issues:0Issues:0

tensorflow-1

An Open Source Machine Learning Framework for Everyone

License:Apache-2.0Stargazers:0Issues:0Issues:0

torch2trt

An easy to use PyTorch to TensorRT converter

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

UGATIT-pytorch

Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

License:MITStargazers:0Issues:0Issues:0

weight_only_quant_rot

weight only quantization with rotation

Language:PythonStargazers:0Issues:0Issues:0

xla

Enabling PyTorch on Google TPU

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

YHs_Sample

Yinghan's Code Sample

License:GPL-3.0Stargazers:0Issues:0Issues:0