ChrisGao001's repositories
ChatGLM-6B
ChatGLM-6B: an open bilingual (Chinese–English) dialogue language model
ChatGLM-MNN
Pure C++ implementation of ChatGLM-6B for easy deployment.
euler
A distributed graph deep learning framework.
fastllm
A pure C++ cross-platform LLM acceleration library with Python bindings; ChatGLM-6B-class models reach 10,000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile devices.
flash_attention_inference
Performance benchmarks of the C++ interfaces of FlashAttention and FlashAttention-2 in large language model (LLM) inference scenarios.
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
ggml
Tensor library for machine learning
gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
graph-learn
An Industrial Graph Neural Network Framework
lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
LLM_Notes
Notes on large language models (LLMs).
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
MedicalGPT
MedicalGPT: training your own medical GPT model with a ChatGPT-style training pipeline, covering continued pretraining, supervised fine-tuning, reward modeling, and reinforcement learning.
MNN
MNN is a blazing-fast, lightweight deep learning framework, battle-tested in business-critical use cases at Alibaba.
nann
A flexible, high-performance framework for large-scale retrieval problems based on TensorFlow.
ncnn
ncnn is a high-performance neural network inference framework optimized for mobile platforms.
Needle
An imperative deep learning framework with customized GPU and CPU backends.
nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executables from a DNN model description.
onnxconverter-common
Common utilities for ONNX converters
pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
ppl.nn
A primitive library for neural networks.
robin-hood-hashing
Fast and memory-efficient hash table based on robin hood hashing, for C++11/14/17/20.
text-generation-inference
Large language model text generation inference.
Torch2TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
torchrec
PyTorch domain library for recommendation systems.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
transformers
🤗 Transformers: state-of-the-art machine learning for PyTorch, TensorFlow, and JAX.
tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs