hudengjun's repositories
WorkAccelerate
Methods for speeding up day-to-day work
easyprofiler
Core library extracted from easy_profiler
ChatGLM-Tuning
An affordable ChatGPT-style implementation based on ChatGLM-6B + LoRA
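A minimal sketch of how this kind of LoRA tuning is typically wired up with 🤗 PEFT; the hyperparameters and target modules below are illustrative assumptions, not taken from the repo:

```python
# Illustrative LoRA setup with Hugging Face PEFT (not the repo's exact code).
from transformers import AutoModel
from peft import LoraConfig, get_peft_model

model = AutoModel.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
config = LoraConfig(
    r=8,                                 # low-rank adapter dimension
    lora_alpha=32,                       # scaling factor
    target_modules=["query_key_value"],  # ChatGLM fuses Q/K/V into one projection
    lora_dropout=0.1,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()       # only the small adapter matrices train
```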
cmake-init
The missing CMake project initializer
Colab_notebooks
Google Colab notebooks for learning
DeepSpeedExamples
Example models using DeepSpeed
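For context, a hedged sketch of the basic DeepSpeed training setup these examples build on; the tiny stand-in model and config values are hypothetical:

```python
# Hypothetical minimal DeepSpeed setup; meant to run under the `deepspeed` launcher.
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)       # stand-in for a real model
ds_config = {
    "train_micro_batch_size_per_gpu": 8,
    "optimizer": {"type": "AdamW", "params": {"lr": 1e-4}},
    "zero_optimization": {"stage": 2},    # shard optimizer states and gradients
}
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
x = torch.randn(8, 1024, device=engine.device)
loss = engine(x).pow(2).mean()            # dummy objective
engine.backward(loss)
engine.step()
```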
EnergonAI
Large-scale model inference.
FasterTransformer
Transformer-related optimizations, including BERT and GPT
kubernetes-cloud
Getting Started with the CoreWeave Kubernetes GPU Cloud
Learn-Vim
Learning Vim and Vimscript doesn't have to be hard. This is the guide that you're looking for.
lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
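A hedged sketch of querying a locally running lightllm API server; the endpoint and parameter names follow the project's README-style usage and are assumptions here:

```python
# POST a prompt to a lightllm server assumed to be listening on localhost:8000.
import requests

resp = requests.post(
    "http://localhost:8000/generate",
    json={"inputs": "What does ZeRO stage 2 shard?",
          "parameters": {"max_new_tokens": 64}},
)
print(resp.json())
```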
llama-deepspeed
Train LLaMA-30B on a single A100 80GB node using 🤗 Transformers and 🚀 DeepSpeed pipeline parallelism
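A hypothetical sketch of DeepSpeed pipeline parallelism, the technique this repo relies on; a real LLaMA model would be flattened into embedding, decoder-layer, and LM-head stages, and the stand-in layers below are purely illustrative:

```python
# Run under the `deepspeed` launcher so the distributed backend is initialized.
import torch.nn as nn
from deepspeed.pipe import PipelineModule

layers = [nn.Linear(4096, 4096) for _ in range(32)]   # stand-in decoder layers
model = PipelineModule(layers=layers, num_stages=4)   # split across 4 pipeline stages
```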
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
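A hedged sketch of LMDeploy's high-level pipeline API; the model name is illustrative:

```python
from lmdeploy import pipeline

pipe = pipeline("internlm/internlm2-chat-7b")
responses = pipe(["Summarize pipeline parallelism in one sentence."])
print(responses[0].text)
```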
nginx_cmake
CMake build files for nginx, enabling quick builds plus clangd-based code navigation and debugging in Vim
nvim-lspconfig
Quickstart configurations for the Nvim LSP client
simple_vim
A minimal Vim config for use inside online Docker containers
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
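A hedged sketch of the high-level Python LLM API found in recent TensorRT-LLM releases; the model name is illustrative and the API surface varies between versions:

```python
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")  # builds/loads a TensorRT engine
params = SamplingParams(max_tokens=64, temperature=0.8)
for out in llm.generate(["What does a TensorRT engine contain?"], params):
    print(out.outputs[0].text)
```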
thread-pool
A C++17 thread pool for high-performance scientific computing.
vcpkg_libs
Useful vcpkg ports for libraries not yet included in upstream vcpkg.