Zhan Lu's repositories
Apache-IoTDB-Client-CSharp
C# client for Apache IoTDB
config-files
A collection of my config files.
CppGuide
C/C++学习,后端开发进阶指南。
flash-attention
Fast and memory-efficient exact attention
FlashModels
Fast and easy distributed model training examples.
HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
imagepi
树莓派上基于TensorFlow Lite的图像识别
iotdb
Apache IoTDB
iotdb-client-csharp
Apache IoTDB Client for C#
Megatron-LM
Ongoing research training transformer models at scale
MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models.
Siren-fastai2
Unofficial implementation of 'Implicit Neural Representations with Periodic Activation Functions'
stablehlo
Backward compatible ML compute opset inspired by HLO/MHLO
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
torch-quiver
PyTorch Library for Fast and Easy Distributed Graph Learning
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
xla
A machine learning compiler for GPUs, CPUs, and ML accelerators