Lei Zhang's repositories
Prometheus-Kudu-Exporter
High-availably monitor for Apache Kudu by Prometheus
magicdevil-example
The examples and use cases about Kafka, Spark, Flink, SpringBoot, etc. 组件及框架使用实例
English-Writing
Enhance Your English Writing
Active-Learning-as-a-Service
A scalable & efficient active learning/data selection system for everyone.
alpa
Training and serving large-scale neural networks with auto parallelization.
Awesome-System-for-Machine-Learning
A curated list of research in machine learning systems (MLSys). Paper notes are also provided.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
applied-ai
Applied AI experiments and examples for PyTorch
awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
datahub-helm
Repository of helm charts for deploying DataHub on a Kubernetes cluster
Federated-Lifelong-Person-ReID
Spatial-Temporal Federated Learning for Lifelong Person Re-identification on Distributed Edges. (FedSTIL)
InternLM
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system.
LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
magicdevilzhang.github.io
Ryan's blog for study & personal resources review. 技术栈文档
MLE-agent
MLE-Agent is designed to be a pair agent for machine learning engineers or researchers
OpenLLaMA2
DeepSpeed+Ray based LLaMA2 SFT/RLHF training framework
openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
ring-flash-attention
Ring attention implementation with flash attention
ST-ReID-Datasets
Person re-identification datasets across time & space for federated continual learning.
TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
TorchScene
Scene recognition tool based on pytorch. Provide training, test and deployment functions, as well as many pretrained models.
torchtitan
A native PyTorch Library for large model training
triton-2.2.0-fp8
Development repository for the Triton language and compiler
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs