WanXing Wang's starred repositories
data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
datatunerx
Large language model fine-tuning capabilities based on cloud native and distributed computing.
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Ray-Forward
Some resources about Ray Forward Meetup
ColossalAI
Making large AI models cheaper, faster and more accessible
tmux-config
Tmux configuration, that supercharges your tmux to build cozy and cool terminal environment
flashlight
A C++ standalone library for machine learning
CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
GraphScope
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统