LuGY's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
the-algorithm
Source code for Twitter's Recommendation Algorithm
EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
paper-reading
深度学习经典、新论文逐段精读
Modern-CPP-Programming
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
awesome-llm-understanding-mechanism
awesome papers in LLM interpretability
awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
Megatron-Kwai
[USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Parallelism
VGC-Damage-Calculator-Chinese
Personal fork of the VGC damage calculator at trainer tower