Weigao Sun's starred repositories
FasterTransformer
Transformer related optimization, including BERT, GPT
FastPointTransformer
Official source code of Fast Point Transformer, CVPR 2022
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
FlagAttention
A collection of memory efficient attention operators implemented in the Triton language.
llm-numbers
Numbers every LLM developer should know
pymobiledevice3
Pure python3 implementation for working with iDevices (iPhone, etc...).
Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
nccl-tests
NCCL Tests
DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
TclPyHyperWorks
基于hyperworks (hypermesh & hyperview) 的二次开发相关,TCL为主,部分python
matplotlib-gallery
Examples of matplotlib codes and plots
dadaptation
D-Adaptation for SGD, Adam and AdaGrad
TransnormerLLM
Official implementation of TransNormerLLM: A Faster and Better LLM
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
Transformer-Evolution-Paper
记录Transformer升级的论文笔记
Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
cs-video-courses
List of Computer Science courses with video lectures.
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
iTerm2-Color-Schemes
Over 250 terminal color schemes/themes for iTerm/iTerm2. Includes ports to Terminal, Konsole, PuTTY, Xresources, XRDB, Remmina, Termite, XFCE, Tilda, FreeBSD VT, Terminator, Kitty, MobaXterm, LXTerminal, Microsoft's Windows Terminal, Visual Studio, Alacritty
wikiextractor
A tool for extracting plain text from Wikipedia dumps