Jiashu's starred repositories
matplotlib-curly-brace
Plot curly brace with matplotlib
Paper-Picture-Writing-Code
MLNLP: Paper Picture Writing Code
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
GPUDB-Prefetch
Source code of our DaMoN@SIGMOD 2024 paper "How Does Software Prefetching Work on GPU Query Processing?"
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
torchtitan
A native PyTorch Library for large model training
DeepSeek-LLM
DeepSeek LLM: Let there be answers
llama_index
LlamaIndex is a data framework for your LLM applications
contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
pytorch-model-train-template
pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用