Mikasa's starred repositories
CitationMap
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
ServerlessLLM
Cost-efficient and fast multi-LLM serving.
how-to-learn-deep-learning-framework
how to learn PyTorch and OneFlow
pytorch-cppcuda-tutorial
tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)
sarathi-serve
A low-latency & high-throughput serving engine for LLMs
calculate-flops.pytorch
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
torchtitan
A native PyTorch Library for large model training
long-context-attention
Sequence Parallel Attention for Long Context LLM Model Training and Inference
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
scattermoe
Triton-based implementation of Sparse Mixture of Experts.
LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models