BaaBaa's starred repositories
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
gpt_academic
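Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.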
Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
Megatron-LM
Ongoing research training transformer models at scale
coder-kung-fu
Honing the core fundamentals of software development
bob-plugin-openai-translator
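A Bob plugin for text translation, text polishing, and grammar correction based on the ChatGPT API. Let's welcome a new era that no longer needs a Tower of Babel! Licensed under CC BY-NC-SA 4.0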
k8s-device-plugin
NVIDIA device plugin for Kubernetes
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
clearml-agent
ClearML Agent - ML-Ops made easy. ML-Ops scheduler & orchestration solution
nccl-rdma-sharp-plugins
RDMA and SHARP plugins for the NCCL library
awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
csconferences
Major CS conference publication stats (including accepted and submitted) by year.
kubernetes-scheduler-simulator
Kubernetes Scheduler Simulator
brainstorm
Compiler for Dynamic Neural Networks
dear_pytorch
[ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining