Jianbang Yang's starred repositories
ext-saladict
🥗 All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching.
Paddle-Lite
PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
Awesome-LLM-Learning
Learning Large Language Model (LLM)(大语言模型学习)
ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
grouped-query-attention-pytorch
(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)
Megatron-Energon
Megatron's multi-modal data loader
TorchProfiling
在module level分析模型的性能
PaddleAPEX
PaddleAPEX:Paddle Accuracy and Performance EXpansion pack