Weigao Sun's starred repositories
google-research
Google Research
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
pytorch_geometric
Graph Neural Network Library for PyTorch
flash-attention
Fast and memory-efficient exact attention
Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
cuda-samples
Samples for CUDA developers demonstrating features in the CUDA Toolkit
opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama 3, Mistral, InternLM2, GPT-4, LLaMA 2, Qwen, GLM, Claude, etc.) over 100+ datasets.
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
torchmetrics
TorchMetrics - Machine learning metrics for distributed, scalable PyTorch applications.
GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Awesome-Mixture-of-Experts-Papers
A curated reading list of research in Mixture-of-Experts (MoE).
NeMo-Megatron-Launcher
NeMo Megatron launcher and tools
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
lightning-attention
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models