dachao.Wang's starred repositories
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Megatron-LM
Ongoing research training transformer models at scale
HighPerformanceComputing
High Performance Computing class taken at U.T.P., 2017
llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
LLMDataHub
A quick guide to trending instruction fine-tuning datasets
flash-attention
Fast and memory-efficient exact attention
CaptuocrToy
A tool to capture screenshots and recognize text via online OCR APIs
full_stack
Essential computer science and software development knowledge
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to run inference with any open-source language model, speech recognition model, or multimodal model, whether in the cloud, on-premises, or on your laptop.
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
personal_chatgpt
personal chatgpt
FightingCV-Paper-Reading
⭐⭐⭐ FightingCV Paper Reading, which helps you understand the most advanced research work more easily 🍀 🍀 🍀
External-Attention-pytorch
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding papers in depth. ⭐⭐⭐
CVPR2024-Papers-with-Code
A collection of CVPR 2024 papers and open-source projects