dachao .Wang (wangdach)


Location: Shanghai, China


dachao .Wang's starred repositories

gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Language: Python | License: Apache-2.0 | Stargazers: 1861 | Issues: 0

llama.cpp

LLM inference in C/C++

Language: C++ | License: MIT | Stargazers: 64552 | Issues: 0

alpaca.cpp

Locally run an Instruction-Tuned Chat-Style LLM

Language: C | License: MIT | Stargazers: 10256 | Issues: 0

GPT-4-LLM

Instruction Tuning with GPT-4

Language: HTML | License: Apache-2.0 | Stargazers: 4155 | Issues: 0

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Stargazers: 1025 | Issues: 0

ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Language: Python | License: Apache-2.0 | Stargazers: 527 | Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | License: NOASSERTION | Stargazers: 5961 | Issues: 0

DeepLearing-Interview-Awesome-2024

A collection of AIGC, CV, and LLM interview questions and answers, also covering new ideas, new problems, new resources, and new projects encountered in work and research.

Stargazers: 1497 | Issues: 0

llm_interview_note

Notes on knowledge and interview questions for large language model (LLM) algorithm and application engineers.

Language: HTML | Stargazers: 2399 | Issues: 0

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language: Python | License: BSD-3-Clause | Stargazers: 3903 | Issues: 0
Language: Python | Stargazers: 6 | Issues: 0

typora_plugin

Typora plugin. Feature enhancement tool.

Language: JavaScript | License: MIT | Stargazers: 1571 | Issues: 0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 5996 | Issues: 0

xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)

Language: C++ | License: NOASSERTION | Stargazers: 2435 | Issues: 0

pytorch-model-train-template

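PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods.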

Language: Python | Stargazers: 65 | Issues: 0

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1784 | Issues: 0

transformers_zh_docs

Chinese documentation for Hugging Face transformers.

Language: Python | Stargazers: 143 | Issues: 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 1307 | Issues: 0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language: Python | License: NOASSERTION | Stargazers: 1824 | Issues: 0

tutorials

PyTorch tutorials.

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 8088 | Issues: 0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language: Python | License: MIT | Stargazers: 4546 | Issues: 0

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7625 | Issues: 0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 15707 | Issues: 0

zig

General-purpose programming language and toolchain for maintaining robust, optimal, and reusable software.

Language: Zig | License: MIT | Stargazers: 33586 | Issues: 0

python_backend

Triton backend that enables pre-processing, post-processing and other logic to be implemented in Python.

Language: C++ | License: BSD-3-Clause | Stargazers: 520 | Issues: 0

common

Common source, scripts and utilities shared across all Triton repositories.

Language: C++ | License: BSD-3-Clause | Stargazers: 62 | Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | Stargazers: 83 | Issues: 0

tutorials

This repository contains tutorials and examples for Triton Inference Server

Language: Python | License: BSD-3-Clause | Stargazers: 518 | Issues: 0

client

Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala.

Language: C++ | License: BSD-3-Clause | Stargazers: 544 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5974 | Issues: 0