zbtrs

zbtrs

Geek Repo

Company:Huazhong University of Science and Technology

Github PK Tool:Github PK Tool

zbtrs's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:130789Issues:1117Issues:15508

awesome-cpp

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34401Issues:340Issues:2682

clash-rules

🦄️ 🎃 👻 Clash Premium 规则集(RULE-SET),兼容 ClashX Pro、Clash for Windows 等基于 Clash Premium 内核的客户端。

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15525Issues:105Issues:1003

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:13253Issues:113Issues:874

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:11371Issues:91Issues:311

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:6180Issues:35Issues:1014

ModernCppStarter

🚀 Kick-start your C++! A template for modern C++ projects using CMake, CI, code coverage, clang-format, reproducible dependency management and much more.

Language:CMakeLicense:UnlicenseStargazers:4279Issues:68Issues:54

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:4238Issues:44Issues:404

clarity-upscaler

Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative

Language:PythonLicense:AGPL-3.0Stargazers:3525Issues:32Issues:39

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2191Issues:32Issues:7

LLM-Finetuning

LLM Finetuning with peft

Language:Jupyter NotebookStargazers:1977Issues:30Issues:3

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++License:Apache-2.0Stargazers:1621Issues:33Issues:630

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

CUDA-Learn-Notes

🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Language:CudaLicense:GPL-3.0Stargazers:1039Issues:11Issues:5

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language:CudaLicense:Apache-2.0Stargazers:1008Issues:14Issues:88

AzurePublicDataset

Microsoft Azure Traces

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:768Issues:37Issues:33

Awesome-GPU

Awesome resources for GPUs

License:BSD-3-ClauseStargazers:449Issues:23Issues:0

glake

GLake: optimizing GPU memory management and IO transmission.

Language:PythonLicense:Apache-2.0Stargazers:335Issues:6Issues:21

clash-premium-installer

Simple clash premium core installer for Linux.

how-to-learn-deep-learning-framework

how to learn PyTorch and OneFlow

ComfyUI-BiRefNet-ZHO

Better version for BiRefNet in ComfyUI | Both img & video

Language:PythonLicense:GPL-3.0Stargazers:199Issues:4Issues:13

vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

Language:CLicense:MITStargazers:170Issues:2Issues:4

summerschool

CSC Summer School in High-Performance Computing

Language:C++License:NOASSERTIONStargazers:89Issues:10Issues:1

gpu_memory_profiling

Profile the GPU memory usage of every line in a Pytorch code

comfyui-job-iterator

A for loop for ComfyUI

Language:PythonLicense:GPL-3.0Stargazers:69Issues:5Issues:3