zpye's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:36231Issues:0Issues:0

AppShift-MemoryPool

A very fast cross-platform memory pool mechanism for C++ built using a data-oriented approach (3 to 24 times faster than regular new or delete, depending on operating system & compiler)

Language:C++License:Apache-2.0Stargazers:204Issues:0Issues:0

pgvector

Open-source vector similarity search for Postgres

Language:CLicense:NOASSERTIONStargazers:11426Issues:0Issues:0

meta.hpp

C++20 Dynamic Reflection Library

Language:C++License:MITStargazers:126Issues:0Issues:0

caches

C++ cache with LRU/LFU/FIFO policies implementation

Language:C++License:BSD-3-ClauseStargazers:330Issues:0Issues:0

mppp

Multiprecision for modern C++

Language:C++License:MPL-2.0Stargazers:294Issues:0Issues:0

junction

Concurrent data structures in C++

Language:C++License:NOASSERTIONStargazers:1395Issues:0Issues:0

MxEngine

C++ open source 3D game engine

Language:C++License:BSD-3-ClauseStargazers:1107Issues:0Issues:0

hello-algo

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Language:JavaLicense:NOASSERTIONStargazers:93758Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:27203Issues:0Issues:0

gpt4all

GPT4All: Chat with Local LLMs on Any Device

Language:C++License:MITStargazers:68715Issues:0Issues:0

3d-game-shaders-for-beginners

🎮 A step-by-step guide to implementing SSAO, depth of field, lighting, normal mapping, and more for your 3D game.

Language:C++Stargazers:17648Issues:0Issues:0

Clang-Compiler-Frontend

《Clang Compiler Frontend》的非专业个人翻译

Language:TeXLicense:Apache-2.0Stargazers:27Issues:0Issues:0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:2261Issues:0Issues:0

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9851Issues:0Issues:0

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language:PythonLicense:MITStargazers:5943Issues:0Issues:0

GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Language:PythonLicense:Apache-2.0Stargazers:2970Issues:0Issues:0

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Language:PythonLicense:Apache-2.0Stargazers:199Issues:0Issues:0

model_optimization

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.

Language:PythonLicense:Apache-2.0Stargazers:295Issues:0Issues:0

awesome-model-quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

Stargazers:1774Issues:0Issues:0

model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Language:PythonLicense:Apache-2.0Stargazers:1485Issues:0Issues:0

BigNumber

C++ class for creating and computing arbitrary-length integers

Language:C++License:Apache-2.0Stargazers:191Issues:0Issues:0

flashinfer

FlashInfer: Kernel Library for LLM Serving

Language:CudaLicense:Apache-2.0Stargazers:1029Issues:0Issues:0

CTranslate2

Fast inference engine for Transformer models

Language:C++License:MITStargazers:3162Issues:0Issues:0

KuiperLLama

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama的大模型推理框架。

Language:C++Stargazers:147Issues:0Issues:0

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:4418Issues:0Issues:0

drawio-desktop

Official electron build of draw.io

Language:JavaScriptLicense:Apache-2.0Stargazers:49273Issues:0Issues:0

sobjectizer

An implementation of Actor, Publish-Subscribe, and CSP models in one rather small C++ framework. With performance, quality, and stability proved by years in the production.

Language:C++License:NOASSERTIONStargazers:474Issues:0Issues:0

recycle

Simple resource pool for recycling resources in C++

Language:C++License:BSD-3-ClauseStargazers:63Issues:0Issues:0

high_impact

A 2d game engine written in C

Language:CLicense:MITStargazers:1014Issues:0Issues:0