XHPlus's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:164999Issues:1560Issues:2399

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23428Issues:216Issues:3562

zstd

Zstandard - Fast real-time compression algorithm

Language:CLicense:NOASSERTIONStargazers:22862Issues:412Issues:1383

triton

Development repository for the Triton language and compiler

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:10709Issues:83Issues:141

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9682Issues:137Issues:31

resume

An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git

Language:TeXLicense:MITStargazers:8998Issues:84Issues:64

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonLicense:Apache-2.0Stargazers:7924Issues:56Issues:1477

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language:PythonLicense:MITStargazers:6946Issues:59Issues:159

ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

Language:JavaScriptLicense:NOASSERTIONStargazers:6552Issues:86Issues:32

RealChar

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖

Language:JavaScriptLicense:MITStargazers:5904Issues:57Issues:135

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4462Issues:76Issues:87

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:2109Issues:22Issues:169

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1644Issues:24Issues:37

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

evo.ninja

A versatile generalist agent.

Language:TypeScriptLicense:MITStargazers:1052Issues:20Issues:261

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language:Jupyter NotebookLicense:MITStargazers:1037Issues:17Issues:25

encord-active

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.

Language:PythonLicense:Apache-2.0Stargazers:428Issues:10Issues:13

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language:CudaLicense:Apache-2.0Stargazers:160Issues:5Issues:4

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit"

Language:PythonLicense:Apache-2.0Stargazers:127Issues:9Issues:3

Dipoorlet

Offline Quantization Tools for Deploy.

Language:PythonLicense:Apache-2.0Stargazers:108Issues:16Issues:9

QDrop

The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization

Language:PythonLicense:Apache-2.0Stargazers:107Issues:1Issues:20

awesome-lm-system

Summary of system papers/frameworks/codes/tools on training or serving large model

License:Apache-2.0Stargazers:55Issues:9Issues:0

NART

NART = NART is not A RunTime, a deep learning inference framework.

Language:PythonLicense:Apache-2.0Stargazers:37Issues:10Issues:1

Outlier_Suppression_Plus

Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling

Language:PythonLicense:MITStargazers:35Issues:8Issues:6

EasyLLM

Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.

Language:PythonLicense:Apache-2.0Stargazers:31Issues:8Issues:1

Awesome-Efficient-Diffusion

Curated list of methods that focuses on improving the efficiency of diffusion models

AAAI2023_EAMPD

AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline

general-sam

A general suffix automaton implementation in Rust with Python bindings

Language:RustLicense:Apache-2.0Stargazers:2Issues:6Issues:1