Beast code in Giters

XHPlus's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | MIT | 164999 1560 2399

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | Apache-2.0 | 23428 216 3562

zstd

Zstandard - Fast real-time compression algorithm

Language: C | NOASSERTION | 22862 412 1383

triton

Development repository for the Triton language and compiler

Language: C++ | MIT | 12061 187 1314

phidata

Build AI Assistants with memory, knowledge and tools.

Language: Python | MPL-2.0 | 10709 83 141

AISystem

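AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.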

Language: Jupyter Notebook | Apache-2.0 | 9682 137 31

resume

An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git

Language: TeX | MIT | 8998 84 64

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | Apache-2.0 | 7924 56 1477

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language: Python | MIT | 6946 59 159

ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

Language: JavaScript | NOASSERTION | 6552 86 32

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖

Language: JavaScript | MIT | 5904 57 135

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language: Python | Apache-2.0 | 4462 76 87

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python | Apache-2.0 | 2109 22 169

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language: Python | Apache-2.0 | 1644 24 37

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

MIT | 1302 30 1

evo.ninja

A versatile generalist agent.

Language: TypeScript | MIT | 1052 20 261

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language: Jupyter Notebook | MIT | 1037 17 25

encord-active

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.

Language: Python | Apache-2.0 | 428 10 13

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language: Cuda | Apache-2.0 | 160 5 4

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit"

Language: Python | Apache-2.0 | 127 9 3

Dipoorlet

Offline quantization tools for deployment.

Language: Python | Apache-2.0 | 108 16 9

QDrop

The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization

Language: Python | Apache-2.0 | 107 1 20

awesome-lm-system

Summary of system papers, frameworks, code, and tools for training or serving large models

Apache-2.0 | 55 90

NART

NART = NART is not A RunTime, a deep learning inference framework.

Language: Python | Apache-2.0 | 37 10 1

Outlier_Suppression_Plus

Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling

Language: Python | MIT | 35 8 6

EasyLLM

Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.

Language: Python | Apache-2.0 | 31 8 1

Awesome-Efficient-Diffusion

Curated list of methods that focus on improving the efficiency of diffusion models

24 30

LPCV_2023_solution

Language: Python | 18 2 1

AAAI2023_EAMPD

AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline

13 7 2

general-sam

A general suffix automaton implementation in Rust with Python bindings

Language: Rust | Apache-2.0 | 2 6 1