XHPlus's starred repositories

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:10709Issues:0Issues:0

ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

Language:JavaScriptLicense:NOASSERTIONStargazers:6552Issues:0Issues:0

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language:Jupyter NotebookLicense:MITStargazers:1037Issues:0Issues:0

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit"

Language:PythonLicense:Apache-2.0Stargazers:127Issues:0Issues:0

Awesome-Efficient-Diffusion

Curated list of methods that focuses on improving the efficiency of diffusion models

Stargazers:24Issues:0Issues:0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonLicense:Apache-2.0Stargazers:2109Issues:0Issues:0

zstd

Zstandard - Fast real-time compression algorithm

Language:CLicense:NOASSERTIONStargazers:22861Issues:0Issues:0

EasyLLM

Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.

Language:PythonLicense:Apache-2.0Stargazers:31Issues:0Issues:0

evo.ninja

A versatile generalist agent.

Language:TypeScriptLicense:MITStargazers:1052Issues:0Issues:0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:164994Issues:0Issues:0

Awesome-LLM-Reasoning

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.

License:MITStargazers:1302Issues:0Issues:0

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1644Issues:0Issues:0

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9682Issues:0Issues:0

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language:CudaLicense:Apache-2.0Stargazers:160Issues:0Issues:0

Outlier_Suppression_Plus

Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling

Language:PythonLicense:MITStargazers:35Issues:0Issues:0

general-sam

A general suffix automaton implementation in Rust with Python bindings

Language:RustLicense:Apache-2.0Stargazers:2Issues:0Issues:0

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonLicense:Apache-2.0Stargazers:7924Issues:0Issues:0

encord-active

The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.

Language:PythonLicense:Apache-2.0Stargazers:428Issues:0Issues:0

RealChar

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖

Language:JavaScriptLicense:MITStargazers:5904Issues:0Issues:0

GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Language:PythonLicense:MITStargazers:6946Issues:0Issues:0
Language:PythonStargazers:18Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4462Issues:0Issues:0

awesome-lm-system

Summary of system papers/frameworks/codes/tools on training or serving large model

License:Apache-2.0Stargazers:55Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23417Issues:0Issues:0

Dipoorlet

Offline Quantization Tools for Deploy.

Language:PythonLicense:Apache-2.0Stargazers:108Issues:0Issues:0

resume

An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git

Language:TeXLicense:MITStargazers:8998Issues:0Issues:0

QDrop

The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization

Language:PythonStargazers:107Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:12061Issues:0Issues:0

AAAI2023_EAMPD

AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline

Stargazers:13Issues:0Issues:0

NART

NART = NART is not A RunTime, a deep learning inference framework.

Language:PythonLicense:Apache-2.0Stargazers:37Issues:0Issues:0