Beast code in Giters

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

Language:PythonApache-2.01180300

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonApache-2.01799600

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonApache-2.0702700

llama

Inference code for Llama models

Language:PythonNOASSERTION5477400

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookApache-2.0206100

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonApache-2.0213800

aliendao

huggingface mirror download

Language:PythonMIT54000

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonApache-2.01319200

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonMIT112600

flash-attention

Fast and memory-efficient exact attention

Language:PythonBSD-3-Clause1267700

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonApache-2.062100

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.0773800

firejq

firejq's starred repositories

Mooncake

resolvelib

pip

DeepSeek-MoE

libarchive

DiT

Chinese-Mixtral

WeChatMsg

PaddleNLP

Chinese-LLaMA-Alpaca

Chinese-LLaMA-Alpaca-2

llama

Medusa

lightllm

aliendao

ChatGLM3

smoothquant

flash-attention

tensorrtllm_backend

TensorRT-LLM

AutoGPTQ

text-generation-inference

peft

fastllm

text-generation-webui

ppl.nn.llm

ControlNet

generative-models

AITemplate

stablediffusion