Bing Bo's starred repositories
dive-into-llms
Dive into LLMs (《动手学大模型》): a series of hands-on programming tutorials.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
llm-action
This project shares the technical principles behind large language models, together with hands-on practical experience.
llama-cpp-python
Python bindings for llama.cpp
LLM-Tuning
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream tuning.
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
NexusRaven
NexusRaven-13B, a new state-of-the-art open-source LLM for function calling. This repo contains everything needed to reproduce our evaluation of NexusRaven-13B and the baselines.
ChatGPTX-Uni
A cross-model scheme combining multi-LoRA weight-ensemble switching with Zero-Finetune (no-fine-tuning) enhancement: LLM-Base + LLM-X + Alpaca. In the initial stage, LLM-Base is the ChatGLM-6B base model and LLM-X is a LLaMA enhancement model. The scheme is simple and efficient; its goal is to deploy such language models widely at low energy cost and ultimately elicit "emergent intelligence" on top of a small-model base, aiming to match the human-friendly quality of ChatGPT, GPT-4, ChatRWKV, and similar systems at minimal compute cost. It currently handles summarization, question generation, Q&A, abstracting, rewriting, commenting, role-play, and other tasks.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
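For context on what DPO optimizes: a minimal, schematic sketch of the per-example DPO objective — the negative log-sigmoid of the scaled difference between the policy/reference log-ratios for the chosen and rejected responses. This is an illustrative scalar version written for this list, not code from the repository; the function name and signature are invented for the example, and in practice the log-probabilities come from summing token log-probs of a policy and a frozen reference model.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss (scalar sketch, hypothetical signature).

    loss = -log sigmoid(beta * ((pi_chosen - ref_chosen)
                                - (pi_rejected - ref_rejected)))
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_logratio - rejected_logratio)
    # -log sigmoid(x) == log(1 + exp(-x)); log1p form is numerically stable
    return math.log1p(math.exp(-logits))
```

When the policy matches the reference exactly, the logits are zero and the loss is log 2; widening the margin in favor of the chosen response drives the loss below that baseline, which is the gradient signal DPO trains on.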