Beast code in Giters

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷

Language: Python | License: Apache-2.0 | 267100

FT-Data-ranker-7b

Language: Python | 700

ActiveRAG

This is the code repo for our paper "Revealing the Treasures of Knowledge via Active Learning".

Language: Python | License: MIT | 8900

LLM-Knowledge-Boundary

Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"

Language: Python | 2200

RAG-Survey

175700

FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Language: Python | License: MIT | 27800

RGB

Language: Python | License: NOASSERTION | 26600

FAVA

Language: Python | 5300

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language: Python | License: NOASSERTION | 1480800

ToolAlpaca

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

Language: Python | License: Apache-2.0 | 28600

IncarnaMind

Connect and chat with multiple documents (PDF and TXT) through GPT-3.5, GPT-4 Turbo, Claude, and local open-source LLMs

Language: Python | License: Apache-2.0 | 77500

Llama-2-Open-Source-LLM-CPU-Inference

Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A

Language: Python | License: MIT | 94600

LangChain_LLM_ChatBot

A QA chatbot over local documents, built with LLMs and LangChain

Language: Python | 3400

MemGPT

Letta (fka MemGPT) is a framework for creating stateful LLM services.

Language: Python | License: Apache-2.0 | 1193100