Beast code in Giters

init's starred repositories

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptNOASSERTION46971 349 4016

mem0

The Memory layer for your AI apps

Language:PythonApache-2.022057 126 654

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonMIT17735 112 468

doccano

Open source annotation tool for machine learning practitioners.

Language:PythonMIT9455 133 1523

translation-agent

Language:PythonMIT4654 51 15

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

MIT2009 22 50

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonNOASSERTION1777 26 46

UMOE-Scaling-Unified-Multimodal-LLMs

The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"

Language:Python756 11 9

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonBSD-3-Clause662 12 26

Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

CC0-1.0242 5 4

Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Language:Python240 2 18

ToolkenGPT

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)

Language:Python231 4 23

LLM-Tool-Survey

This is the repository for the Tool Learning survey.

206 1 4

AutoIF

Language:PythonApache-2.0199 6 7

loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Language:PythonApache-2.0132 11 4

Humback

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

Language:PythonApache-2.0130 3 9

retrieval-scaling

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Language:Python99 30

Lookback-Lens

Official implementation for the paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

Language:Python96 3 6

DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Language:Jupyter NotebookCC-BY-4.094 5 5

Internalize_CoT_Step_by_Step

Language:PythonMIT90 1 4

swiftLLM

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Language:PythonApache-2.085 3 1

LitSearch

A Retrieval Benchmark for Scientific Literature Search

Language:PythonMIT53 5 2

RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

Language:Python52 2 5

BRIGHT

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Language:PythonCC-BY-4.044 4 7

llm_factuality_tuning

Language:Python21 2 3

rag-qa-arena

Language:PythonApache-2.019 2 2

llm_hallucinations

Language:Python11 10

FactScoreLite

FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package builds upon the framework provided by the original FactScore repository, which is no longer maintained and contains outdated functions.

Language:PythonMIT600

pragmatic_calibration

Language:PythonApache-2.05 10

HalluPAQ

Leveraging Generated Q&A Pairs for Efficient Confidence Scoring and Hallucination Detection

Language:Python200