init's starred repositories

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:46971Issues:349Issues:4016

mem0

The Memory layer for your AI apps

Language:PythonLicense:Apache-2.0Stargazers:22057Issues:126Issues:654

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language:PythonLicense:MITStargazers:17735Issues:112Issues:468

doccano

Open source annotation tool for machine learning practitioners.

Language:PythonLicense:MITStargazers:9455Issues:133Issues:1523

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonLicense:NOASSERTIONStargazers:1777Issues:26Issues:46

UMOE-Scaling-Unified-Multimodal-LLMs

The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"

bonito

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Language:PythonLicense:BSD-3-ClauseStargazers:662Issues:12Issues:26

Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

ToolkenGPT

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)

LLM-Tool-Survey

This is the repository for the Tool Learning survey.

Language:PythonLicense:Apache-2.0Stargazers:199Issues:6Issues:7

loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Language:PythonLicense:Apache-2.0Stargazers:132Issues:11Issues:4

Humback

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

Language:PythonLicense:Apache-2.0Stargazers:130Issues:3Issues:9

retrieval-scaling

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Language:PythonStargazers:99Issues:3Issues:0

Lookback-Lens

Official implementation for the paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

DocGenome

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:94Issues:5Issues:5

swiftLLM

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Language:PythonLicense:Apache-2.0Stargazers:85Issues:3Issues:1

LitSearch

A Retrieval Benchmark for Scientific Literature Search

Language:PythonLicense:MITStargazers:53Issues:5Issues:2

RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

BRIGHT

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Language:PythonLicense:CC-BY-4.0Stargazers:44Issues:4Issues:7
Language:PythonLicense:Apache-2.0Stargazers:19Issues:2Issues:2

FactScoreLite

FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package builds upon the framework provided by the original FactScore repository, which is no longer maintained and contains outdated functions.

Language:PythonLicense:MITStargazers:6Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:5Issues:1Issues:0

HalluPAQ

Leveraging Generated Q&A Pairs for Efficient Confidence Scoring and Hallucination Detection

Language:PythonStargazers:2Issues:0Issues:0