yuflo

Yufluo Lee's starred repositories

EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Language: Jupyter Notebook | MIT | 170700

mergekit

Tools for merging pretrained large language models.

Language: Python | LGPL-3.0 | 436900

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | MIT | 1595600

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language: Python | MIT | 1814800

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language: Python | MIT | 512900

llm-paper-daily

Daily updated LLM papers. Subscriptions welcome 👏 If you like it, please give it a 🌟

86600

awesome-ai-agents

A list of AI autonomous agents

NOASSERTION | 932400

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Language: Python | AGPL-3.0 | 930800

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

608500

llm-hallucination-survey

Reading list on hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

87700

Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs.

Apache-2.0 | 81500

Awesome-Chinese-LLM

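A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.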

1432700

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language: Python | Apache-2.0 | 208900

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.

Language: Python | Apache-2.0 | 470400

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Language: Python | 916400

gptrpg

A demo of a GPT-based agent existing in an RPG-like environment

Language: JavaScript | 97200

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | Apache-2.0 | 357700

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | MIT | 619700

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python | MIT | 425900

AI_Tutorial

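A curated selection of learning materials for AI fields such as machine learning, NLP, image recognition, and deep learning, plus a collection of architecture and algorithm resources for search, recommendation, and advertising systems. A compilation of notes from leading algorithm experts.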

298800

AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

Language: TypeScript | GPL-3.0 | 3116100

camel

🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org

Language: Python | Apache-2.0 | 522100

babyagi

Language: Python | MIT | 1984400

assistgpt

Language: JavaScript | 6500

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language: Python | Apache-2.0 | 518600

gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Language: Python | Apache-2.0 | 1109600

LLM-Blender

[ACL 2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cuts the weaknesses through ranking and integrates the strengths through fusing generation to enhance the capability of LLMs.

Language: Python | Apache-2.0 | 84400

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language: Python | Apache-2.0 | 183300

PandaLM

Language: Python | Apache-2.0 | 87500

CPM-Bee

A Chinese-English bilingual base large language model with ten billion parameters

Language: Python | 268300