vision-zhao's starred repositories

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM, Qwen 与 Llama 等)基于 Langchain 与 ChatGLM 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:29403Issues:273Issues:3432

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:25281Issues:169Issues:4095

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:14839Issues:103Issues:945

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14316Issues:266Issues:202

DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Language:PythonLicense:MITStargazers:12255Issues:113Issues:892

nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7696Issues:107Issues:438

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

DecryptPrompt

总结Prompt&LLM论文,开源数据&模型,AIGC应用

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Language:PythonLicense:Apache-2.0Stargazers:1961Issues:50Issues:79

mathematics_dataset

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Language:PythonLicense:Apache-2.0Stargazers:1751Issues:67Issues:13

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonLicense:MITStargazers:1330Issues:118Issues:15

textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。

Language:PythonLicense:Apache-2.0Stargazers:909Issues:11Issues:52

mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Language:PythonLicense:MITStargazers:572Issues:5Issues:10

sft_datasets

开源SFT数据集整理,随时补充

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language:PythonLicense:MITStargazers:354Issues:10Issues:34

DeepUtteranceAggregation

Modeling Multi-turn Conversation with Deep Utterance Aggregation (COLING 2018)

Humback

🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.

Language:PythonLicense:Apache-2.0Stargazers:121Issues:3Issues:9

BotChat

Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:113Issues:2Issues:1

HalluQA

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:102Issues:5Issues:0

FeTaQA

Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"

Language:PythonLicense:CC-BY-SA-4.0Stargazers:69Issues:9Issues:5

TencentLLMEval

TencentLLMEval is a comprehensive and extensive benchmark for artificial evaluation of large models that includes task trees, standards, data verification methods, and more.

KnowledgeHierarchy

高中理科知识体系总结

DPT

The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering

generate_question

generate question

Language:PythonLicense:Apache-2.0Stargazers:7Issues:1Issues:2
Language:PythonStargazers:3Issues:0Issues:0
Language:PythonStargazers:3Issues:1Issues:0