Eric8932

WenH's starred repositories

AIDB

AI4DB and DB4AI work

DBTune

A customized and efficient database tuning system [VLDB'22]

Language: Python | NOASSERTION | 3100

BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

Language: HTML | Apache-2.0 | 784300

chatgpt-corpus

ChatGPT Chinese corpus: dialogue, novel, and customer-service corpora for training large models

GPL-3.0 | 83300

chatterbot-corpus

A multilingual dialog corpus

Language: Python | BSD-3-Clause | 136700

chinese-chatbot-corpus

A public Chinese chat corpus

Language: Python | Apache-2.0 | 397500

Awesome-Continual-Learning

A curated list of Continual Learning papers and BibTeX entries

Language: TeX | 14000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | Apache-2.0 | 2786400

IE-Datasets-Collections

A curated collection of Chinese and English information extraction datasets

1300

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | Apache-2.0 | 2938800

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | Apache-2.0 | 1359700

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use, building a fine-tuning platform that researchers can pick up quickly. We welcome open-source enthusiasts to open any meaningful PR on this repo and integrate as many LLM-related technologies as possible.

Language: Jupyter Notebook | Apache-2.0 | 259300

DB-GPT

An LLM Based Diagnosis System (https://arxiv.org/pdf/2312.01454.pdf)

Language: Python | Apache-2.0 | 54200

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language: Python | Apache-2.0 | 47500

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | Apache-2.0 | 3189300

Awesome-LLMs-Datasets

A summary of existing representative LLM text datasets.

Apache-2.0 | 84100

Awesome-LLM-Interpretability

A curated list of LLM interpretability material: tutorials, libraries, surveys, papers, blogs, etc.

awesome-chatgpt-prompts-zh

A Chinese guide to prompting ChatGPT, with usage guides for a variety of scenarios. Learn how to make it do what you ask.

MIT | 5233300

COIG

awesome-instruction-datasets

A collection of instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) for training ChatLLM models such as ChatGPT.

Apache-2.0 | 50500

pCLUE

pCLUE: a multi-task prompt-learning dataset with 1,000,000+ examples

Language: Jupyter Notebook | 46700

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

MIT | 157500

EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Language: Jupyter Notebook | MIT | 181500

Awesome-Chinese-LLM

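A collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.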

resume-ng

A LaTeX resume template designed for optimal information density and aesthetic appeal.

Language: TeX | LPPL-1.3c | 28100

prompt2model

prompt2model - Generate Deployable Models from Natural Language Instructions

Language: Python | Apache-2.0 | 194600

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | Apache-2.0 | 223400

sparse-probing-paper

Full code for the sparse probing paper.

Language: Jupyter Notebook | MIT | 4900

LibMTL

A PyTorch Library for Multi-Task Learning

Language: Python | MIT | 197000

mend

MEND: Fast Model Editing at Scale

Language: Python | MIT | 22900