huziyuan14's repositories
Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data, continuously expanding), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use, providing a convenient LLM-IFT research platform. Meanwhile, the tabular_llm branch builds an LLM for tabular intelligence tasks.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue LLM)
bookcorpus
Crawl BookCorpus
chatGLM-6B-QLoRA
Efficient 4-bit QLoRA fine-tuning of ChatGLM-6B with the peft library, including merging the LoRA model into the base model and 4-bit quantization.
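The merge step described above amounts to folding the low-rank adapter back into the frozen base weights: W' = W + (alpha / r) * B @ A. A minimal pure-Python sketch of that arithmetic (toy dimensions, not the peft API):

```python
# Toy illustration of merging a LoRA adapter into a base weight matrix:
# W_merged = W + (alpha / r) * (B @ A), where A is (r x in) and B is (out x r).
# Pure Python, no deep-learning framework; shapes are tiny for clarity.

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]

def merge_lora(W, A, B, alpha, r):
    """Return W + (alpha / r) * B @ A as a new matrix."""
    scale = alpha / r
    delta = matmul(B, A)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

# 2x2 base weight with a rank-1 adapter (r = 1).
W = [[1.0, 0.0],
     [0.0, 1.0]]
A = [[1.0, 2.0]]          # r x in  = 1 x 2
B = [[0.5], [0.25]]       # out x r = 2 x 1
merged = merge_lora(W, A, B, alpha=2.0, r=1)
print(merged)  # [[2.0, 2.0], [0.5, 2.0]]
```

After merging, the adapter matrices can be discarded and the model served as a single dense checkpoint, which is why the merge precedes the final quantization step.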
ChatGLM-Tuning
An affordable ChatGPT-style implementation based on ChatGLM-6B + LoRA.
chatgpt-corpus
A Chinese ChatGPT corpus: dialogue, novel, and customer-service data for training large models.
ChatLM-mini-Chinese
A small 0.2B-parameter Chinese dialogue model (ChatLM-Chinese-0.2B). Open-sources the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks.
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models with local CPU/GPU training and deployment.
LexiconAugmentedNER
A simple approach to incorporating lexicons for Chinese NER, avoiding complicated operations.
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
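QLoRA keeps the frozen base model in 4-bit precision while training adapters. As a toy illustration of the quantize/dequantize round trip (symmetric absmax int4; QLoRA itself uses the NF4 data type plus double quantization, which this sketch does not implement):

```python
# Toy symmetric absmax 4-bit quantization (integer levels -8..7).
# Illustrative only: QLoRA uses NF4 + double quantization, not this scheme.

def quantize_4bit(xs):
    """Map floats to 4-bit integers in [-8, 7] using an absmax scale."""
    scale = max(abs(x) for x in xs) / 7.0 if any(xs) else 1.0
    q = [max(-8, min(7, round(x / scale))) for x in xs]
    return q, scale

def dequantize_4bit(q, scale):
    """Recover approximate floats from the 4-bit codes."""
    return [v * scale for v in q]

weights = [0.12, -0.7, 0.35, 0.0]
q, scale = quantize_4bit(weights)
approx = dequantize_4bit(q, scale)
# Each reconstructed value is within one quantization step of the original.
assert all(abs(a - w) <= scale for a, w in zip(approx, weights))
```

The point of the round trip: 4-bit storage cuts memory roughly 4x versus fp16, at the cost of a bounded per-weight error of at most one quantization step.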
Chinese-Medical-Entity-Recognition
Chinese medical entity recognition using BERT + Bi-LSTM + CRF.
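In a BERT + Bi-LSTM + CRF tagger, the CRF layer selects the highest-scoring tag sequence via Viterbi decoding over emission and transition scores. A minimal pure-Python sketch with toy scores (illustrative values, not this repo's code):

```python
# Minimal Viterbi decoding as used by a linear-chain CRF tagging layer.
# In a real model, emissions come from BERT + Bi-LSTM and the transition
# matrix is learned; the numbers below are toy values for illustration.

def viterbi(emissions, transitions):
    """emissions: [timestep][tag] scores; transitions: [prev][next] scores.
    Returns the highest-scoring tag index sequence."""
    n_tags = len(emissions[0])
    score = list(emissions[0])   # best path score ending in each tag
    backptr = []
    for emit in emissions[1:]:
        new_score, ptrs = [], []
        for nxt in range(n_tags):
            best_prev = max(range(n_tags),
                            key=lambda p: score[p] + transitions[p][nxt])
            ptrs.append(best_prev)
            new_score.append(score[best_prev]
                             + transitions[best_prev][nxt] + emit[nxt])
        score, backptr = new_score, backptr + [ptrs]
    # Trace back from the best final tag.
    best = max(range(n_tags), key=lambda t: score[t])
    path = [best]
    for ptrs in reversed(backptr):
        path.append(ptrs[path[-1]])
    return path[::-1]

# Tags: 0 = O, 1 = B-ENT, 2 = I-ENT
emissions = [[0.1, 2.0, 0.0],     # strong B-ENT evidence
             [0.2, 0.1, 1.5],     # strong I-ENT evidence
             [1.0, 0.3, 0.2]]     # strong O evidence
transitions = [[0.5, 0.5, -5.0],  # O -> I-ENT effectively forbidden
               [0.0, -1.0, 1.0],  # B-ENT -> I-ENT favoured
               [0.5, 0.0, 0.5]]
print(viterbi(emissions, transitions))  # [1, 2, 0]
```

The transition matrix is what distinguishes a CRF head from per-token softmax: invalid sequences such as O followed by I-ENT are penalized jointly rather than predicted independently.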
ChineseNLPCorpus
Chinese NLP datasets, collected as material for everyday experiments. Contributions and pull requests welcome.
CLUEDatasetSearch
Search across all Chinese NLP datasets, with commonly used English NLP datasets included.
ColossalAI
Making large AI models cheaper, faster and more accessible
hugging-multi-agent
A tutorial based on MetaGPT to help you quickly understand agent and multi-agent concepts and get started with development.
KnowLM
Knowledgeable Large Language Model Framework.
Linly
Chinese-LLaMA 1&2 and Chinese-Falcon base models; the ChatFlow Chinese dialogue model; a Chinese OpenLLaMA model; and NLP pre-training/instruction fine-tuning datasets.
lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
LLaMA-Efficient-Tuning
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen)
LLM-SFT
Chinese LLM fine-tuning (LLM-SFT) with the MWP-Instruct math instruction dataset. Supports models (ChatGLM-6B, LLaMA, Bloom-7B, Baichuan-7B), methods (LoRA, QLoRA, DeepSpeed, UI, TensorboardX), and workflows (fine-tuning, inference, evaluation, API).
LLM-Tuning
Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.
NLP2
Assorted applied natural language processing models.
Phi2-mini-Chinese
Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for loading a local knowledge base for retrieval-augmented generation (RAG).
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
Transformer-pytorch
A Transformer implementation in PyTorch (Python).
transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.