reborm's repositories
agentx
AgentX is an experiment to develop an autonomous agent that delegates well to Auto-GPT, babyagi, and other agents using LangChain
alpaca_chinese_dataset
A manually curated Chinese dialogue dataset, plus fine-tuning code for ChatGLM
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BillyGPT
A free, open-source, cross-platform, beginner-friendly local ChatGPT client with automatic prompt optimization; supports resuming interrupted chats, editing conversation history, local chat-log storage with import/export, and adding your own API key
ChatGLM-Finetuning
Fine-tuning the ChatGLM-6B model for specific downstream tasks, covering Freeze, LoRA, P-tuning, and other methods
ChatGLM-LLaMA-chinese-insturct
Exploring how Chinese instruct data performs when fine-tuning ChatGLM and LLaMA
ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to fine-tune the ChatGLM LLM with LoRA and RLHF on consumer hardware: an implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT, but with ChatGLM
Chatglm_lora_multi-gpu
Multi-GPU ChatGLM training with DeepSpeed and
ChatGLM_LoRA_zh
Small-parameter fine-tuning of the ChatGLM large model with LoRA; the training corpus is the Chinese [alpaca-zh](https://huggingface.co/datasets/shibing624/alpaca-zh) dataset
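The low-rank adaptation idea behind this kind of fine-tuning can be sketched in a few lines of plain Python (an illustrative example only, not code from this repository; the shapes and values are arbitrary):

```python
# LoRA sketch: instead of updating a full weight matrix W (d x k), train two
# low-rank factors B (d x r) and A (r x k), and add the scaled product B @ A
# to the frozen W. With r << min(d, k), far fewer parameters are trained.

def matmul(X, Y):
    """Plain-Python matrix multiply."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)] for row in X]

d, k, r, alpha = 4, 4, 1, 2        # tiny illustrative sizes; r << min(d, k)
W = [[1.0] * k for _ in range(d)]  # frozen pretrained weight
B = [[0.0] * r for _ in range(d)]  # B starts at zero, so the update starts at zero
A = [[0.5] * k for _ in range(r)]  # A gets a small random init in practice
scale = alpha / r
delta = [[scale * v for v in row] for row in matmul(B, A)]
W_adapted = [[w + dv for w, dv in zip(w_row, d_row)]
             for w_row, d_row in zip(W, delta)]
# Because B is zero-initialised, W_adapted equals W before any training step.
```

In practice this is handled by a library such as Hugging Face PEFT rather than written by hand; only B and A receive gradients, while W stays frozen.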
chatgpt-llamaindex-demo
ChatGPT and LlamaIndex demo
Chinese-LangChain
A Chinese LangChain project | 小必应 (Little Bing), Q.Talk, 强聊 (QiangTalk)
ChineseNLPCorpus
Chinese NLP datasets, collected as material for everyday experiments. Contributions and merge requests are welcome
ColossalAI
Making large AI models cheaper, faster and more accessible
DeepSpeed-Chat-ChatGLM
Includes RLHF
FinGLM
FinGLM: an open, non-profit, long-term project to build a financial large language model, using open source to advance "AI + finance"
hcgf
Humanable ChatGLM/GPT Fine-tuning | ChatGLM fine-tuning
InstructGLM
ChatGLM-6B instruction tuning | instruction data | Instruct
japanese-alpaca-lora
A Japanese instruction-finetuned LLaMA
llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
LLMs_interview_notes
This repository mainly collects interview questions for large language model (LLM) algorithm engineers
minChatGPT
A minimal example of aligning language models with RLHF, similar to ChatGPT
promptlib
A collection of prompts for use with GPT-4 via ChatGPT and the OpenAI API, with a Gradio frontend and notebook
RLHF
An implementation of a Chinese ChatGPT
text-to-sql-wizardcoder
Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom Spider training dataset. The resulting model achieves 61% execution accuracy, incorporating database context for validation
textgen
textgen: text generation models, including implementations of LLaMA, ChatGLM, UDA, GPT2, Seq2Seq, BART, T5, and more, ready to use out of the box
transformers_tasks
⭐️ NLP algorithms with the transformers lib, supporting text classification, text generation, information extraction, text matching, RLHF, SFT, etc.
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Vicuna-LoRA-RLHF-PyTorch
A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware: an implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically ChatGPT, but with Vicuna