xianbin7's starred repositories

alpaca-chinese-dataset

Alpaca Chinese instruction fine-tuning dataset

Stargazers: 391 · Issues: 0

ChineseNlpCorpus

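Collects, organizes, and publishes Chinese natural language processing corpora and datasets, working with like-minded contributors to advance Chinese NLP.

Language: Jupyter Notebook · Stargazers: 5824 · Issues: 0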

nlp_chinese_corpus

Large Scale Chinese Corpus for NLP

License: MIT · Stargazers: 9432 · Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python · License: Apache-2.0 · Stargazers: 38691 · Issues: 0

xmtf

Crosslingual Generalization through Multitask Finetuning

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 513 · Issues: 0

awesome-totally-open-chatgpt

A list of totally open alternatives to ChatGPT

License: CC0-1.0 · Stargazers: 4507 · Issues: 0

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python · License: AGPL-3.0 · Stargazers: 39848 · Issues: 0

DecryptPrompt

A summary of Prompt and LLM papers, open-source data and models, and AIGC applications

Stargazers: 2608 · Issues: 0

Learn_Prompting

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

Language: MDX · License: NOASSERTION · Stargazers: 4278 · Issues: 0

PaddleFleetX

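PaddlePaddle large-model development suite, providing an end-to-end development toolchain for large language models, cross-modal large models, biocomputing large models, and related domains.

Language: Python · License: Apache-2.0 · Stargazers: 438 · Issues: 0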

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 18566 · Issues: 0

paper-reading

Paragraph-by-paragraph close readings of classic and recent deep learning papers

License: Apache-2.0 · Stargazers: 26497 · Issues: 0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python · License: Apache-2.0 · Stargazers: 40465 · Issues: 0

MNBVC

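MNBVC (Massive Never-ending BT Vast Chinese corpus) is a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripted dialogue, forum posts, wikis, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.

License: MIT · Stargazers: 3417 · Issues: 0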
Language: Python · License: Apache-2.0 · Stargazers: 9003 · Issues: 0

Instructdial

Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning

Language: Python · License: Apache-2.0 · Stargazers: 96 · Issues: 0

TencentPretrain

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Language: Python · License: NOASSERTION · Stargazers: 1021 · Issues: 0

llama-docker-playground

Quick Start LLaMA models with multiple methods, and fine-tune 7B/65B with One-Click.

Language: Python · License: GPL-3.0 · Stargazers: 350 · Issues: 0

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language: Python · License: Apache-2.0 · Stargazers: 6864 · Issues: 0

ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Language: Python · License: Apache-2.0 · Stargazers: 9393 · Issues: 0

Fengshenbang-LM

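Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.

Language: Python · License: Apache-2.0 · Stargazers: 4006 · Issues: 0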

gpt-2-output-dataset

Dataset of GPT-2 outputs for research in detection, biases, and more

Language: Python · License: MIT · Stargazers: 1937 · Issues: 0

pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

Language: Python · License: MIT · Stargazers: 175 · Issues: 0

openai-cookbook

Examples and guides for using the OpenAI API

Language: MDX · License: MIT · Stargazers: 58848 · Issues: 0

iPrompt

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

Language: Python · Stargazers: 121 · Issues: 0

minRLHF

A (somewhat) minimal library for finetuning language models with PPO on human feedback.

Language: Python · Stargazers: 84 · Issues: 0

TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Language: Python · License: MIT · Stargazers: 539 · Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python · License: MIT · Stargazers: 8843 · Issues: 0

GPT2

An implementation of training for GPT2, supports TPUs

Language: Python · License: MIT · Stargazers: 1419 · Issues: 0

transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Language: Jupyter Notebook · Stargazers: 2124 · Issues: 0