shahules786 / Open-Instruction-Generalist

Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks

Open-Instruction-Generalist Assistant

Open Instruction Generalist (OIG) Assistant is part of LAION-AI's OpenChat family of assistants. It is trained on massive synthetic instructions with the goal of performing many diverse types of tasks.

We will release several versions of the OIG Assistant, ranging from one trained on a small (less than 1M) high-quality synthetic dataset to one trained on medium-quality but massive synthetic instructions. The research goal of OIG Assistant is to create high-performing bots by using simple finetuning instead of RLHF.

We will create ever larger instruction datasets, starting with 2M, with the goal of eventually generating 1T medium-quality instructions. The recipe for training is to do additional pretraining on some subset of the larger instruction sets, followed by finetuning on OIG-small or some other high-quality small dataset.

  • OIG-small - Done and released. See small_instruction_set sub-directory.
  • OIG-2M - Done. See 2M sub-directory.
  • OIG-4M - In progress, to be released soon.
  • OIG-20M - In progress, release expected March 1, 2023.
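
The two-stage recipe described above (additional pretraining on a subset of a larger instruction set, then finetuning on a small high-quality set) can be illustrated with a minimal data-ordering sketch. This is not the repository's training code: the function name `build_two_stage_plan`, its parameters, and the toy instruction strings are all hypothetical, and random sampling is only one assumed way of choosing "some subset". The sketch only shows the order in which examples would reach a trainer.

```python
import random
from typing import List, Sequence, Tuple


def build_two_stage_plan(
    large_instructions: Sequence[str],
    small_instructions: Sequence[str],
    subset_size: int,
    seed: int = 0,
) -> List[Tuple[str, str]]:
    """Return (stage, example) pairs in the order a trainer would consume them.

    Hypothetical helper for illustration only; not part of the OIG code base.
    """
    if not 0 <= subset_size <= len(large_instructions):
        raise ValueError("subset_size must be between 0 and len(large_instructions)")

    rng = random.Random(seed)

    # Stage 1: additional pretraining on a subset of the larger instruction set.
    # Random sampling is an assumption; the README does not say how the subset is chosen.
    pretrain_subset = rng.sample(list(large_instructions), subset_size)
    plan = [("pretrain", example) for example in pretrain_subset]

    # Stage 2: finetuning on the complete small, high-quality set.
    plan += [("finetune", example) for example in small_instructions]
    return plan


# Toy stand-ins for a large medium-quality set and a small high-quality set.
large = [f"medium-quality instruction {i}" for i in range(10)]
small = ["high-quality instruction A", "high-quality instruction B"]

plan = build_two_stage_plan(large, small, subset_size=4)
for stage, example in plan:
    print(stage, "|", example)
```

In a real run, each stage would be handed to an actual training loop; the point here is only that every pretraining example precedes every finetuning example.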

About

License: Apache License 2.0


Languages

Python 96.9%, Jupyter Notebook 3.1%