EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Home Page:https://www.eleuther.ai/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Support for Mosaic Models

rajveer43 opened this issue · comments

Support for Mosaic Models

Mosaic models use our tokenizer, so the main to-do items are:

  1. Are there any configs or features they use that we don't yet support?
  2. Converting the checkpoints to the GPT-NeoX format