rinnakk / japanese-pretrained-models

Code for producing Japanese pretrained models provided by rinna Co., Ltd.

Home Page:https://huggingface.co/rinna

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Train japanese-gpt2-xsmall from scratch

jurader opened this issue · comments

After the following command,

python -m corpus.jp_wiki.build_pretrain_dataset

the following command is necessary for training japanese-gpt2-xsmall from scratch.

python -m corpus.jp_wiki.split_to_small_files

If so, please update the usage.

Thank you and it has been fixed.