TushaarGVS / GPT

Implementation of a Generative Pre-trained Transformer (GPT) model in TensorFlow

GPT

This repository is a simple and clean GPT implementation in TensorFlow.

Dependencies

  • Python 3.8
  • TensorFlow 2.8
  • TensorFlow Text 2.8.1
  • TensorFlow Datasets 4.8.1
  • Datasets 2.8.0

Usage

Train

By default, the model trains on the OpenWebText dataset. Use --model_dir=<model_dir> to specify the model directory.

python train.py --model_dir=<model_dir> 

Some other options:

  • Use --build_vocab=True to build a WordPiece vocabulary before training.

Generate

Use --model_dir=<model_dir> and --context=<context> to specify the model directory and the context string to condition generation on.

python generate.py --model_dir=<model_dir> --context=<context>
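Conceptually, generation conditions on the context tokens and then samples one token at a time from the model's next-token distribution. The sketch below illustrates that loop with a toy stand-in for the model; the names (`logits_fn`, `generate`) and the sampling details are illustrative assumptions, not the repository's actual `generate.py` code.

```python
import numpy as np

def generate(logits_fn, context_ids, max_new_tokens=8, temperature=1.0, seed=0):
    """Illustrative autoregressive sampling loop (not the repo's actual code).

    logits_fn: maps a list of token ids to next-token logits (1-D np.ndarray).
    """
    rng = np.random.default_rng(seed)
    ids = list(context_ids)
    for _ in range(max_new_tokens):
        logits = logits_fn(ids) / temperature     # temperature-scale the logits
        probs = np.exp(logits - logits.max())     # stable softmax
        probs /= probs.sum()
        ids.append(int(rng.choice(len(probs), p=probs)))  # sample next token
    return ids

# Toy "model" over a vocabulary of 10 ids: strongly favours last_id + 1.
toy = lambda ids: np.eye(10)[(ids[-1] + 1) % 10] * 5.0
print(generate(toy, [0], max_new_tokens=4))
```

In the real script, `logits_fn` would be a forward pass of the trained transformer, and the sampled ids would be detokenized back to text with the WordPiece vocabulary.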

Hparams setting

Adjust hyperparameters in config.py.
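For orientation, a config for a GPT-style model typically covers the tokenizer, architecture, and optimizer settings. The excerpt below is a hypothetical sketch with common defaults; the actual names and values in the repository's config.py may differ.

```python
# Hypothetical config.py excerpt -- names and values are illustrative only.
hparams = dict(
    vocab_size=32_768,      # WordPiece vocabulary size
    context_length=512,     # maximum sequence length
    d_model=768,            # embedding / hidden size
    num_layers=12,          # number of transformer blocks
    num_heads=12,           # attention heads per block
    d_ff=3_072,             # feed-forward inner size (conventionally 4 * d_model)
    dropout_rate=0.1,
    learning_rate=2.5e-4,
    batch_size=32,
)

# The hidden size must split evenly across attention heads.
assert hparams["d_model"] % hparams["num_heads"] == 0
```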

Tensorboard

Run:

tensorboard --logdir ./

References

Implementation notes:

  • WordPiece tokenization
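WordPiece segments each word greedily, longest match first, marking continuation pieces with a `##` prefix. The pure-Python sketch below shows that matching rule; it is illustrative only (in practice a library tokenizer such as TensorFlow Text's would be used).

```python
def wordpiece_tokenize(word, vocab, unk="[UNK]"):
    """Greedy longest-match-first WordPiece segmentation of a single word.

    Illustrative sketch; continuation pieces carry a '##' prefix.
    """
    pieces, start = [], 0
    while start < len(word):
        end, cur = len(word), None
        # Try the longest remaining substring first, shrinking until a match.
        while start < end:
            piece = word[start:end]
            if start > 0:
                piece = "##" + piece
            if piece in vocab:
                cur = piece
                break
            end -= 1
        if cur is None:          # no piece matches: the whole word is unknown
            return [unk]
        pieces.append(cur)
        start = end
    return pieces

vocab = {"un", "##aff", "##able", "##a", "##ff"}
print(wordpiece_tokenize("unaffable", vocab))  # -> ['un', '##aff', '##able']
```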

License

MIT
