wilfreddesert / headline_generation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Paper: Advances of Transformer-Based Models for News Headline Generation

Dependencies

Note that there are Pipfile and Pipfile.lock in src.

As long as you have pipenv installed, you can just run pipenv install and set the environment up.

After that, you can just run pipenv shell.

RuBERT

Download RuBERT from DeepPavlov: http://docs.deeppavlov.ai/en/master/features/models/bert.html and extract the archive to src

Checkpoint

BertSumAbs checkpoint: https://yadi.sk/d/2jcjmdEXp0EX-Q

If you want to use the model, download the checkpoint using the link above and put this file in the rubert_cased_L-12_H-768_A-12_pt folder.

Config

You can easily change any parameters and hyperparameters by modifying config.yaml

1. Data Preprocessing

Run the following command to convert your data to the required format. Pay attention to the samples in the repo and the config file.

python preprocess_input.py --config_path ./config.yaml

Once your data is ready, you can either train your model (again, check the config file to set everything up and then go to step 2) or test your model (go directly to step 3).

2. Model Training

python run.py --mode train --config_path ./config.yaml  

3. Predicting

python run.py --mode test --config_path ./config.yaml  

About


Languages

Language:Python 98.7%Language:Jupyter Notebook 1.3%