To get the dataset run in this order:
1_get_pages.py
to get all the links (or already included in the repo)
2_get_dataset.py
to extract the recipes from the links
3_generate_txt_dataset.py
or 3.1_generate_csv_dataset.py
to generate a single txt/csv file from all the extracted recipes
If you want you can play around with the model in Google colab from this notebook
This project is for demonstration purposes only.