Synthetically generate random text documents with ground truth!
Check here for list of all features.
Note:
This project is only for research purposes like this.
Template | Generated | Augmented |
---|---|---|
- Atleast Python 3.7
pip install -r dependencies.txt
- Check
documentation/Installation
for further instructions
python generate.py <template.json> <num_samples> <output_folder>
Check the templates/
folder for sample document templates.
python augment.py <config.json> <input_folder> <num_epochs> <output_folder> <num_workers>
Check documentation/Augmentation
for more details.
Ensure you have installed StreamLit by pip install streamlit
.
UI to generate document using desired template by filling data manually (for demo purpose)
streamlit run generator_ui.py
ToDo: (Contributions welcome)
- Add augmentation support in UI
- Create another UI for creating templates.
For any problems or queries, please report under the "Issues" tab.
Feel free to contribute by sending a Pull Request.