EADA, a data augment method for NLP tasks
Use the atis dataset as an example,
-
First, run the 'Atis_Dataset_Generate' to generate active_entity and active_packages in Atis folder.
-
Then, run the 'Entity-based_Tree_Atis' to generate entity-based tree for atis dataset. This tree is stored in sentence-simulator-master folder.
-
Last, use run.py in sentence-simulator-master folder, like
python run.py -f TreeSum.json -c 10 -w out/word.txt -s out/sent.txt
Upload all dataset in experiment, which are atis, conll2003 and snips.
Conll2003_Dataset_Generate.py: Generate dataset's entity,packages from Conll2003 dataset.
Atis_Dataset_Generate.py: Generate dataset's entity,packages from Conll2003 dataset.
Snip_Dataset_Generate.py: Generate dataset's entity,packages from Conll2003 dataset.
Entity-based_Tree_Conll2003.py: Generate Entity-based Tree from Conll2003 dataset.
Entity-based_Tree_Atis.py: Generate Entity-based Tree from Aits dataset.
Entity-based_Tree_Snip.py: Generate Entity-based Tree from Snip dataset.
Atis_dataset_Splite.py: generating seq.in,seq.out form dataset for atis dataset