Token level enrichment
mmcauliffe opened this issue
Michael McAuliffe commented
It should be possible to import token properties via a CSV. The workflow imagined is:
- Generate a query matching specific conditions (e.g., all sibilants)
- Export CSV with information for further processing
- Run a script that adds column information or creates a new CSV with relevant identifiers
- Upload new CSV back to ISCAN and enrich tokens with new properties
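
As a rough illustration of the third step, the script could look something like the sketch below. It assumes the exported CSV carries a token ID column; the column names (`phone_id`, `phone_duration`, `long_sibilant`), the sample rows, and the helper `add_token_property` are all hypothetical and are not taken from ISCAN or PolyglotDB.

```python
import csv
import io


def add_token_property(exported_csv, id_column, compute_property, new_column):
    """Return CSV text with only the token ID and one newly computed property.

    Hypothetical helper: keeps the ID column so each row can be matched
    back to its token when the file is uploaded again.
    """
    reader = csv.DictReader(io.StringIO(exported_csv))
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=[id_column, new_column], lineterminator="\n"
    )
    writer.writeheader()
    for row in reader:
        writer.writerow(
            {id_column: row[id_column], new_column: compute_property(row)}
        )
    return out.getvalue()


# Made-up export of a sibilant query (column names are assumptions).
exported = (
    "phone_id,phone_label,phone_duration\n"
    "abc123,s,0.112\n"
    "def456,sh,0.087\n"
)


def is_long(row):
    # Example property: flag tokens longer than 100 ms.
    return "yes" if float(row["phone_duration"]) > 0.1 else "no"


enriched = add_token_property(exported, "phone_id", is_long, "long_sibilant")
print(enriched)
```

Writing out only the ID plus the new column keeps the upload small and makes it clear which properties are being added; carrying the original columns along would work just as well if the enrichment step ignores them.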
For this to be possible, the following steps have to be done:
- Token ID information in query needs to be exportable
- Token enrichment UI needs to be created
- Token enrichment functionality has to be added to the Enrichment model
In addition, new functionality has to be written in PolyglotDB (see MontrealCorpusTools/PolyglotDB#131).
msonderegger commented
whoa cool, steps are now possible..