User Stories
ngawangtrinley opened this issue · comments
# | As a... | I want to... | so that... |
---|---|---|---|
1. | researcher | segment a collection of Tibetan texts | I can run statistics in AntConc |
2. | Tibetan text proofreader | mark potential errors | I can catch and correct more mistakes |
3. | corpus researcher for Amdo dialects | create several custom profiles | I can run statistics on different spoken dialects |
4. | corpus researcher on literary Tibetan | create a custom profile for the Kangyur | I can run accurate statistics on the Kangyur and Tengyur |
5. | | | |
Rule-based segmentation steps (for stories 3 & 4)
- Segment a volume with the default profile
- Create a word list from the volume, ordered by frequency
- Manually clean up the word list
- Use the cleaned word list as the main list
- Segment the volume again
- Edit the custom profile (word / remove / adjustments) until the segmentation is good
- Merge custom profile with main profile
- Repeat with a second volume
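
To make the loop above concrete, here is a minimal Python sketch of the "segment, build a frequency-ordered word list, segment again" cycle. It is an illustration only, not the tool's actual implementation: `segment` is a toy longest-match segmenter, and `segment`, `frequency_list`, and `max_syllables` are hypothetical names introduced for this example. It assumes syllables are delimited by the tsheg (་).

```python
from collections import Counter

TSHEG = "་"  # Tibetan syllable delimiter (assumed here to separate syllables)


def segment(text, word_list, max_syllables=4):
    """Toy longest-match segmenter (hypothetical stand-in for the real tool)."""
    syllables = [s for s in text.split(TSHEG) if s]
    words, i = [], 0
    while i < len(syllables):
        # Try the longest candidate first; fall back to a single syllable.
        for n in range(min(max_syllables, len(syllables) - i), 0, -1):
            candidate = TSHEG.join(syllables[i:i + n])
            if n == 1 or candidate in word_list:
                words.append(candidate)
                i += n
                break
    return words


def frequency_list(words):
    """Return (word, count) pairs ordered by descending frequency."""
    return Counter(words).most_common()


def merge_word_lists(main, custom_add, custom_remove):
    """Merge a custom profile into the main list: add words, then drop removals."""
    return (set(main) | set(custom_add)) - set(custom_remove)
```

A first pass with an empty or default word list yields a frequency list to clean up by hand; the cleaned list is then passed back to `segment` as `word_list`, and the custom additions and removals are folded in with `merge_word_lists` before moving on to the next volume.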