User Stories

Question

ngawangtrinley opened this issue 4 years ago

#	As a...	I want to...	so that...
1.	researcher	segment a collection of Tibetan texts	I can do statistics in AntConc
2.	Tibetan text proofreader	mark potential errors	I can catch and correct more mistakes
3.	corpus researcher for Amdo dialects	create several custom profiles	I can do statistics on different spoken dialects
4.	corpus researcher on literary Tibetan	create a custom profile for the Kangyur	I can do accurate statistics on the Kangyur and Tengyur
5.

Rule-based segmentation steps (for stories 3 & 4)

1. Segment a volume with the default profile
2. Create a word list from the segmented volume, ordered by frequency
3. Manually clean up the word list
4. Use the cleaned word list as the main list
5. Segment the volume again
6. Edit the custom profile (word / remove / adjustments) until the segmentation is good
7. Merge the custom profile with the main profile
8. Repeat with a second volume
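
To make the loop above concrete, here is a minimal Python sketch of one iteration. It is an illustration only: `segment`, `frequency_list` and `merge_profile` are hypothetical helpers, and the greedy longest-match segmenter is a simple stand-in, not the tokenizer this project actually uses. It assumes syllables are separated by the tsheg (U+0F0B) and that a profile can be reduced to a set of words to add and a set of words to remove.

```python
from collections import Counter

TSHEG = "\u0f0b"  # Tibetan syllable delimiter


def segment(text, wordlist, max_syls=4):
    """Greedy longest-match segmentation over tsheg-delimited syllables.

    Stand-in for the real segmenter (assumption): at each position, take the
    longest run of syllables found in `wordlist`, else a single syllable.
    """
    syls = [s for s in text.split(TSHEG) if s]
    words, i = [], 0
    while i < len(syls):
        for n in range(min(max_syls, len(syls) - i), 0, -1):
            candidate = TSHEG.join(syls[i:i + n])
            if n == 1 or candidate in wordlist:
                words.append(candidate)
                i += n
                break
    return words


def frequency_list(words):
    """Return (word, count) pairs, most frequent first."""
    return Counter(words).most_common()


def merge_profile(main_list, custom_words, custom_remove=frozenset()):
    """Merge a custom profile into the main list: add words, then drop removals."""
    return (set(main_list) | set(custom_words)) - set(custom_remove)


# Segment with the default list, then inspect the frequency list.
default_list = {"བཀྲ་ཤིས"}
volume = "བཀྲ་ཤིས་བདེ་ལེགས་བཀྲ་ཤིས་"
first_pass = segment(volume, default_list)
for word, count in frequency_list(first_pass):
    print(count, word)

# After manual cleanup, add a missing word and segment again.
main_list = merge_profile(default_list, {"བདེ་ལེགས"})
second_pass = segment(volume, main_list)
print(" ".join(second_pass))  # space-separated tokens, one way to feed a concordancer
```

The frequency list is what gets cleaned up by hand; whatever survives the cleanup goes into the custom profile, and the merged list becomes the starting point for the next volume.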