Bucket wiki into 3 buckets - evaluation scripts
kudkudak opened this issue · comments
Stanislaw Jastrzebski commented
- Compute thresholds of wiki splitting based on percentiles on samples of wiki 1.6M
- Rescore whole wiki using factorized and prototypical
- Write script taking top100 in each bucket by greedy procedure (very efficient). Eye-ball them.