How to handle short sentences/contexts

Question

raman-r-4978 opened this issue 4 years ago · comments

The default model performs well on the SQuAD v1.1 dataset (where context length is ~700 characters).
But it fails when I try to index my custom data, which has short paragraphs/contexts (~100-150 characters each).
- The problem is that, irrespective of the query, the same (wrong) result is returned as the output
- Most of the time, the result is just a single random character such as ? or . (the end of the context)
- I debugged this and found that the problem lies in the start vectors that we generate from the model output
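
One way to check the symptom described above (the same answer for every query) is to look at which start vector wins for each query. The snippet below is only a minimal sketch: it assumes answers are chosen by maximum inner product between a query vector and the start vectors, and the arrays `start_vecs` and `query_vecs` and all function names are hypothetical stand-ins, not part of the repository's code.

```python
import numpy as np

def top_index_per_query(start_vecs, query_vecs):
    """For each query, return the index of the start vector with the highest inner product."""
    scores = query_vecs @ start_vecs.T  # shape: (num_queries, num_start_vecs)
    return scores.argmax(axis=1)

def dominant_fraction(top_indices):
    """Fraction of queries whose top hit is the single most common start vector."""
    counts = np.bincount(top_indices)
    return counts.max() / len(top_indices)

def norm_outliers(start_vecs, factor=3.0):
    """Indices of start vectors whose L2 norm exceeds `factor` times the median norm."""
    norms = np.linalg.norm(start_vecs, axis=1)
    return np.flatnonzero(norms > factor * np.median(norms))

# Toy data (hypothetical): the third start vector has a much larger norm
# than the others, so it wins for every query below.
start_vecs = np.array([[1.0, 0.0], [0.0, 1.0], [10.0, 10.0]])
query_vecs = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])

top = top_index_per_query(start_vecs, query_vecs)
frac = dominant_fraction(top)
outliers = norm_outliers(start_vecs)
print(top, frac, outliers)
```

If `dominant_fraction` is close to 1 on real data and the winning index also shows up in `norm_outliers`, that would suggest a few start vectors (for example, those at the end-of-context token) are dominating the search regardless of the query.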

Ques:

Setting:
All results were obtained using the commands given in the README.

Jinhyuk Lee · Answer 1 · Fri Jan 31 2020 15:55:29 GMT+0800 (China Standard Time)

Reference: #2