openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Home Page: https://openai.com/blog/better-language-models/

Does the pre-training data also use prompt structures related to downstream tasks?

Aurora-slz opened this issue · comments

I read the GPT-2 paper, but I am not sure whether the pre-training data from WebText includes format information.
For example, we know the data format is `english sentence = french sentence` in the translation task. So during pre-training, is a similar prompt added to the training data?

Thanks!
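For reference, here is a minimal sketch of the `english sentence = french sentence` conditioning format described in the GPT-2 paper for zero-shot translation; the helper name and example pairs are hypothetical, not from this repository:

```python
# Minimal sketch (assumed, not from this repo) of the zero-shot translation
# conditioning format in the GPT-2 paper: the model is conditioned on
# `english sentence = french sentence` pairs, then asked to complete a
# final `english sentence =` line.

def build_translation_prompt(example_pairs, source_sentence):
    """Build a prompt of `english = french` pairs ending with an open pair."""
    lines = [f"{en} = {fr}" for en, fr in example_pairs]
    lines.append(f"{source_sentence} =")  # the model completes the French side
    return "\n".join(lines)

prompt = build_translation_prompt(
    [("hello", "bonjour"), ("thank you", "merci")],
    "good morning",
)
print(prompt)
# hello = bonjour
# thank you = merci
# good morning =
```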

Interested in this too.