AkariAsai / self-rag

This repository contains the original implementation of Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection, by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Home page: https://selfrag.github.io/

Why not directly generate training data for the generator using GPT-4?

kakaxisisan opened this issue

We briefly discussed this in Section 3.2.1 of our paper; there are essentially two reasons:

  1. We have four different types of reflection tokens and insert them almost at the segment level, so running GPT-4 to insert reflection tokens at each segment independently would require on the order of millions of inference calls. This is simply too expensive for us to afford, especially since our model inputs can get long (OpenAI API costs scale with input context length).
  2. We are also concerned that relying entirely on GPT-4 to insert the special tokens could hurt reproducibility down the line, since its behavior has been reported to change over time (e.g., "How Is ChatGPT's Behavior Changing over Time?", Chen et al., 2023).
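To make the cost argument in point 1 concrete, here is a back-of-envelope sketch. All numbers (instance count, segments per instance, token count, price) are illustrative assumptions for the sake of the estimate, not figures from the paper:

```python
# Rough estimate of why segment-level GPT-4 annotation gets expensive.
# Every number below is an illustrative assumption, not from the paper.

def annotation_calls(num_instances: int, segments_per_instance: int,
                     num_token_types: int) -> int:
    """API calls needed if each reflection-token type is predicted
    independently for every segment of every training instance."""
    return num_instances * segments_per_instance * num_token_types

def estimated_cost_usd(calls: int, avg_input_tokens: int,
                       price_per_1k_input_tokens: float) -> float:
    """Cost is dominated by input context length, since the output
    per call is just a short reflection token."""
    return calls * (avg_input_tokens / 1000) * price_per_1k_input_tokens

# Hypothetical: 150k instances, ~5 segments each, 4 reflection-token types.
calls = annotation_calls(150_000, 5, 4)
# Hypothetical: ~1.5k input tokens per call at $0.03 per 1k input tokens.
cost = estimated_cost_usd(calls, 1_500, 0.03)
print(f"{calls:,} calls, ~${cost:,.0f}")  # → 3,000,000 calls, ~$135,000
```

Even with modest assumptions, the call count lands in the millions because the three factors multiply, which is why the paper distills the critic into a much cheaper local model instead.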

I am closing this issue for now but feel free to reopen it!