Two questions about the paper that I'd like to confirm
Young1993 opened this issue
shimin commented
Hi,
I think this is very interesting work, and I have two questions I'd like to check:
- For Schema-Based Prompt DST w/ Independent Decoding, during inference, does the model predict the domain, slot, and value in parallel or in sequence? I'm unsure whether T5 has to run once per domain (8 times for 8 domains in each sample), then predict the slots in each domain, and finally predict each slot value in many further passes. How do you get all of this in one go?
- The example with descriptions impressed me a lot: the model converts 4:45 PM to 16:45, which is quite amazing. How does the model manage that?
Looking forward to your reply.
Best
Chia-Hsuan Lee commented
Hi, thanks for the interest!
- You are right about independent decoding. Say there are 8 domains and each domain has 6 slots. Then for a single turn, the model needs to run 8 * 6 = 48 predictions. These predictions are independent of one another, so they can run fully in parallel (no order is required).
- In this example, the model is not just copying from the input. Our hypothesis is that the model leverages the slot description and knows that the value to predict is time-related.
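
To make the 8 * 6 = 48 count concrete, here is a minimal sketch of how one prompt per (domain, slot) pair could be built and decoded as a single batch. This is an illustration only, not the repository's actual code: the prompt template, the schema contents, and the names `build_prompts`, `decode_independently`, and `dummy_predict_batch` are all assumptions, and the dummy predictor stands in for a real batched T5 call.

```python
def build_prompts(dialogue, schema):
    """Return one (domain, slot, prompt) triple per slot in the schema."""
    triples = []
    for domain, slots in schema.items():
        for slot in slots:
            # Hypothetical prompt template; the paper's exact format may differ.
            prompt = f"{dialogue} [domain] {domain} [slot] {slot}"
            triples.append((domain, slot, prompt))
    return triples


def decode_independently(dialogue, schema, predict_batch):
    """Predict every slot value on its own; no prediction depends on another."""
    triples = build_prompts(dialogue, schema)
    # All prompts go to the model as one batch, so order does not matter.
    values = predict_batch([prompt for _, _, prompt in triples])
    return {(domain, slot): value
            for (domain, slot, _), value in zip(triples, values)}


def dummy_predict_batch(prompts):
    """Stand-in for a batched seq2seq model call (assumption, not a real model)."""
    return ["none"] * len(prompts)


# Hypothetical schema: 8 domains with 6 slots each.
schema = {f"domain_{i}": [f"slot_{j}" for j in range(6)] for i in range(8)}

state = decode_independently(
    "[user] I need a taxi at 4:45 PM", schema, dummy_predict_batch
)
print(len(state))  # 48
```

Because each prompt is self-contained, the 48 inputs can be split across batches or devices in any order without changing the result.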
shimin commented
OK, THX!