Thank you very much for your code.
When I train the dpvraft model, I get an error when the drop_loss is calculated: the shape of drop_conf is (2, 8192), but the shape of drop_conf_gt is (1, 16384). How can I solve this problem?
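
One thing I noticed is that both shapes contain the same number of elements. The small check below only does arithmetic on the two reported shapes; it does not touch the actual training code. The helper name `same_element_count` is mine, not from the repository. My guess (an assumption, not something I have confirmed) is that the ground truth is being flattened across the batch dimension somewhere, or that the batch size of the two tensors differs.

```python
from math import prod

# Shapes copied from the error; the names mirror the variables in the training code.
drop_conf_shape = (2, 8192)
drop_conf_gt_shape = (1, 16384)


def same_element_count(shape_a, shape_b):
    """Return True if two tensor shapes hold the same total number of elements."""
    return prod(shape_a) == prod(shape_b)


# Both products are 2 * 8192 = 16384, so the tensors differ only in layout.
print(prod(drop_conf_shape), prod(drop_conf_gt_shape))
print(same_element_count(drop_conf_shape, drop_conf_gt_shape))
```

If that guess is right, reshaping drop_conf_gt to match drop_conf might silence the error, but I am not sure whether the element order would still line up, so I would rather know what the intended shapes are.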