rakshithShetty / captionGAN

Source code for the paper "Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training"

Question about your data file

Noahsark opened this issue

Dear Rakshith,

May I ask about the exact format of your file "resnet150_2048-mean.npy"?

  1. Does it have to be in the same order as the items in your "labels.txt" file and in "fasterRcnn_clasDetFEat80.npy"?

  2. Running the scripts under https://github.com/akirafukui/vqa-mcb/tree/master/preprocess extracts features with dimensions 2048x14x14. Did you then compute the mean over each 14x14 block? Is that the content of the file "resnet150_2048-mean.npy"?

Thank you so much.
Best Regards,
Li

Additionally, may I ask how to pre-train the generator before the adversarial training in your code?
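
For anyone following the thread, the averaging asked about in question 2 could look roughly like the sketch below. It is not confirmed to be how "resnet150_2048-mean.npy" was actually produced. The function name `mean_pool_features` and the random stand-in array are hypothetical, and the (N, 2048, 14, 14) layout is assumed from the dimensions mentioned in the question.

```python
import numpy as np


def mean_pool_features(feature_maps):
    """Average each channel over its spatial grid: (N, C, H, W) -> (N, C)."""
    feature_maps = np.asarray(feature_maps)
    if feature_maps.ndim != 4:
        raise ValueError("expected an array shaped (N, C, H, W)")
    # Collapse the two spatial axes (H and W), keeping one value per channel.
    return feature_maps.mean(axis=(2, 3))


# Random stand-in for extracted features: 3 images, 2048 channels, 14x14 grid.
# Real features would come from the preprocessing scripts instead (assumption).
rng = np.random.default_rng(0)
fake_maps = rng.standard_normal((3, 2048, 14, 14))

pooled = mean_pool_features(fake_maps)
print(pooled.shape)  # (3, 2048)
```

If the rows do have to line up with "labels.txt", the images would need to be pooled and stacked in that same order before saving; whether that is required is exactly what question 1 asks.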

Hello, I have not been able to extract the file resnet150_2048-mean.npy. Could you please upload it if it still exists? Thank you very much; I look forward to your reply.