how to reproduce the scores in paper?

Question

how to reproduce the scores in paper?

ivy94419 opened this issue 6 years ago · comments

I have train a model and the parameters are set similar with yours:
20 epochs
~80,000 train set
~4000 val set
but I only got 19.3 bleu-4 in epoch 8, but the paper has achieved 24.3 bleu-4

When continue training, the scores decrease as follows:
Epoch bleu4
-------------
15 18.28
16 17.71
17 17.79
18 17.72
19 17.43
20 17.35

I am a beginner in deep learning, and how should I adjust parameters or other things to get higher scores?

Mukesh Mithrakumar · Answer 1 · Thu Jul 19 2018 08:42:47 GMT+0800 (China Standard Time)

Are you using the same code from the notebook? I ask since most couldn't even get it to run

ivy94419 · Answer 2 · Thu Jul 19 2018 09:22:02 GMT+0800 (China Standard Time)

@mukeshmithrakumar Yes I run the same code, and I used Python 3.6, Tensorflow 1.4, it can run on both Windows10 and ubuntu, I only change some trivial code for adaptation.

Mukesh Mithrakumar · Answer 3 · Fri Jul 20 2018 00:55:59 GMT+0800 (China Standard Time)

Hi @ivy94419 I will test the code and will get back to you

Rijul Dhir · Answer 4 · Fri Aug 17 2018 21:14:49 GMT+0800 (China Standard Time)

Hi @ivy94419 @mukeshmithrakumar I am able to train the model (python 2.7 tensorflow = 1 4)but could not evaluate them.
Any help regarding it would be helpful

Mukesh Mithrakumar · Answer 5 · Fri Aug 17 2018 21:18:58 GMT+0800 (China Standard Time)

Hi @rijuldhir, I wanted this to be a benchmark and it turned out to be a lot of hustle to get it to predict something so moved on to a different model. Will let you know if I come back to this but I highly doubt anytime soon. You will have better luck with asking ivy

ivy94419 · Answer 6 · Fri Aug 17 2018 21:21:56 GMT+0800 (China Standard Time)

@rijuldhir what errors have you faced during evaluating?

Rijul Dhir · Answer 7 · Fri Aug 17 2018 23:06:03 GMT+0800 (China Standard Time)

When i run evaluate_model.ipynb
I get the following error
InvalidArgumentError (see above for traceback): Assign requires shapes of both tensors to match. lhs shape= [1500] rhs shape= [1024]
[[Node: save/Assign_7 = Assign[T=DT_FLOAT, _class=["loc:@initial_lstm/b_c"], use_locking=true, validate_shape=true, _device="/job:localhost/replica:0/task:0/device:GPU:0"](initial_lstm/b_c, save/RestoreV2_7/_29)]]
[[Node: save/RestoreV2_19/_14 = _SendT=DT_FLOAT, client_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device_incarnation=1, tensor_name="edge_62_save/RestoreV2_19", _device="/job:localhost/replica:0/task:0/device:CPU:0"]]

@ivy94419 In case you know it shapes are not equal?

ivy94419 · Answer 8 · Sat Aug 18 2018 20:49:10 GMT+0800 (China Standard Time)

@rijuldhir Sorry for late reply, try to change this to 1024

Rijul Dhir · Answer 9 · Sat Aug 18 2018 23:13:05 GMT+0800 (China Standard Time)

Thanks for the reply @ivy94419 .
I actually got it while checking the code.
Did you got the BLEU scores as given in the paper as I got the max of BLEU-4 19.2 only?
Do I need to make any changes to the code?

ivy94419 · Answer 10 · Sun Aug 19 2018 09:38:39 GMT+0800 (China Standard Time)

@rijuldhir I only achieved 19.3 BLEU-4 as mentioned above ...

Rijul Dhir · Answer 11 · Sun Sep 09 2018 15:08:06 GMT+0800 (China Standard Time)

@ivy94419 I have been trying to change vgg network from vgg19 to vgg16 in the code but I am getting some errors.
Any chance you know what's the problem?

gorgeousyouth · Answer 12 · Tue Nov 27 2018 17:07:17 GMT+0800 (China Standard Time)

do you get the same score in the paper？thanks