atulkum / pointer_summarizer

PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"


Decoder output

ankitnit opened this issue · comments

I am getting the same output for all the batches.

You mean you are getting similar output within a single batch? During decode the batch size is the same as the beam size, so similar output within a single batch is expected.
You can compare with the output I got here.

No, I have trained the model for 410,000 iterations with is_coverage=True,
and I am getting the same summary for every batch when I run decode.py.

You might not want to train with is_coverage=True from the start; that makes training unstable. Try training with is_coverage=False first, then compare the results.
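
For reference, the paper uses the same two-phase schedule: train the pointer-generator without coverage, then fine-tune briefly with the coverage loss. A rough sketch of that schedule, assuming the repo's is_coverage flag lives in a config module; train() is a hypothetical stand-in for the training loop, and the iteration counts are the paper's ballpark figures, not verified against this codebase:

import config  # wherever the repo's is_coverage flag lives

# Phase 1: train the pointer-generator alone (coverage loss disabled).
config.is_coverage = False
train(iters=230000)  # hypothetical training loop; ~230k iterations in the paper

# Phase 2: switch the coverage loss on for a short fine-tune only.
config.is_coverage = True
train(iters=3000)    # ~3k iterations in the paper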

Can you send me some example output? I want to see whether it is a random string.

I tried with is_coverage=False, ran 175,000 iterations, and compared the results; it still generates the same summary for every batch.

Previous result with is_coverage=True: https://drive.google.com/open?id=1vjMsFpoQxdSQCukLxnRNK_ghjJD3ENnG

Just check your input. The batcher expects strings, not raw bytes. If your inputs are read as bytes, decode them to strings first.

Have you solved the problem, @ankitnit?

As @DominicSong said, I solved my problem by converting the article and abstract from bytes to strings, around line 217 of batcher.py. I run in Python 3.
article = str(article, encoding='utf-8')
abstract = str(abstract, encoding='utf-8')
abstract_sentences = [sent.strip() for sent in data.abstract2sents(abstract)]
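
For robustness, a small helper can make the batcher tolerate both str and bytes inputs. This is a minimal sketch, not part of the repo; ensure_text is a hypothetical name, and it assumes the fields come back as bytes because the tf.Example features read from the .bin files are byte strings in Python 3:

# Hypothetical helper, not in the repo: accept both str and bytes.
def ensure_text(value, encoding='utf-8'):
    if isinstance(value, bytes):
        return value.decode(encoding)
    return value

article = ensure_text(article)
abstract = ensure_text(abstract)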

Hi, I don't understand why the beam size and batch size are equal in the current decode. When I set them equal the code works fine; otherwise it throws a dimension-mismatch error. I believe the two are independent, so might there be a better decode implementation?

This is done to take advantage of GPU batch processing. If the beam size is B, you need to run B RNN decoders in parallel; treating the beam as a batch keeps the decoding code cleaner and computationally efficient on the GPU. If you want a larger beam, increase the batch size.
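
For context, here is a minimal, self-contained sketch of "beam as batch" decoding. It is illustrative only: dummy_decoder_step stands in for the real attention/RNN decoder, and the names do not match the repo's decode.py. The point is that the last tokens of all B live hypotheses are stacked into one tensor, so each decoder step is a single batched forward pass, which is why the decode batch size equals the beam size.

import torch

def dummy_decoder_step(tokens):
    # Stand-in for one step of the decoder: returns fake next-token
    # log-probabilities for each hypothesis in the "batch".
    torch.manual_seed(int(tokens.sum()))
    return torch.log_softmax(torch.randn(tokens.size(0), 50), dim=-1)

def beam_search(start_token, beam_size=4, max_steps=5):
    hyps = [([start_token], 0.0)]  # each hypothesis: (token ids, summed log prob)
    for _ in range(max_steps):
        # Stack the last token of every live hypothesis into ONE batch, so a
        # single forward pass advances all of them: batch size == beam size.
        last = torch.tensor([tokens[-1] for tokens, _ in hyps])
        log_probs = dummy_decoder_step(last)               # (num_hyps, vocab)
        topk_lp, topk_ids = log_probs.topk(beam_size, dim=-1)
        candidates = [
            (tokens + [topk_ids[i, j].item()], score + topk_lp[i, j].item())
            for i, (tokens, score) in enumerate(hyps)
            for j in range(beam_size)
        ]
        # Keep the best beam_size hypotheses by length-normalized score.
        hyps = sorted(candidates, key=lambda h: h[1] / len(h[0]),
                      reverse=True)[:beam_size]
    return hyps

print(beam_search(start_token=1))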