question regarding bilstm-tagger.py

mehdimashayekhi opened this issue 6 years ago · comments

Hi, thanks for sharing the source code; I learned a lot. I have a quick question about the "Reinforce score": why is it computed as score * reward, as done here:

line 126 of bilstm-tagger.py

```
# then calculate the reinforce scores using reinforce
reinforce_scores = [r_s*score for r_s, score in zip(rewards_over_baseline, scores)]
```

Mehdi Mashayekhi · Answer 1 · Mon Jul 09 2018 07:00:56 GMT+0800 (China Standard Time)

Never mind, I just understood the logic/math. Thanks, though.
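For readers with the same question, here is a minimal sketch of the likely reasoning. It is not taken from bilstm-tagger.py. It assumes that each entry of `scores` is the log-probability of a sampled action (or its negative, if it is used as a loss), so that differentiating `(reward - baseline) * score` gives the REINFORCE gradient estimate. The toy softmax policy, the helper names, and the numbers below are all hypothetical. The check compares the exact gradient of the expected reward with the expectation of the REINFORCE term and shows that they agree for any constant baseline.

```python
import math

def softmax(logits):
    """Turn logits into action probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def exact_gradient(logits, rewards):
    """Gradient of J = sum_a p(a) * r(a) with respect to each logit."""
    probs = softmax(logits)
    expected = sum(p * r for p, r in zip(probs, rewards))
    return [p * (r - expected) for p, r in zip(probs, rewards)]

def reinforce_gradient(logits, rewards, baseline):
    """Expectation over actions of (r(a) - baseline) * d log p(a) / d logit_k."""
    probs = softmax(logits)
    n = len(logits)
    grad = [0.0] * n
    for a in range(n):
        # Probability of sampling action a, times its reward over the baseline.
        weight = probs[a] * (rewards[a] - baseline)
        for k in range(n):
            # Derivative of log softmax: 1[a == k] - p(k).
            dlogp = (1.0 if a == k else 0.0) - probs[k]
            grad[k] += weight * dlogp
    return grad

# Hypothetical toy policy with three actions.
logits = [0.2, -1.0, 0.5]
rewards = [1.0, 0.0, 3.0]

exact = exact_gradient(logits, rewards)
estimate = reinforce_gradient(logits, rewards, baseline=1.5)
no_baseline = reinforce_gradient(logits, rewards, baseline=0.0)
```

In this sketch the baseline changes only the variance of a sampled estimate, not its expectation, which is presumably why the code multiplies by `rewards_over_baseline` rather than by the raw rewards.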