datamllab / rlcard

Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.

Home Page: http://www.rlcard.org

In Mahjong game prediction, the order of state['current_hand'] appears to influence the result of eval_step. What could be the reason?

jacy opened this issue · comments


Found the root cause: in the Mahjong extract_state function, raw_legal_actions and legal_actions don't match. legal_actions is built from the unique cards in the player's hand, whereas raw_legal_actions is the full list of cards in the hand, duplicates included.
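
To illustrate how such a mismatch could make the result depend on hand order, here is a minimal standalone sketch. It does not use RLCard's code: the card names, the CARD2INDEX table, and the helper functions are all hypothetical, and it assumes that whatever consumes the state pairs the i-th entry of raw_legal_actions with the i-th key of legal_actions.

```python
from collections import OrderedDict

# Hypothetical card-to-id table for illustration; not RLCard's actual encoding.
CARD2INDEX = {"bamboo-1": 0, "bamboo-2": 1, "dots-5": 2}


def build_actions(hand):
    """Reproduce the mismatch: a raw list with duplicates, unique action ids."""
    raw_legal_actions = list(hand)  # full hand, duplicates kept
    legal_actions = OrderedDict((CARD2INDEX[c], None) for c in hand)  # unique ids
    return raw_legal_actions, legal_actions


def pair_by_position(raw_legal_actions, legal_actions):
    """Pair the i-th raw action with the i-th action id (assumed consumer behaviour)."""
    ids = list(legal_actions.keys())
    return [(raw_legal_actions[i], ids[i]) for i in range(len(ids))]


def deduplicate(hand):
    """One possible remedy: drop duplicates from the raw list, keeping first-seen order."""
    return list(dict.fromkeys(hand))


hand_a = ["bamboo-1", "bamboo-1", "bamboo-2", "dots-5"]
hand_b = ["dots-5", "bamboo-2", "bamboo-1", "bamboo-1"]  # same cards, different order

pairs_a = pair_by_position(*build_actions(hand_a))
pairs_b = pair_by_position(*build_actions(hand_b))

print(pairs_a)
print(pairs_b)
print(deduplicate(hand_a))
```

With hand_a, the duplicate bamboo-1 shifts the positions, so the second bamboo-1 ends up paired with the id of bamboo-2 and dots-5 is dropped. With hand_b, the duplicates sit at the end and every pairing happens to be correct. Deduplicating the raw list so that both structures have the same length and order is one way to remove the dependence on hand order in this sketch.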