iandanforth / smaction

Softmax Action Detection Visualization

Home Page:https://iandanforth.github.io/smaction/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Softmax Action Detection Visualization

Visualization Demo

Description of Softmax Action Selection

The impact of temperature (tau) in the softmax equation on the probability of an action being selected may not be immediately obvious.

This visualization is a simple way to see that impact.

Things to try

  • Set temperature = 1
    • Set the value of 'a' near the value of 'b'. Notice how small changes in value in this regime have large impacts.
    • Set temperature to 1000 and try again.
  • Try to fully recover the equiprobable action selection policy.

About

Softmax Action Detection Visualization

https://iandanforth.github.io/smaction/


Languages

Language:JavaScript 94.7%Language:CSS 3.7%Language:HTML 1.6%