CS-747 Intelligent and Learning Agents
Assignments
Assignment 1
- Multi-armed bandit problem using Thompson Sampling
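
For orientation, here is a minimal sketch of Thompson Sampling on a Bernoulli bandit. It is not the assignment's code. The function name `thompson_sampling`, the uniform Beta(1, 1) prior, and the example arm means are assumptions made for illustration.

```python
import random


def thompson_sampling(true_means, horizon, seed=0):
    """Run Thompson Sampling on a Bernoulli bandit (illustrative sketch).

    true_means: hypothetical success probability of each arm.
    Returns (pulls per arm, total reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    successes = [0] * n_arms
    failures = [0] * n_arms
    total_reward = 0
    for _ in range(horizon):
        # Draw one sample per arm from its Beta posterior
        # (assumed Beta(1, 1) prior, so the parameters are counts + 1).
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Pull the arm whose sampled mean is largest.
        arm = max(range(n_arms), key=lambda a: samples[a])
        reward = 1 if rng.random() < true_means[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1
        total_reward += reward
    pulls = [successes[a] + failures[a] for a in range(n_arms)]
    return pulls, total_reward
```

Calling `thompson_sampling([0.1, 0.9], 2000)` should concentrate most pulls on the second arm, since its posterior samples dominate once a few rewards have been observed.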
Assignment 2
- Markov Decision Process Planning
- Value Iteration
- Policy Iteration
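
The two planning methods listed above can be sketched as follows. This is a rough illustration, not the assignment's implementation. The MDP encoding (`P[s][a]` as a list of `(probability, next_state, reward)` tuples), the function names, and the tolerance values are all assumptions.

```python
def q_values(P, V, s, gamma):
    """Action values of state s under value estimate V."""
    return [
        sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
        for outcomes in P[s]
    ]


def value_iteration(P, gamma, tol=1e-10):
    """Apply Bellman optimality backups until the largest change is below tol."""
    V = [0.0] * len(P)
    while True:
        delta = 0.0
        for s in range(len(P)):
            best = max(q_values(P, V, s, gamma))
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break
    policy = []
    for s in range(len(P)):
        q = q_values(P, V, s, gamma)
        policy.append(max(range(len(q)), key=q.__getitem__))
    return V, policy


def policy_iteration(P, gamma, tol=1e-10):
    """Alternate iterative policy evaluation and greedy improvement."""
    policy = [0] * len(P)
    V = [0.0] * len(P)
    while True:
        # Policy evaluation: back up V under the current fixed policy.
        while True:
            delta = 0.0
            for s in range(len(P)):
                v = q_values(P, V, s, gamma)[policy[s]]
                delta = max(delta, abs(v - V[s]))
                V[s] = v
            if delta < tol:
                break
        # Policy improvement: switch only to strictly better actions.
        stable = True
        for s in range(len(P)):
            q = q_values(P, V, s, gamma)
            best = max(range(len(q)), key=q.__getitem__)
            if q[best] > q[policy[s]] + 1e-12:
                policy[s] = best
                stable = False
        if stable:
            return V, policy


# Hypothetical two-state MDP: staying in state 1 pays 2 per step,
# moving from state 0 to state 1 pays 1, everything else pays 0.
EXAMPLE_MDP = [
    [[(1.0, 0, 0.0)], [(1.0, 1, 1.0)]],
    [[(1.0, 1, 2.0)], [(1.0, 0, 0.0)]],
]
```

On `EXAMPLE_MDP` with `gamma = 0.9`, both functions should agree on the policy "move to state 1, then stay", with values close to 19 and 20 for states 0 and 1.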
Assignment 3
- SARSA(λ)
- Trace accumulation
- Trace replacement
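
The difference between the two trace variants above can be seen in a single tabular update. The sketch below is illustrative only. The function name `sarsa_lambda_step`, the list-of-lists tables `Q` and `E`, and the `trace` argument are assumptions, and the caller is assumed to keep Q at zero for terminal states.

```python
def sarsa_lambda_step(Q, E, s, a, r, s2, a2, alpha, gamma, lam,
                      trace="accumulating"):
    """Apply one tabular SARSA(lambda) update in place and return the TD error.

    Q and E are lists of lists indexed as [state][action].
    """
    # TD error for the transition (s, a, r, s2, a2).
    delta = r + gamma * Q[s2][a2] - Q[s][a]
    if trace == "accumulating":
        E[s][a] += 1.0   # repeated visits stack on top of the decayed trace
    elif trace == "replacing":
        E[s][a] = 1.0    # repeated visits reset the trace to 1
    else:
        raise ValueError("trace must be 'accumulating' or 'replacing'")
    # Update every state-action pair in proportion to its trace, then decay.
    for i in range(len(Q)):
        for j in range(len(Q[i])):
            Q[i][j] += alpha * delta * E[i][j]
            E[i][j] *= gamma * lam
    return delta
```

When the same state-action pair is visited twice in a row, the accumulating variant leaves a trace larger than 1 before decay, so the second update is larger than under the replacing variant.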