Applications of Reinforcement Learning, A Modified Action Value Method Applied to n-Armed Bandit Problem

Citation:

Unnikrishnan PC, Ramakrishna, Naga Venkata G. Applications of Reinforcement Learning, A Modified Action Value Method Applied to n-Armed Bandit Problem. In: SETAC. Ibri College of Technology, Sultanate of Oman; 2012.