Modified Action Value Method Applied to ‘n’-Armed Bandit Problems using Reinforcement Learning

Citation:

Unnikrishnan PC, Vijayakumar P. Modified Action Value Method Applied to ‘n’-Armed Bandit Problems using Reinforcement Learning. International Journal of Engineering Science and Technology. 2012;4(12):4711-4716.