Modified Action Value Method Applied to ‘n’-Armed Bandit Problems using Reinforcement Learning
Citation:
Unnikrishnan PC, Vijayakumar P. Modified Action Value Method Applied to ‘n’-Armed Bandit Problems using Reinforcement Learning. International Journal of Engineering Science and Technology. 2012;4(12):4711-4716.