Goletty :: BROWSE

Proposal of an Exploitation-oriented Learning Method on Multiple Rewards and Penalties Environments and the Design Guideline

Journal Title	Journal of Computers
Journal Abbreviation	jcp
Publisher Group	Academy Publisher
Website	http://ojs.academypublisher.com

PDF (1,065 kb)


Title	Proposal of an Exploitation-oriented Learning Method on Multiple Rewards and Penalties Environments and the Design Guideline
Authors	Miyazaki, Kazuteru
Abstract	Among machine-learning (RL) focuses most on goal-directed learning from interaction. Despite important applications, RL is difficult to design to fit real-world problems because, first, interaction requires too many trial-and-error searches and, second, no guidelines exist on how to design reward and penalty signal values. We are interested in approaches treating reward and penalty signals independently and not assigning them values. We also want to reduce the number of trial-and-error searches by strongly enhancing successful experience — a process known as exploitationoriented learning (XoL). Though there are many XoL methods, they cannot apply to multiple rewards and penalties environments adequately. In this paper, we propose a new XoL method that can treat multiple rewards and penalties effectively. We present simulation and experimental results to show the effectiveness of our proposal. Furthermore, we describe the design guideline about rewards and penalties for the XoL methods.
Publisher	ACADEMY PUBLISHER
Date	2013-07-01
Source	Journal of Computers Vol 8, No 7 (2013): Special Issue: Advances in Internet Technologies and Applications
Rights	Copyright © ACADEMY PUBLISHER - All Rights Reserved.To request permission, please check out URL: http://www.academypublisher.com/copyrightpermission.html.