Please use this identifier to cite or link to this item:
https://hdl.handle.net/1959.11/18663
Title: | A Comparison Study of Cooperative Q-learning Algorithms for Independent Learners | Contributor(s): | Abed-Alguni, Bilal (author); Paul, David (author) ; Chalup, Stephan (author); Henskens, Frans (author) | Publication Date: | 2016 | Handle Link: | https://hdl.handle.net/1959.11/18663 | Abstract: | Cooperative reinforcement learning algorithms such as BEST-Q, AVE-Q, PSO-Q, and WSS use Q-value sharing strategies between reinforcement learners to accelerate the learning process. This paper presents a comparison study of the performance of these cooperative algorithms as well as an algorithm that aggregates their results. In addition, this paper studies the effects of the frequency of Q-value sharing on the learning speed of the independent learners that share their Q-values among each other. The algorithms are compared using the taxi problem (multi-task problem) and different instances of the shortest path problem (single-task problem). The experimental results when learners have equal levels of experience suggest that sharing of Q-values is not beneficial and produces similar results to single agent Q-learning. However, the experimental results when learners have different levels of experience suggest that most of the cooperative Q-learning algorithms perform similarly, but better than single agent Q-learning, especially when Q-value sharing is highly frequent. This paper then places Q-value sharing in the context of modern reinforcement learning techniques and suggests some future directions for research. | Publication Type: | Journal Article | Source of Publication: | International Journal of Artificial Intelligence, 14(1), p. 71-93 | Publisher: | Centre for Environment, Social and Economic Research Publications | Place of Publication: | India | ISSN: | 0974-0635 | Fields of Research (FoR) 2008: | 080199 Artificial Intelligence and Image Processing not elsewhere classified | Fields of Research (FoR) 2020: | 460202 Autonomous agents and multiagent systems | Socio-Economic Objective (SEO) 2008: | 970108 Expanding Knowledge in the Information and Computing Sciences | Socio-Economic Objective (SEO) 2020: | 220403 Artificial intelligence | Peer Reviewed: | Yes | HERDC Category Description: | C1 Refereed Article in a Scholarly Journal | Publisher/associated links: | http://www.ceser.in/ceserp/index.php/ijai/article/view/42533 |
---|---|
Appears in Collections: | Journal Article |
Files in This Item:
File | Description | Size | Format |
---|
Page view(s)
1,224
checked on Dec 10, 2023
Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.