Please use this identifier to cite or link to this item:
https://hdl.handle.net/1959.11/18663
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Abed-Alguni, Bilal | en |
dc.contributor.author | Paul, David | en |
dc.contributor.author | Chalup, Stephan | en |
dc.contributor.author | Henskens, Frans | en |
dc.date.accessioned | 2016-02-25T17:16:00Z | - |
dc.date.issued | 2016 | - |
dc.identifier.citation | International Journal of Artificial Intelligence, 14(1), p. 71-93 | en |
dc.identifier.issn | 0974-0635 | en |
dc.identifier.uri | https://hdl.handle.net/1959.11/18663 | - |
dc.description.abstract | Cooperative reinforcement learning algorithms such as BEST-Q, AVE-Q, PSO-Q, and WSS use Q-value sharing strategies between reinforcement learners to accelerate the learning process. This paper presents a comparison study of the performance of these cooperative algorithms as well as an algorithm that aggregates their results. In addition, this paper studies the effects of the frequency of Q-value sharing on the learning speed of the independent learners that share their Q-values among each other. The algorithms are compared using the taxi problem (multi-task problem) and different instances of the shortest path problem (single-task problem). The experimental results when learners have equal levels of experience suggest that sharing of Q-values is not beneficial and produces similar results to single agent Q-learning. However, the experimental results when learners have different levels of experience suggest that most of the cooperative Q-learning algorithms perform similarly, but better than single agent Q-learning, especially when Q-value sharing is highly frequent. This paper then places Q-value sharing in the context of modern reinforcement learning techniques and suggests some future directions for research. | en |
dc.language | en | en |
dc.publisher | Centre for Environment, Social and Economic Research Publications | en |
dc.relation.ispartof | International Journal of Artificial Intelligence | en |
dc.title | A Comparison Study of Cooperative Q-learning Algorithms for Independent Learners | en |
dc.type | Journal Article | en |
dc.subject.keywords | Artificial Intelligence and Image Processing | en |
local.contributor.firstname | Bilal | en |
local.contributor.firstname | David | en |
local.contributor.firstname | Stephan | en |
local.contributor.firstname | Frans | en |
local.subject.for2008 | 080199 Artificial Intelligence and Image Processing not elsewhere classified | en |
local.subject.seo2008 | 970108 Expanding Knowledge in the Information and Computing Sciences | en |
local.profile.school | School of Science and Technology | en |
local.profile.email | Bilal.Abedalguni@uon.edu.au | en |
local.profile.email | dpaul4@une.edu.au | en |
local.profile.email | Stephan.Chalup@newcastle.edu.au | en |
local.profile.email | Frans.Henskens@newcastle.edu.au | en |
local.output.category | C1 | en |
local.record.place | au | en |
local.record.institution | University of New England | en |
local.identifier.epublicationsrecord | une-20160225-094319 | en |
local.publisher.place | India | en |
local.format.startpage | 71 | en |
local.format.endpage | 93 | en |
local.peerreviewed | Yes | en |
local.identifier.volume | 14 | en |
local.identifier.issue | 1 | en |
local.contributor.lastname | Abed-Alguni | en |
local.contributor.lastname | Paul | en |
local.contributor.lastname | Chalup | en |
local.contributor.lastname | Henskens | en |
dc.identifier.staff | une-id:dpaul4 | en |
local.profile.orcid | 0000-0002-2428-5667 | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.profile.role | author | en |
local.identifier.unepublicationid | une:18867 | en |
dc.identifier.academiclevel | Academic | en |
dc.identifier.academiclevel | Academic | en |
local.title.maintitle | A Comparison Study of Cooperative Q-learning Algorithms for Independent Learners | en |
local.output.categorydescription | C1 Refereed Article in a Scholarly Journal | en |
local.relation.url | http://www.ceser.in/ceserp/index.php/ijai/article/view/42533 | en |
local.search.author | Abed-Alguni, Bilal | en |
local.search.author | Paul, David | en |
local.search.author | Chalup, Stephan | en |
local.search.author | Henskens, Frans | en |
local.uneassociation | Unknown | en |
local.year.published | 2016 | en |
local.subject.for2020 | 460202 Autonomous agents and multiagent systems | en |
local.subject.seo2020 | 220403 Artificial intelligence | en |
local.codeupdate.date | 2021-11-01T11:09:17.204 | en |
local.codeupdate.eperson | dpaul4@une.edu.au | en |
local.codeupdate.finalised | true | en |
local.original.for2020 | undefined | en |
local.original.seo2020 | 280115 Expanding knowledge in the information and computing sciences | en |
Appears in Collections: | Journal Article |
Files in This Item:
File | Description | Size | Format |
---|
Page view(s)
1,224
checked on Dec 10, 2023
Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.