Selectively Decentralized Q-Learning

dc.contributor.authorNguyen, Thanh
dc.contributor.authorMukhopadhyay, Snehasis
dc.contributor.departmentComputer and Information Science, School of Scienceen_US
dc.date.accessioned2018-09-27T18:49:41Z
dc.date.available2018-09-27T18:49:41Z
dc.date.issued2017-10
dc.description.abstractIn this paper, we explore the capability of selectively decentralized Q-learning approach in learning how to optimally stabilize control systems, as compared to the centralized approach. We focus on problems in which the systems are completely unknown except the possible domain knowledge that allow us to decentralize into subsystems. In selective decentralization, we explore all of the possible communication policies among subsystems and use the cumulative gained Q-value as the metric to decide which decentralization scheme should be used for controlling. The results show that the selectively decentralized approach not only stabilizes the system faster but also shows superior converging speed on gained Q-value in different systems with different interconnection strength. In addition, the selectively decentralized converging time does not seem to grow exponentially with the system dimensionality. Practically, this fact implies that the selectively decentralized Q-learning could be used as an alternative approach in large-scale unknown control system, where in theory, the Hamilton-Jacobi-Bellman-equation approach is difficult to derive the close-form solution.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationNguyen, T., & Mukhopadhyay, S. (2017). Selectively decentralized Q-learning. In 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) (pp. 328–333). https://doi.org/10.1109/SMC.2017.8122624en_US
dc.identifier.urihttps://hdl.handle.net/1805/17396
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.isversionof10.1109/SMC.2017.8122624en_US
dc.relation.journal2017 IEEE International Conference on Systems, Man, and Cyberneticsen_US
dc.rightsPublisher Policyen_US
dc.sourceAuthoren_US
dc.subjectselective decentralizationen_US
dc.subjectQ-learningen_US
dc.subjectcontrol systemen_US
dc.titleSelectively Decentralized Q-Learningen_US
dc.typeConference proceedingsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
nguyen-2017-selectively.pdf
Size:
572.39 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: