Selective decentralization to improve reinforcement learning in unknown linear noisy systems

Nguyen, Thanh; Mukhopadhyay, Snehasis

Selective decentralization to improve reinforcement learning in unknown linear noisy systems

Files

nguyen-2017-selective.pdf (384.81 KB)

Date

2017-11

Authors

Nguyen, Thanh

Mukhopadhyay, Snehasis

Language

American English

Department

Computer and Information Science, School of Science

Found At

IEEE

Abstract

In this paper, we answer the question of to what extend selective decentralization could enhance the learning and control performance when the system is noisy and unknown. Compared to the previous works in selective decentralization, in this paper, we add the system noise as another complexity in the learning and control problem. Thus, we only perform analysis for some simple toy examples of noisy linear system. In linear system, the Halminton-Jaccobi-Bellman (HJB) equation becomes Riccati equation with closed-form solution. Our previous framework in learning and control unknown system is based on the following principle: approximating the system using identification in order to apply model-based solution. Therefore, this paper would explore the learning and control performance on two aspects: system identification error and system stabilization. Our results show that selective decentralization show better learning performance than the centralization when the noise level is low.

Keywords

selective decentralization, multi-agent systems, reinforcement learning

Cite As

Nguyen, T., & Mukhopadhyay, S. (2017). Selective decentralization to improve reinforcement learning in unknown linear noisy systems. In 2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems (IES) (pp. 77–82). https://doi.org/10.1109/IESYS.2017.8233565

Journal

2017 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems

Rights

Publisher Policy

Source

Author

Type

Article

Permanent Link

https://hdl.handle.net/1805/17394

DOI

https://doi.org/10.1109/IESYS.2017.8233565

Version

Author's manuscript

Collections

Open Access Policy Articles
Department of Computer and Information Science Works

Full item page