Decentralized and Partially Decentralized Multi-Agent Reinforcement Learning
Abstract
Multi-agent systems consist of multiple agents that interact and coordinate with each other to work towards a common goal. Such systems arise naturally in a variety of domains such as robotics, telecommunications, and economics. The dynamic and complex nature of these systems requires the agents to learn optimal solutions on their own rather than follow a pre-programmed strategy. Reinforcement learning provides a framework in which agents learn optimal behavior from the responses they obtain from the environment. In this thesis, we propose several novel decentralized, learning automaton based algorithms that can be employed by a group of interacting learning automata. We propose a completely decentralized version of the estimator algorithm. Compared to the completely centralized versions proposed before, this completely decentralized version is a substantial improvement in terms of space complexity and convergence speed. The decentralized learning algorithm was applied, for the first time, to the domains of distributed object tracking and distributed watershed management. The results of these experiments demonstrate the usefulness of decentralized estimator algorithms for solving complex optimization problems. Taking inspiration from the completely decentralized learning algorithm, we propose the novel concept of partial decentralization. Partial decentralization bridges the gap between completely decentralized and completely centralized algorithms and thus forms a comprehensive and continuous spectrum of multi-agent algorithms for learning automata. To demonstrate the applicability of partial decentralization, we employ a partially decentralized team of learning automata to control multi-agent Markov chains. More flexibility and expressiveness can be added to the partially decentralized framework by allowing different decentralized modules to engage in different types of games. We propose the novel framework of heterogeneous games of learning automata, which allows the automata to engage in disparate games under the same formalism, and we propose an algorithm for controlling dynamic zero-sum games using heterogeneous games of learning automata.
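To give a concrete flavor of decentralized, estimator-style learning with learning automata, the sketch below shows a minimal pursuit automaton in Python. It is illustrative only and is not the algorithm proposed in the thesis: the class name, parameters, and payoff matrix are assumptions, and the environment is a simple two-player identical-payoff game in which each automaton updates using only its own action and the common environment response.

```python
import numpy as np

class PursuitAutomaton:
    """Minimal pursuit (estimator-style) learning automaton.

    Each automaton keeps its own action probabilities and running reward
    estimates; it never observes the other agents' actions, which is the
    sense in which the update is decentralized. Illustrative sketch only.
    """

    def __init__(self, n_actions, learning_rate=0.05, rng=None):
        self.n = n_actions
        self.lr = learning_rate
        self.probs = np.full(n_actions, 1.0 / n_actions)  # action probabilities
        self.estimates = np.zeros(n_actions)              # estimated rewards per action
        self.counts = np.zeros(n_actions)                 # times each action was tried
        self.rng = rng or np.random.default_rng()

    def choose(self):
        # Sample an action according to the current probability vector.
        return self.rng.choice(self.n, p=self.probs)

    def update(self, action, reward):
        # Update the sample-average reward estimate for the chosen action.
        self.counts[action] += 1
        self.estimates[action] += (reward - self.estimates[action]) / self.counts[action]
        # Pursuit step: move probability mass toward the currently best-estimated action.
        target = np.zeros(self.n)
        target[np.argmax(self.estimates)] = 1.0
        self.probs += self.lr * (target - self.probs)
        self.probs /= self.probs.sum()  # guard against numerical drift


# Toy usage (hypothetical payoffs): two automata play a 2x2 identical-payoff
# game and learn to coordinate on the jointly optimal action pair without
# sharing any state with each other.
payoff = np.array([[0.9, 0.2],
                   [0.3, 0.6]])  # assumed common reward probabilities
agents = [PursuitAutomaton(2), PursuitAutomaton(2)]
rng = np.random.default_rng(0)
for _ in range(5000):
    a0, a1 = agents[0].choose(), agents[1].choose()
    reward = float(rng.random() < payoff[a0, a1])  # Bernoulli environment response
    for agent, act in zip(agents, (a0, a1)):
        agent.update(act, reward)                  # each agent uses only its own action
print([np.round(a.probs, 2) for a in agents])
```

Under these assumed payoffs, both probability vectors typically concentrate on the first action, illustrating how a team of automata can converge on a jointly good policy using only local information; the thesis develops and analyzes decentralized and partially decentralized algorithms of this general kind.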