Accelerating Experience Replay for Deep Q-Networks with Reduced Target Computation

Zigon, Bob; Song, Fengguang

Accelerating Experience Replay for Deep Q-Networks with Reduced Target Computation

Files

Zigon2023Accelerating-CCBY.pdf (1 MB)

Date

2023

Authors

Zigon, Bob

Song, Fengguang

Language

American English

Department

Computer and Information Science, School of Science

Found At

CS & IT

Abstract

Mnih’s seminal deep reinforcement learning paper that applied a Deep Q-network to Atari video games demonstrated the importance of a replay buffer and a target network. Though the pair were required for convergence, the use of the replay buffer came at a significant computational cost. With each new sample generated by the system, the targets in the mini batch buffer were continually recomputed. We propose an alternative that eliminates the target recomputation called TAO-DQN (Target Accelerated Optimization-DQN). Our approach focuses on a new replay buffer algorithm that lowers the computational burden. We implemented this new approach on three experiments involving environments from the OpenAI gym. This resulted in convergence to better policies in fewer episodes and less time. Furthermore, we offer a mathematical justification for our improved convergence rate.

Keywords

DQN, Experience Replay, Replay Buffer, Target Network

Cite As

Ziggon, B., Song, F., & Coulter, B. (2023). Accelerating Experience Replay for Deep Q-Networks with Reduced Target Computation. CS & IT Conference Proceedings, 13(1), Article 1. https://doi.org/10.5121/csit.2023.130101

Journal

CS & IT Conference Proceedings

Rights

Attribution 4.0 International

Source

Publisher

Type

Article

Permanent Link

https://hdl.handle.net/1805/36952

DOI

https://doi.org/10.5121/csit.2023.130101

Version

Final published version

Collections

Open Access Policy Articles
Department of Computer and Information Science Works

Full item page