Experience Replay and Data Efficiency

July 14, 2022 · 1 min · Orange · Reinforcement Learning: an Introduction | Suggest Changes

Table of Contents

Experience replay
Infer-Collect framework

To make better use of data, we can use experience replay to increase data efficiency.

Experience replay

We can put ${s,a,s’,r}$ pairs in the buffer and update Q using mini batch methods. To decrease noise in the replay, we average over several samples. (That’s why minibatch)

Experience replay#

Infer-Collect framework#

Experience replay

Infer-Collect framework