BitFlip Environment
A simple environment to flip a 01 sequence into a specific state. With the bits number increasing, the task becomes harder. Well suited for testing Hindsight Experience Replay.
DI-engine's HER on BitFlip
The table shows how many envsteps are needed at least to converge for PureDQN and HER-DQN implemented in DI-engine. '-' means no convergence in 20M envsteps.
n_bit | PureDQN | HER-DQN |
---|---|---|
15 | - | 150K |
20 | - | 1.5M |
DI-engine's HER-DQN can converge |
You can refer to the RL algorithm doc for implementation and experiment details.