zjowowen's picture
init space
079c32c
## BitFlip Environment
A simple environment to flip a 01 sequence into a specific state. With the bits number increasing, the task becomes harder.
Well suited for testing Hindsight Experience Replay.
## DI-engine's HER on BitFlip
The table shows how many envsteps are needed at least to converge for PureDQN and HER-DQN implemented in DI-engine. '-' means no convergence in 20M envsteps.
| n_bit | PureDQN | HER-DQN |
| ------ | ------- | ------- |
| 15 | - | 150K |
| 20 | - | 1.5M |
DI-engine's HER-DQN can converge
You can refer to the RL algorithm doc for implementation and experiment details.