HugeFighter
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -22,10 +22,12 @@ model-index:
|
|
22 |
|
23 |
# **Q-Learning** Agent playing1 **FrozenLake-v2**
|
24 |
This is a trained model of a **Q-Learning** agent playing **FrozenLake-v2**. The **goal** of the agent is to navigate from starting state to the goal state by walking only on Frozen tiles and avoid holes.
|
|
|
25 |
The environment support two different sizes: `4x4` and `8x8`grids
|
26 |
The environment has two modes:
|
27 |
- `is_slippery=True` agent may not move in the desired direction due to the slippery nature of the tails.
|
28 |
- `is_slippery=False` agent moves in the desired direction due to the non-slippery nature of the tails.
|
|
|
29 |
How to create the FrozenLake environment?
|
30 |
- ```gym.make("FrozenLake-v1", desc=None, map_name="4x4", is_slippery=True, render_mode="rgb_array")```
|
31 |
- `desc=None` specify non-predetermined maps, `map_name` will be used. You can also specify custom map like ```desc = ["SFFH", "HFHF", "FFHF", "HFFG"]```. If both are None, random `8x8´ map with `80%` of locations frozen will be generated.
|
|
|
22 |
|
23 |
# **Q-Learning** Agent playing1 **FrozenLake-v2**
|
24 |
This is a trained model of a **Q-Learning** agent playing **FrozenLake-v2**. The **goal** of the agent is to navigate from starting state to the goal state by walking only on Frozen tiles and avoid holes.
|
25 |
+
|
26 |
The environment support two different sizes: `4x4` and `8x8`grids
|
27 |
The environment has two modes:
|
28 |
- `is_slippery=True` agent may not move in the desired direction due to the slippery nature of the tails.
|
29 |
- `is_slippery=False` agent moves in the desired direction due to the non-slippery nature of the tails.
|
30 |
+
|
31 |
How to create the FrozenLake environment?
|
32 |
- ```gym.make("FrozenLake-v1", desc=None, map_name="4x4", is_slippery=True, render_mode="rgb_array")```
|
33 |
- `desc=None` specify non-predetermined maps, `map_name` will be used. You can also specify custom map like ```desc = ["SFFH", "HFHF", "FFHF", "HFFG"]```. If both are None, random `8x8´ map with `80%` of locations frozen will be generated.
|