HugeFighter
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -20,15 +20,15 @@ model-index:
|
|
20 |
verified: false
|
21 |
---
|
22 |
|
23 |
-
# **Q-Learning** Agent
|
24 |
-
This is a trained model of a **Q-Learning** agent playing **FrozenLake-v2**. The **goal** of the agent is to navigate from starting state to the goal state by walking only on Frozen tiles and avoid holes.
|
25 |
|
26 |
-
The environment support two different sizes: `4x4` and `8x8`grids
|
27 |
-
The environment has two modes:
|
28 |
- `is_slippery=True` agent may not move in the desired direction due to the slippery nature of the tails.
|
29 |
- `is_slippery=False` agent moves in the desired direction due to the non-slippery nature of the tails.
|
30 |
|
31 |
-
How to create the FrozenLake environment?
|
32 |
- ```gym.make("FrozenLake-v1", desc=None, map_name="4x4", is_slippery=True, render_mode="rgb_array")```
|
33 |
- `desc=None` specify non-predetermined maps, `map_name` will be used. You can also specify custom map like ```desc = ["SFFH", "HFHF", "FFHF", "HFFG"]```. If both are None, random `8x8´ map with `80%` of locations frozen will be generated.
|
34 |
|
|
|
20 |
verified: false
|
21 |
---
|
22 |
|
23 |
+
# **Q-Learning** Agent playing **FrozenLake-v2**
|
24 |
+
- This is a trained model of a **Q-Learning** agent playing **FrozenLake-v2**. The **goal** of the agent is to navigate from starting state to the goal state by walking only on Frozen tiles and avoid holes.
|
25 |
|
26 |
+
- The environment support two different sizes: `4x4` and `8x8`grids
|
27 |
+
- The environment has two modes:
|
28 |
- `is_slippery=True` agent may not move in the desired direction due to the slippery nature of the tails.
|
29 |
- `is_slippery=False` agent moves in the desired direction due to the non-slippery nature of the tails.
|
30 |
|
31 |
+
- How to create the FrozenLake environment?
|
32 |
- ```gym.make("FrozenLake-v1", desc=None, map_name="4x4", is_slippery=True, render_mode="rgb_array")```
|
33 |
- `desc=None` specify non-predetermined maps, `map_name` will be used. You can also specify custom map like ```desc = ["SFFH", "HFHF", "FFHF", "HFFG"]```. If both are None, random `8x8´ map with `80%` of locations frozen will be generated.
|
34 |
|