File size: 1,931 Bytes
cf76bf9 cf3f65d cf76bf9 d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf3f65d d456738 cf76bf9 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 |
---
tags:
- FrozenLake-v1-4x4-no_slippery
- q-learning
- reinforcement-learning
- custom-implementation
model-index:
- name: q-FrozenLake-v1-4x4-noSlippery
results:
- task:
type: reinforcement-learning
name: reinforcement-learning
dataset:
name: FrozenLake-v1-4x4-no_slippery
type: FrozenLake-v1-4x4-no_slippery
metrics:
- type: mean_reward
value: 1.00 +/- 0.00
name: mean_reward
verified: false
---
# **Q-Learning** Agent playing1 **FrozenLake-v1**
This is a trained model of a **Q-Learning** agent playing **FrozenLake-v1** .
## Usage
{'env_id': 'FrozenLake-v1',
'max_steps': 99,
'n_training_episodes': 10000,
'n_eval_episodes': 100,
'eval_seed': [],
'learning_rate': 0.7,
'gamma': 0.95,
'max_epsilon': 1.0,
'min_epsilon': 0.05,
'decay_rate': 0.0005,
'qtable': array([[
0.73509189, 0.77378094, 0.77378094, 0.73509189],
[0.73509189, 0. , 0.81450625, 0.77378094],
[0.77378094, 0.857375 , 0.77378094, 0.81450625],
[0.81450625, 0. , 0.77378094, 0.77378094],
[0.77378094, 0.81450625, 0. , 0.73509189],
[0. , 0. , 0. , 0. ],
[0. , 0.9025 , 0. , 0.81450625],
[0. , 0. , 0. , 0. ],
[0.81450625, 0. , 0.857375 , 0.77378094],
[0.81450625, 0.9025 , 0.9025 , 0. ],
[0.857375 , 0.95 , 0. , 0.857375 ],
[0. , 0. , 0. , 0. ],
[0. , 0. , 0. , 0. ],
[0. , 0.9025 , 0.95 , 0.857375 ],
[0.9025 , 0.95 , 1. , 0.9025 ],
[0. , 0. , 0. , 0. ]])}
|