SimingSiming
commited on
Commit
•
883d985
1
Parent(s):
a5be5e8
Update README.md
Browse files
README.md
CHANGED
@@ -22,14 +22,24 @@ model-index:
|
|
22 |
# **Q-Learning** Agent playing **FrozenLake-v1**
|
23 |
This is a trained model of a **Q-Learning** agent playing **FrozenLake-v1** .
|
24 |
|
25 |
-
## Usage
|
26 |
-
```python
|
27 |
-
model = load_from_hub(repo_id="SimingSiming/q-FrozenLake-v1-8x8-non_slippery", filename="q-learning.pkl")
|
28 |
|
29 |
-
#
|
30 |
-
|
31 |
|
32 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
|
34 |
```
|
35 |
|
|
|
22 |
# **Q-Learning** Agent playing **FrozenLake-v1**
|
23 |
This is a trained model of a **Q-Learning** agent playing **FrozenLake-v1** .
|
24 |
|
|
|
|
|
|
|
25 |
|
26 |
+
n_training_episodes = 200000 # Total training episodes
|
27 |
+
learning_rate = 0.8 # Learning rate
|
28 |
|
29 |
+
# Evaluation parameters
|
30 |
+
n_eval_episodes = 100 # Total number of test episodes
|
31 |
+
|
32 |
+
# Environment parameters
|
33 |
+
env_id = "FrozenLake-v1" # Name of the environment
|
34 |
+
max_steps = 100 # Max steps per episode
|
35 |
+
gamma = 0.99 # Discounting rate
|
36 |
+
eval_seed = [] # The evaluation seed of the environment
|
37 |
+
|
38 |
+
# Exploration parameters
|
39 |
+
epsilon = 1.0 # Exploration rate
|
40 |
+
max_epsilon = 1.0 # Exploration probability at start
|
41 |
+
min_epsilon = 0.05 # Minimum exploration probability
|
42 |
+
decay_rate = 0.00005 # Exponential decay rate for exploration prob
|
43 |
|
44 |
```
|
45 |
|