pkalkman committed
Commit 6f8d39b · verified · 1 Parent(s): d5ff06a

Initial commit

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ replay.mp4 filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ library_name: stable-baselines3
+ tags:
+ - CarRacing-v2
+ - deep-reinforcement-learning
+ - reinforcement-learning
+ - stable-baselines3
+ model-index:
+ - name: PPO
+   results:
+   - task:
+       type: reinforcement-learning
+       name: reinforcement-learning
+     dataset:
+       name: CarRacing-v2
+       type: CarRacing-v2
+     metrics:
+     - type: mean_reward
+       value: 336.10 +/- 284.97
+       name: mean_reward
+       verified: false
+ ---
+
+ # **PPO** Agent playing **CarRacing-v2**
+ This is a trained model of a **PPO** agent playing **CarRacing-v2**
+ using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+
+ ## Usage (with Stable-baselines3)
+ TODO: Add your code
+
+
+ ```python
+ from stable_baselines3 import ...
+ from huggingface_sb3 import load_from_hub
+
+ ...
+ ```
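The README added here leaves its usage section as a TODO. One way it might be filled in is sketched below; this is not the author's code. The repository id is a hypothetical placeholder (the commit page does not state it), and the sketch assumes the agent was trained on the default `CarRacing-v2` environment without extra wrappers, which cannot be confirmed because `config.json` is not rendered in this diff. Running it needs `stable-baselines3`, `huggingface_sb3`, and Gymnasium with Box2D support installed.

```python
from typing import Tuple

# Hypothetical repo id: the commit page does not state the repository name.
REPO_ID = "your-username/ppo-CarRacing-v2"
# Checkpoint filename as added in this commit.
FILENAME = "ppo-CarRacing-v2.zip"


def evaluate_from_hub(
    repo_id: str = REPO_ID,
    filename: str = FILENAME,
    n_eval_episodes: int = 10,
) -> Tuple[float, float]:
    """Download the checkpoint, load it as PPO, and return (mean, std) reward."""
    # Imported lazily so this module can be imported without the RL stack.
    import gymnasium as gym
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO
    from stable_baselines3.common.evaluation import evaluate_policy

    # load_from_hub downloads the file and returns its local path.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    model = PPO.load(checkpoint_path)

    # Assumption: default environment settings and no training-time wrappers.
    env = gym.make("CarRacing-v2")
    try:
        mean_reward, std_reward = evaluate_policy(
            model, env, n_eval_episodes=n_eval_episodes, deterministic=True
        )
    finally:
        env.close()
    return float(mean_reward), float(std_reward)
```

Calling `evaluate_from_hub()` with a real repository id would download the checkpoint and run ten deterministic episodes, mirroring the settings recorded in `results.json`. If the agent was trained with observation wrappers or a discrete action space, the environment construction would need to match.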
config.json ADDED
The diff for this file is too large to render. See raw diff
 
ppo-CarRacing-v2.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e2b6ae4e06a70ff227ea5278eeb663dacbc27e9d39c2d21f8d599c4885bbe7f4
+ size 26329444
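The three lines above are a Git LFS pointer, not the checkpoint itself: they record the spec version, the SHA-256 of the real file, and its size in bytes. As a minimal sketch, a downloaded file could be checked against such a pointer as follows; the helper names `parse_lfs_pointer` and `matches_pointer` are illustrative, not part of any Git LFS tooling.

```python
import hashlib
from typing import Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(data: bytes, pointer: Dict[str, str]) -> bool:
    """Return True if data has the size and SHA-256 digest the pointer records."""
    algorithm, _, digest = pointer["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")
    same_size = len(data) == int(pointer["size"])
    same_hash = hashlib.sha256(data).hexdigest() == digest
    return same_size and same_hash


# Pointer contents copied from the diff above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:e2b6ae4e06a70ff227ea5278eeb663dacbc27e9d39c2d21f8d599c4885bbe7f4
size 26329444
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

To verify a local copy, read it with `open(path, "rb").read()` and pass the bytes to `matches_pointer`. For very large files, hashing in chunks would avoid loading everything into memory.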
ppo-CarRacing-v2/_stable_baselines3_version ADDED
@@ -0,0 +1 @@
+ 2.3.2
ppo-CarRacing-v2/data ADDED
The diff for this file is too large to render. See raw diff
 
ppo-CarRacing-v2/policy.optimizer.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6ef1cc02a60575e4678196107c02fdf36015bcaa5901858bf698cc67f004c03d
+ size 17415216
ppo-CarRacing-v2/policy.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5a344b6bfe95deb32aba1ecd6e7082da58958ae8f21e490655b75bd04ea47dd1
+ size 8709502
ppo-CarRacing-v2/pytorch_variables.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d030ad8db708280fcae77d87e973102039acd23a11bdecc3db8eb6c0ac940ee1
+ size 431
ppo-CarRacing-v2/system_info.txt ADDED
@@ -0,0 +1,8 @@
+ - OS: macOS-15.0.1-arm64-arm-64bit Darwin Kernel Version 24.0.0: Tue Sep 24 23:35:10 PDT 2024; root:xnu-11215.1.12~1/RELEASE_ARM64_T6031
+ - Python: 3.8.10
+ - Stable-Baselines3: 2.3.2
+ - PyTorch: 2.0.1
+ - GPU Enabled: False
+ - Numpy: 1.24.4
+ - Cloudpickle: 3.1.0
+ - Gymnasium: 0.28.1
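Because the checkpoint is pickled with these library versions, comparing them with the local environment before loading can help explain load failures. The following is a rough sketch, not an official Stable-Baselines3 utility: `parse_system_info`, `find_mismatches`, and the label-to-package mapping are assumptions made for illustration, and the OS line is left out of the sample for brevity.

```python
from importlib import metadata
from typing import Dict

# Version lines copied from system_info.txt above (OS line omitted).
SYSTEM_INFO = """\
- Python: 3.8.10
- Stable-Baselines3: 2.3.2
- PyTorch: 2.0.1
- GPU Enabled: False
- Numpy: 1.24.4
- Cloudpickle: 3.1.0
- Gymnasium: 0.28.1
"""

# Assumed mapping from the labels in the file to PyPI distribution names.
PACKAGES = {
    "Stable-Baselines3": "stable-baselines3",
    "PyTorch": "torch",
    "Numpy": "numpy",
    "Cloudpickle": "cloudpickle",
    "Gymnasium": "gymnasium",
}


def parse_system_info(text: str) -> Dict[str, str]:
    """Turn '- Key: value' lines into a dict, splitting on the first ': '."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("- "):
            continue
        key, sep, value = line[2:].partition(": ")
        if sep:
            info[key] = value
    return info


def find_mismatches(recorded: Dict[str, str]) -> Dict[str, str]:
    """Map each label to the installed version when it differs from the record."""
    mismatches = {}
    for label, dist_name in PACKAGES.items():
        try:
            installed = metadata.version(dist_name)
        except metadata.PackageNotFoundError:
            installed = "not installed"
        if installed != recorded.get(label):
            mismatches[label] = installed
    return mismatches


recorded = parse_system_info(SYSTEM_INFO)
```

`find_mismatches(recorded)` returns an empty dict only when every listed package is installed at exactly the recorded version. A mismatch is not necessarily a problem, but it is a reasonable first thing to check.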
replay.mp4 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e4ed7e56cf9053bf1fae4257ec497f14cec0707abafcc71410684c6219935dcb
+ size 1719532
results.json ADDED
@@ -0,0 +1 @@
+ {"mean_reward": 336.1005782, "std_reward": 284.9663417423615, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-10-25T14:28:47.424274"}
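The `mean_reward` value in the README front matter (`336.10 +/- 284.97`) appears to be the mean and standard deviation from this file rounded to two decimals. A small sketch of that formatting, where `format_metric` is an illustrative helper rather than part of any library:

```python
import json

# Contents of results.json as added in this commit.
RESULTS_JSON = (
    '{"mean_reward": 336.1005782, "std_reward": 284.9663417423615, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2024-10-25T14:28:47.424274"}'
)


def format_metric(results: dict) -> str:
    """Format mean and standard deviation the way the model card shows them."""
    return f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"


results = json.loads(RESULTS_JSON)
metric = format_metric(results)
print(metric)  # 336.10 +/- 284.97
```

The standard deviation is nearly as large as the mean over only 10 episodes, so individual runs of this agent can be expected to vary widely.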