vwxyzjn commited on
Commit
473d58a
1 Parent(s): a42ad31

pushing model

Browse files
README.md CHANGED
@@ -16,7 +16,7 @@ model-index:
16
  type: Breakout-v5
17
  metrics:
18
  - type: mean_reward
19
- value: 1.50 +/- 0.92
20
  name: mean_reward
21
  verified: false
22
  ---
@@ -46,7 +46,7 @@ curl -OL https://huggingface.co/vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_a
46
  curl -OL https://huggingface.co/vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper_naturecnn-seed1/raw/main/pyproject.toml
47
  curl -OL https://huggingface.co/vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper_naturecnn-seed1/raw/main/poetry.lock
48
  poetry install --all-extras
49
- python cleanba_ppo_envpool_impala_atari_wrapper_naturecnn.py --total-timesteps 200000 --distributed --learner-device-ids 1 --track --save-model --upload-model --env-id Breakout-v5 --seed 1
50
  ```
51
 
52
  # Hyperparameters
@@ -66,10 +66,10 @@ python cleanba_ppo_envpool_impala_atari_wrapper_naturecnn.py --total-timesteps 2
66
  'exp_name': 'cleanba_ppo_envpool_impala_atari_wrapper_naturecnn',
67
  'gae_lambda': 0.95,
68
  'gamma': 0.99,
69
- 'global_learner_decices': ['gpu:1', 'gpu:3'],
70
  'hf_entity': '',
71
- 'learner_device_ids': [1],
72
- 'learner_devices': ['gpu:1'],
73
  'learning_rate': 0.00025,
74
  'local_batch_size': 7680,
75
  'local_minibatch_size': 1920,
 
16
  type: Breakout-v5
17
  metrics:
18
  - type: mean_reward
19
+ value: 2.30 +/- 1.90
20
  name: mean_reward
21
  verified: false
22
  ---
 
46
  curl -OL https://huggingface.co/vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper_naturecnn-seed1/raw/main/pyproject.toml
47
  curl -OL https://huggingface.co/vwxyzjn/Breakout-v5-cleanba_ppo_envpool_impala_atari_wrapper_naturecnn-seed1/raw/main/poetry.lock
48
  poetry install --all-extras
49
+ python cleanba_ppo_envpool_impala_atari_wrapper_naturecnn.py --total-timesteps 200000 --distributed --track --save-model --upload-model --env-id Breakout-v5 --seed 1
50
  ```
51
 
52
  # Hyperparameters
 
66
  'exp_name': 'cleanba_ppo_envpool_impala_atari_wrapper_naturecnn',
67
  'gae_lambda': 0.95,
68
  'gamma': 0.99,
69
+ 'global_learner_decices': ['gpu:0', 'gpu:1'],
70
  'hf_entity': '',
71
+ 'learner_device_ids': [0],
72
+ 'learner_devices': ['gpu:0'],
73
  'learning_rate': 0.00025,
74
  'local_batch_size': 7680,
75
  'local_minibatch_size': 1920,
cleanba_ppo_envpool_impala_atari_wrapper_naturecnn.cleanrl_model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e11c4688da03104f3b888acdeb3d20ca2833bcd4cd23f7fa13f4d22395614f17
3
  size 6747982
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:750ba5b5633deae0dd6920a0c7f237887c2de96f04bcc577fed6be8488766d0e
3
  size 6747982
events.out.tfevents.1677098574.ip-26-0-129-85 → events.out.tfevents.1677098668.ip-26-0-130-11 RENAMED
File without changes
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
videos/Breakout-v5__cleanba_ppo_envpool_impala_atari_wrapper_naturecnn__1__12fbdd3b-4519-47b6-b084-ebe1498b733d-eval/0.mp4 DELETED
Binary file (7.54 kB)
 
videos/Breakout-v5__cleanba_ppo_envpool_impala_atari_wrapper_naturecnn__1__ff30480b-43da-4348-97fa-1d95cc60a507-eval/0.mp4 ADDED
Binary file (6.84 kB). View file