Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
asuzuki
/
PPO-LunarLander-v2
like
0
Reinforcement Learning
Transformers
TensorBoard
LunarLander-v2
ppo
deep-reinforcement-learning
custom-implementation
deep-rl-course
Eval Results
Inference Endpoints
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
71e2525
PPO-LunarLander-v2
/
results.json
asuzuki
first commit - model PPO performing good
71e2525
about 2 years ago
raw
Copy download link
history
blame
165 Bytes
{
"mean_reward"
:
239.92169866976738
,
"std_reward"
:
20.791617039010347
,
"is_deterministic"
:
true
,
"n_eval_episodes"
:
10
,
"eval_datetime"
:
"2023-01-06T08:49:26.545362"
}