Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Heaplax
/
ARMAP-RM-LoRA
like
0
Reinforcement Learning
Transformers
Inference Endpoints
arxiv:
2502.12130
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
29c609c
ARMAP-RM-LoRA
/
RM-sciworld
/
checkpoint-120
/
reward_head
Heaplax
Upload folder using huggingface_hub
29c609c
verified
19 days ago
download
Copy download link
history
11.3 kB
This file contains binary data. It cannot be displayed, but you can still
download
it.