metadata

license: apache-2.0
pipeline_tag: reinforcement-learning
library_name: transformers

ARMAP: Scaling Autonomous Agents via Automatic Reward Modeling And Planning

Project Page | Paper | Model Weights

This repository contains the reward model for the paper Scaling Autonomous Agents via Automatic Reward Modeling And Planning. This reward model automatically learns a reward function from environment interactions.

Code: https://github.com/heaplax/ARMAP

Citation

If you use this work or find it helpful, please consider citing:

@misc{chen2025scalingautonomousagentsautomatic,
      title={Scaling Autonomous Agents via Automatic Reward Modeling And Planning}, 
      author={Zhenfang Chen and Delin Chen and Rui Sun and Wenjun Liu and Chuang Gan},
      year={2025},
      eprint={2502.12130},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2502.12130}, 
}