|
--- |
|
license: apache-2.0 |
|
pipeline_tag: reinforcement-learning |
|
library_name: transformers |
|
--- |
|
|
|
# ARMAP: Scaling Autonomous Agents via Automatic Reward Modeling And Planning |
|
|
|
[Project Page](https://armap-agent.github.io) | [Paper](https://arxiv.org/abs/2502.12130) | [Model Weights](https://huggingface.co/Heaplax/ARMAP-RM-LoRA) |
|
|
|
This repository contains the reward model for the paper [Scaling Autonomous Agents via Automatic Reward Modeling And Planning](https://arxiv.org/abs/2502.12130). This reward model automatically learns a reward function from environment interactions. |
|
|
|
Code: https://github.com/heaplax/ARMAP |
|
|
|
## Citation |
|
If you use this work or find it helpful, please consider citing: |
|
``` |
|
@misc{chen2025scalingautonomousagentsautomatic, |
|
title={Scaling Autonomous Agents via Automatic Reward Modeling And Planning}, |
|
author={Zhenfang Chen and Delin Chen and Rui Sun and Wenjun Liu and Chuang Gan}, |
|
year={2025}, |
|
eprint={2502.12130}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.AI}, |
|
url={https://arxiv.org/abs/2502.12130}, |
|
} |
|
``` |