ARMAP-RM-LoRA / README.md
Heaplax's picture
Update README.md
7474960 verified
---
license: apache-2.0
pipeline_tag: reinforcement-learning
library_name: transformers
---
# ARMAP: Scaling Autonomous Agents via Automatic Reward Modeling And Planning
[Project Page](https://armap-agent.github.io) | [Paper](https://arxiv.org/abs/2502.12130) | [Model Weights](https://huggingface.co/Heaplax/ARMAP-RM-LoRA)
This repository contains the reward model for the paper [Scaling Autonomous Agents via Automatic Reward Modeling And Planning](https://arxiv.org/abs/2502.12130). This reward model automatically learns a reward function from environment interactions.
Code: https://github.com/heaplax/ARMAP
## Citation
If you use this work or find it helpful, please consider citing:
```
@misc{chen2025scalingautonomousagentsautomatic,
title={Scaling Autonomous Agents via Automatic Reward Modeling And Planning},
author={Zhenfang Chen and Delin Chen and Rui Sun and Wenjun Liu and Chuang Gan},
year={2025},
eprint={2502.12130},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2502.12130},
}
```