Heaplax
/

ARMAP-RM-LoRA

Reinforcement Learning

Inference Endpoints

Model card Files Files and versions Community

ARMAP-RM-LoRA / README.md

Heaplax's picture

Update README.md

7474960 verified 12 days ago

|

history blame contribute delete

1.07 kB

	---
	license: apache-2.0
	pipeline_tag: reinforcement-learning
	library_name: transformers
	---

	# ARMAP: Scaling Autonomous Agents via Automatic Reward Modeling And Planning

	[Project Page](https://armap-agent.github.io) \| [Paper](https://arxiv.org/abs/2502.12130) \| [Model Weights](https://huggingface.co/Heaplax/ARMAP-RM-LoRA)

	This repository contains the reward model for the paper [Scaling Autonomous Agents via Automatic Reward Modeling And Planning](https://arxiv.org/abs/2502.12130). This reward model automatically learns a reward function from environment interactions.

	Code: https://github.com/heaplax/ARMAP

	## Citation
	If you use this work or find it helpful, please consider citing:
	```
	@misc{chen2025scalingautonomousagentsautomatic,
	title={Scaling Autonomous Agents via Automatic Reward Modeling And Planning},
	author={Zhenfang Chen and Delin Chen and Rui Sun and Wenjun Liu and Chuang Gan},
	year={2025},
	eprint={2502.12130},
	archivePrefix={arXiv},
	primaryClass={cs.AI},
	url={https://arxiv.org/abs/2502.12130},
	}
	```