Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,42 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
library_name: transformers
|
4 |
+
base_model: Qwen/Qwen2-VL-7B-Instruct
|
5 |
+
pipeline_tag: image-text-to-text
|
6 |
+
---
|
7 |
+
|
8 |
+
# OS-Atlas: A Foundation Action Model For Generalist GUI Agents
|
9 |
+
|
10 |
+
<div align="center">
|
11 |
+
|
12 |
+
[\[🏠Homepage\]](https://osatlas.github.io) [\[💻Code\]](https://github.com/OS-Copilot/OS-Atlas) [\[🚀Quick Start\]](#quick-start) [\[📝Paper\]](https://arxiv.org/abs/2410.23218) [\[🤗Models\]](https://huggingface.co/collections/OS-Copilot/os-atlas-67246e44003a1dfcc5d0d045) [\[🤗ScreenSpot-v2\]](https://huggingface.co/datasets/OS-Copilot/ScreenSpot-v2)
|
13 |
+
|
14 |
+
</div>
|
15 |
+
|
16 |
+
## Overview
|
17 |
+
![os-atlas](https://github.com/user-attachments/assets/cf2ee020-5e15-4087-9a7e-75cc43662494)
|
18 |
+
|
19 |
+
OS-Atlas provides a series of models specifically designed for GUI agents.
|
20 |
+
|
21 |
+
For GUI grounding tasks, you can use:
|
22 |
+
- [OS-Atlas-Base-7B](https://huggingface.co/OS-Copilot/OS-Atlas-Base-7B)
|
23 |
+
- [OS-Atlas-Base-4B](https://huggingface.co/OS-Copilot/OS-Atlas-Base-4B)
|
24 |
+
|
25 |
+
For generating single-step actions in GUI agent tasks, you can use:
|
26 |
+
- [OS-Atlas-Action-7B](https://huggingface.co/OS-Copilot/OS-Atlas-Action-7B)
|
27 |
+
|
28 |
+
|
29 |
+
## OS-Atlas-Action-7B
|
30 |
+
|
31 |
+
|
32 |
+
|
33 |
+
## Citation
|
34 |
+
If you find this repository helpful, feel free to cite our paper:
|
35 |
+
```bibtex
|
36 |
+
@article{wu2024atlas,
|
37 |
+
title={OS-ATLAS: A Foundation Action Model for Generalist GUI Agents},
|
38 |
+
author={Wu, Zhiyong and Wu, Zhenyu and Xu, Fangzhi and Wang, Yian and Sun, Qiushi and Jia, Chengyou and Cheng, Kanzhi and Ding, Zichen and Chen, Liheng and Liang, Paul Pu and others},
|
39 |
+
journal={arXiv preprint arXiv:2410.23218},
|
40 |
+
year={2024}
|
41 |
+
}
|
42 |
+
```
|