stanfordnlp/llama8b-nnetnav-wa

smurty committed on Jan 29

Commit 76d38ec (verified) · 1 Parent(s): 77fe6c5

Update README.md

Files changed (1)

README.md +33 -17

@@ -16,8 +16,6 @@ LLama8b-NNetNav-WA is a [LLama-3.1-8B](https://huggingface.co/meta-llama/Llama-3
 Most details about this model along with details can be found in our paper: [NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild](https://arxiv.org/abs/2410.02907).
-![show an example trajectory from NNetNav-WA](TODO)
 ##  Table of Contents
 - [Model Card for Llama8b-NNetNav-WA](#model-card-for--model_id-)
@@ -40,37 +38,57 @@ Most details about this model along with details can be found in our paper: [NNe
 - [Model Card Contact](#model-card-contact)
 - [How to Get Started with the Model](#how-to-get-started-with-the-model)
 ## Model Details
-### Model Description
 <!-- Provide a longer summary of what this model is/does. -->
-## Uses
 ## Bias, Risks, and Limitations
 <!-- This section is meant to convey both technical and sociotechnical limitations. -->
 ## How to Get Started with the Model
-```python
-```
 ## Training Details
 ### Training Data
-<!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-This model was trained on the [NNetnav-WA](https://huggingface.co/datasets/stanfordnlp/nnetnav-wa) dataset. It can be used directly with the open-instruct library.
 ### Training Procedure
@@ -110,10 +128,8 @@ This model was fine-tuned with [Open-Instruct](https://github.com/allenai/open-i
 ## Model Card Authors [optional]
 <!-- This section provides another layer of transparency and accountability. Whose views is this model card representing? How many voices were included in its construction? Etc. -->
 Shikhar Murty
 ## Model Card Contact
 smurty@cs.stanford.edu
-shikhar.murty@gmail.com

 Most details about this model along with details can be found in our paper: [NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild](https://arxiv.org/abs/2410.02907).
 ##  Table of Contents
 - [Model Card for Llama8b-NNetNav-WA](#model-card-for--model_id-)
 - [Model Card Contact](#model-card-contact)
 - [How to Get Started with the Model](#how-to-get-started-with-the-model)
 ## Model Details
+This model is intended to be used as a **web-agent** i.e. given an instruction such as "Upvote the post by user smurty123 on subreddit r/LocalLLaMA", and a web-url "reddit.com", the model can perform the task by executing a sequence of actions.
+### Action Space
 <!-- Provide a longer summary of what this model is/does. -->
+The action space of the model is as follows:
+```plaintext
+Page Operation Actions:
+`click [id]`: This action clicks on an element with a specific id on the webpage.
+`type [id] [content] [press_enter_after=0|1]`: Use this to type the content into the field with id. By default, the "Enter" key is pressed after typing unless press_enter_after is set to 0.
+`hover [id]`: Hover over an element with id.
+`press [key_comb]`:  Simulates the pressing of a key combination on the keyboard (e.g., Ctrl+v).
+`scroll [down|up]`: Scroll the page up or down.
+Tab Management Actions:
+`new_tab`: Open a new, empty browser tab.
+`tab_focus [tab_index]`: Switch the browser's focus to a specific tab using its index.
+`close_tab`: Close the currently active tab.
+URL Navigation Actions:
+`goto [url]`: Navigate to a specific URL.
+`go_back`: Navigate to the previously viewed page.
+`go_forward`: Navigate to the next page (if a previous 'go_back' action was performed).
+Completion Action:
+`stop [answer]`: Issue this action when you believe the task is complete. If the objective is to find a text-based answer, provide the answer in the bracket. If you believe the task is impossible to complete, provide the answer as "N/A" in the bracket.
+```
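As an illustration of the action format listed above (not part of the committed README), here is a minimal Python sketch of how a harness might parse one emitted action string into a structured form. It assumes the model writes each action as a name followed by its arguments in square brackets, e.g. `click [1234]`. The `Action` class, the `parse_action` function, and the `ACTION_ARITY` table are hypothetical names introduced for this example, and the argument counts are one reading of the bracketed placeholders rather than a documented specification.

```python
import re
from dataclasses import dataclass, field

# Action names are copied from the action space in the model card.
# The (min, max) argument counts are an assumption of this sketch.
ACTION_ARITY = {
    "click": (1, 1),
    "type": (2, 3),
    "hover": (1, 1),
    "press": (1, 1),
    "scroll": (1, 1),
    "new_tab": (0, 0),
    "tab_focus": (1, 1),
    "close_tab": (0, 0),
    "goto": (1, 1),
    "go_back": (0, 0),
    "go_forward": (0, 0),
    "stop": (0, 1),
}


@dataclass
class Action:
    """Hypothetical container for one parsed action."""
    name: str
    args: list = field(default_factory=list)


def parse_action(text: str) -> Action:
    """Parse a string such as 'type [12] [hello] [0]' into an Action.

    Assumes every argument is wrapped in square brackets and contains
    no nested ']' character.
    """
    text = text.strip()
    match = re.match(r"\w+", text)
    if match is None:
        raise ValueError(f"no action name found in {text!r}")
    name = match.group(0)
    if name not in ACTION_ARITY:
        raise ValueError(f"unknown action {name!r}")

    # Collect the contents of each [...] group after the action name.
    args = re.findall(r"\[([^\]]*)\]", text[match.end():])
    low, high = ACTION_ARITY[name]
    if not low <= len(args) <= high:
        raise ValueError(f"{name} expects {low}-{high} arguments, got {len(args)}")

    # The card states that Enter is pressed after typing unless
    # press_enter_after is set to 0, so a missing flag defaults to "1".
    if name == "type" and len(args) == 2:
        args.append("1")
    return Action(name, args)


if __name__ == "__main__":
    print(parse_action("click [1234]"))
    print(parse_action("type [12] [hello world]"))
    print(parse_action("stop [N/A]"))
```

Rejecting unknown action names and wrong argument counts early lets a harness surface malformed model output instead of passing it on to the browser.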
+## Results on Benchmarks
+This model gets the following results on WebArena and WebVoyager:
+| Model                  | WebArena (SR) | WebVoyager (SR) |
+|------------------------|--------------:|---------------:|
+| **GPT-4**             | **14.1**      | **33.5**      |
+| **llama8b-nnetnav-wa** | **16.3**      | **28.1**      |
 ## Bias, Risks, and Limitations
 <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+TODO
 ## How to Get Started with the Model
+TODO
 ## Training Details
 ### Training Data
+This model was trained on the [NNetnav-WA](https://huggingface.co/datasets/stanfordnlp/nnetnav-wa) dataset, which is comprised of synthetic demonstrations entirely from self-hosted websites.
 ### Training Procedure
 ## Model Card Authors [optional]
 <!-- This section provides another layer of transparency and accountability. Whose views is this model card representing? How many voices were included in its construction? Etc. -->
 Shikhar Murty
 ## Model Card Contact
 smurty@cs.stanford.edu