This is a finetuned Llama-3.2-11B-Vision-Instruct model, the dataset used is Multimodal-Mind2Web dataset.

Safetensors

Model size

10.7B params

Tensor type

F32

Inference API

Unable to determine this model's library. Check the docs .

Model tree for roywei/Llama-3.2-11B-Vision-Instruct-mind2web-finetuned

Base model

Finetuned

(70)

this model

roywei
/

Llama-3.2-11B-Vision-Instruct-mind2web-finetuned