metadata
license: llama3.2
Llama-3.2-SFT-Vision-Arena Model Card
Model Details
Llama-3.2-SFT-Vision-Arena is a chat assistant trained by fine-tuning Llama-3.2-11B-Vision on user-shared conversations collected from Chatbot Arena.
- Developed by: LMArena
- Model type: An auto-regressive vision language model based on the transformer architecture
- License: Llama 3.2 Community License Agreement
- Finetuned from model: Llama-3.2-11B-Vision
Model Sources
- Repository: https://github.com/lm-sys/FastChat
- Paper: https://arxiv.org/abs/2412.08687
Uses
The primary use of Llama-3.2-SFT-Vision-Arena is research on vision language models and chatbots. The primary intended users of the model are researchers and hobbyists in natural language processing, machine learning, and artificial intelligence.
BibTex
@misc{chou2024visionarena,
title={VisionArena: 230K Real World User-VLM Conversations with Preference Labels},
author={Christopher Chou and Lisa Dunlap and Koki Mashita and Krishna Mandal and Trevor Darrell and Ion Stoica and Joseph E. Gonzalez and Wei-Lin Chiang},
year={2024},
eprint={2412.08687},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2412.08687},
}