README.md · lmarena-ai/llama-3.2-sft-vision-arena at dc6c5417cec4dec17b9eb2ccea372f03926b2946

metadata

license: llama3.2

Llama-3.2-SFT-Vision-Arena Model Card

Model Details

Llama-3.2-SFT-Vision-Arena is a chat assistant trained by fine-tuning Llama-3.2-11B-Vision on user-shared conversations collected from Chatbot Arena.

Developed by: LMArena
Model type: An auto-regressive vision language model based on the transformer architecture
License: Llama 3.2 Community License Agreement
Finetuned from model: Llama-3.2-11B-Vision

Model Sources

Repository: https://github.com/lm-sys/FastChat
Paper: https://arxiv.org/abs/2412.08687

Uses

The primary use of Llama-3.2-SFT-Vision-Arena is research on vision language models and chatbots. The primary intended users of the model are researchers and hobbyists in natural language processing, machine learning, and artificial intelligence.

BibTex

@misc{chou2024visionarena,
      title={VisionArena: 230K Real World User-VLM Conversations with Preference Labels}, 
      author={Christopher Chou and Lisa Dunlap and Koki Mashita and Krishna Mandal and Trevor Darrell and Ion Stoica and Joseph E. Gonzalez and Wei-Lin Chiang},
      year={2024},
      eprint={2412.08687},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2412.08687}, 
}