Llama-3.2-1B-Vision(development process continues)

A vision-enhanced version of the Llama-3.3-70B language model, capable of understanding and describing images while maintaining the base model's language capabilities.

Model Details

Base Model: Llama-3.3-70B
Model Type: Vision-Language Model
Last Updated: December ?, 2024
Model Architecture: Llama architecture with SigLIP vision encoder

Downloads last month: 42

Safetensors

Model size

70.6B params

Tensor type

BF16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The HF Inference API does not support model that require custom code execution.

Model tree for kadirnar/Llama3.3-70b-Vision

Base model

meta-llama/Llama-3.1-70B

Finetuned

meta-llama/Llama-3.3-70B-Instruct

Finetuned

(136)

this model

Collection including kadirnar/Llama3.3-70b-Vision

Llama3 Vision

Collection

3 items • Updated Dec 7, 2024