Visual Question Answering
Transformers
Safetensors
English
vlm
text-generation
image-captioning
Inference Endpoints
File size: 71 Bytes
676ad2e
 
 
 
 
1
2
3
4
5
6
{
  "<image>": 32002,
  "<|im_end|>": 32001,
  "<|im_start|>": 32000
}