Visual Question Answering
Transformers
Safetensors
English
videollama2_qwen2
text-generation
multimodal large language model
large video-language model
Inference Endpoints
VideoLLaMA2-72B / added_tokens.json
Guanzheng's picture
Tokenizer Upload
91876e6 verified
raw
history blame
80 Bytes
{
"<|endoftext|>": 151643,
"<|im_end|>": 151645,
"<|im_start|>": 151644
}