Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

This is a quantization of Yi-VL-34B and of the visual transformer.

You currently need to apply this PR to make it work: https://github.com/ggerganov/llama.cpp/pull/5093 - this adds the additional normalization steps into the projection

Yi-Vl-34B is prone to hallucinations, to me it appears like a rushed release. Something did not go right in training. However, while 6B was the 2nd worst llava-model I've tested, the 34B did show some strengths.

Downloads last month
253
GGUF
Model size
34.4B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

16-bit

Inference API
Unable to determine this model's library. Check the docs .