HuggingFaceTB
/

SmolVLM-Instruct

Image-Text-to-Text

Inference Endpoints

Model card Files Files and versions Community

mfarre HF staff commited on Nov 26, 2024

Commit

75a694b

·

verified ·

1 Parent(s): 78b5928

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -160,7 +160,8 @@ We release the SmolVLM checkpoints under the Apache 2.0 license.
 ### Training Data
-The training data comes from [The Cauldron](https://huggingface.co/datasets/HuggingFaceM4/the_cauldron) and [Docmatix](https://huggingface.co/datasets/HuggingFaceM4/Docmatix) datasets, with emphasis on document understanding (25%) and image captioning (18%), while maintaining balanced coverage across other crucial capabilities like visual reasoning, chart comprehension, and general instruction following.<img src="https://huggingface.co/HuggingFaceTB/SmolVLM-Instruct/resolve/main/mixture_the_cauldron.png" alt="Example Image" style="width:70%;" />

 ### Training Data
+The training data comes from [The Cauldron](https://huggingface.co/datasets/HuggingFaceM4/the_cauldron) and [Docmatix](https://huggingface.co/datasets/HuggingFaceM4/Docmatix) datasets, with emphasis on document understanding (25%) and image captioning (18%), while maintaining balanced coverage across other crucial capabilities like visual reasoning, chart comprehension, and general instruction following.
+<img src="https://huggingface.co/HuggingFaceTB/SmolVLM-Instruct/resolve/main/mixture_the_cauldron.png" alt="Example Image" style="width:90%;" />