Post
1732
π€ transformers pipelines now support vision language models for easy local inference π«°π»
h/t @yonigozlan for shipping this π©π
you can also use inference API to infer hosted vision LMs (via Python, JS and cURL) https://huggingface.co/docs/api-inference/en/tasks/image-text-to-text
h/t @yonigozlan for shipping this π©π
you can also use inference API to infer hosted vision LMs (via Python, JS and cURL) https://huggingface.co/docs/api-inference/en/tasks/image-text-to-text