Juan CM

jucamohedano

AI & ML interests

Deep Learning and Robotics 🚀🤖

jucamohedano's activity

updated a model 10 days ago

jucamohedano/paligemma_a-okvqa

Updated 10 days ago • 30

updated a model 2 months ago

jucamohedano/char-lstm-shakespeare

Updated Sep 22

liked a dataset 2 months ago

karpathy/tiny_shakespeare

Updated Jan 18 • 1.94k • 43

updated a model 2 months ago

jucamohedano/char-lstm-shakespeare_

Updated Sep 21

liked a model 6 months ago

microsoft/Phi-3-vision-128k-instruct

Text Generation • Updated Aug 20 • 155k • 931

upvoted an article 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14

• 211

Reacted to merve's post with 🚀 6 months ago

Post

New open Vision Language Model by @Google : PaliGemma 💙🤍

📝 Comes as 3B pretrained, mix, and fine-tuned models at 224, 448, and 896 resolution
🧩 Combination of Gemma 2B LLM and SigLIP image encoder
🤗 Supported in transformers

PaliGemma can do:
🧩 Image segmentation and detection! 🤯
📑 Detailed document understanding and reasoning
🙋 Visual question answering, captioning and any other VLM task!

Read our blog 🔖 hf.co/blog/paligemma
Try the demo 🪀 hf.co/spaces/google/paligemma
Check out the Spaces and the models all in the collection 📚 google/paligemma-release-6643a9ffbf57de2ae0448dda
Collection of fine-tuned PaliGemma models google/paligemma-ft-models-6643b03efb769dad650d2dda