Merve Noyan's picture

Merve Noyan

merve

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

VLMs, vision & co

Recent Activity

posted an update about 3 hours ago

Google just released PaliGemma 2 Mix: new versatile instruction vision language models 🔥 > Three new models: 3B, 10B, 28B with res 224, 448 💙 > Can do vision language tasks with open-ended prompts, understand documents, and segment or detect anything 🤯 Read more https://huggingface.co/blog/paligemma2mix Try the demo https://huggingface.co/spaces/google/paligemma2-10b-mix All models are here https://huggingface.co/collections/google/paligemma-2-mix-67ac6a251aaf3ee73679dcc4

updated a dataset about 6 hours ago

huggingface/documentation-images

liked a dataset 1 day ago

Salesforce/CogAlign

View all activity

Organizations

Posts 98

Post

201

Google just released PaliGemma 2 Mix: new versatile instruction vision language models 🔥

> Three new models: 3B, 10B, 28B with res 224, 448 💙
> Can do vision language tasks with open-ended prompts, understand documents, and segment or detect anything 🤯

Read more https://huggingface.co/blog/paligemma2mix
Try the demo google/paligemma2-10b-mix
All models are here google/paligemma-2-mix-67ac6a251aaf3ee73679dcc4

Articles 21

Article

1.04k

Open-source DeepResearch – Freeing our search agents

View all Articles

Collections 44

spaces 104

Vision Papers

All paper summaries read by Merve

Running on Zero

Paligemma2 Vqav2

PaliGemma2 LoRA finetuned on VQAv2

Running on Zero

OWLSAM

State-of-the-art open-vocabulary image segmentation ⚡️

Sam2.1

SuperPoint

Identify key points in an image

Running on CPU Upgrade

Gradio Tgi

models 89

merve/colpali_ufo

Updated Dec 20, 2024 • 1

merve/paligemma_vqav2

Image-Text-to-Text • Updated Dec 18, 2024 • 215 • 13

merve/paligemma2-3b-vqav2

Updated Dec 5, 2024 • 2.72k • 6

merve/google-ckpts

Updated Oct 22, 2024

merve/google-tokenizers

Updated Oct 22, 2024

merve/idefics3-llama-vqav2

Updated Sep 11, 2024

merve/idefics3llama-vqav2

Updated Sep 11, 2024 • 8

merve/flux-dreambooth-lora

Updated Aug 16, 2024 • 1

merve/trained-flux-lora-lego

Text-to-Image • Updated Aug 16, 2024 • 8 • • 1

merve/flux-lego-lora-dreambooth

Text-to-Image • Updated Aug 16, 2024 • 372 • • 13

datasets 26

merve/model-test-inputs

Updated Oct 21, 2024 • 31

merve/vqav2-small

Viewer • Updated Aug 8, 2024 • 21.4k • 1.27k • 9

merve/SGinW

Viewer • Updated Jul 11, 2024 • 16.7k • 2.55k • 1

merve/pascal-voc

Viewer • Updated Jul 6, 2024 • 336k • 756

merve/YouCook2

Viewer • Updated May 28, 2024 • 2k • 55

merve/faiss_embeddings

Updated Jan 25, 2024 • 9

merve/pokemon-ds-embeddings

Viewer • Updated Jan 10, 2024 • 833 • 70 • 4

merve/tr-h4-norobots

Updated Jan 7, 2024 • 86 • 10

merve/lego_sets_latest

Viewer • Updated Jan 6, 2024 • 61 • 357 • 4

merve/ai-tube-dummy

Updated Dec 1, 2023 • 38