11 38 159

dinhanhx

dinhanhx

AI & ML interests

Vision Language

Recent Activity

liked a model 1 day ago

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8

liked a model 8 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

liked a Space 9 days ago

artificialguybr/Surya-OCR

View all activity

Organizations

dinhanhx's activity

upvoted a paper 29 days ago

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 6

upvoted an article about 1 month ago

Article

Vision Language Models Explained

Apr 11, 2024

• 294

upvoted 3 articles about 2 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 165

Article

Visual Document Retrieval Goes Multilingual

Jan 10

• 70

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 825

upvoted a paper 3 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 22

upvoted 4 collections 5 months ago

upvoted an article 5 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 201

upvoted 2 collections 6 months ago

VisionLM

Collection

855 items • Updated about 6 hours ago • 47

Awesome Document AI

Collection

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11, 2024 • 80

upvoted a paper 6 months ago

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding

Paper • 2407.12594 • Published Jul 17, 2024 • 19

upvoted an article 6 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 185