2 8 48

Tan Minh Tran

minhtt32

tanminhtran168

AI & ML interests

Natural Language Processing

Recent Activity

liked a model 3 days ago

erax-ai/EraX-VL-7B-V1.5

liked a Space 11 days ago

Qwen/QVQ-72B-preview

upvoted a paper 12 days ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

View all activity

Organizations

minhtt32's activity

liked a model 3 days ago

erax-ai/EraX-VL-7B-V1.5

Visual Question Answering • Updated 5 days ago • 738 • 3

liked a Space 11 days ago

Running

427

🌍

QVQ 72B Preview

upvoted 2 papers 12 days ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published 18 days ago • 23

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 17 days ago • 334

upvoted a paper 24 days ago

Agent-as-a-Judge: Evaluate Agents with Agents

Paper • 2410.10934 • Published Oct 14, 2024 • 18

upvoted a paper 25 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 30 days ago • 123

liked a Space about 1 month ago

Running

582

🌖

Qwen2-VL-72B

liked a dataset about 1 month ago

5CD-AI/LLaVA-CoT-o1-Instruct

Viewer • Updated Nov 27, 2024 • 58.5k • 533 • 64

liked a dataset about 2 months ago

5CD-AI/Viet-Chart-VQA

Viewer • Updated Nov 17, 2024 • 45.3k • 31 • 4

upvoted a paper about 2 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 112

liked 3 models about 2 months ago

upvoted a paper about 2 months ago

DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation

Paper • 2411.04999 • Published Nov 7, 2024 • 17

liked 2 datasets 2 months ago

5CD-AI/Viet-Table-Markdown

Viewer • Updated Nov 17, 2024 • 64.9k • 77 • 10

hewei2001/ReachQA

Viewer • Updated Oct 28, 2024 • 22k • 61 • 4

liked 2 models 2 months ago

arcee-ai/Arcee-VyLinh

Text Generation • Updated Oct 30, 2024 • 41.7k • 27

5CD-AI/Vintern-3B-beta

Image-Text-to-Text • Updated 30 days ago • 985 • 31

liked a Space 3 months ago

Running on Zero

🌍

Chat With Janus 1.3B

A unified multimodal understanding and generation model.

liked a model 3 months ago

deepseek-ai/Janus-1.3B

Any-to-Any • Updated Nov 14, 2024 • 8.65k • 491