7

Xuefeng Hu

bytehxf

AI & ML interests

None yet

Organizations

bytehxf's activity

upvoted 2 papers 5 days ago

MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

Paper • 2502.17422 • Published 13 days ago • 7

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 6 days ago • 65

upvoted a paper 16 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 17 days ago • 128

upvoted an article 16 days ago

SigLIP 2: A better multilingual vision language encoder

Article • 17 days ago • 126

upvoted a collection 3 months ago

Cambrian Data

3 items • Updated Jun 25, 2024 • 10

authored a paper 6 months ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 48

upvoted a paper 6 months ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 48

upvoted a paper 8 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 40