Jonathan Smith

blueDragon23

AI & ML interests

None yet

Organizations

None yet

blueDragon23's activity

upvoted 7 papers 2 days ago

Mitigating Object Hallucination via Concentric Causal Attention

Paper • 2410.15926 • Published Oct 21 • 16

xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs

Paper • 2410.16267 • Published Oct 21 • 17

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21 • 22

Shiksha: A Technical Domain focused Translation Dataset and Model for Indian Languages

Paper • 2412.09025 • Published 4 days ago • 4

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation

Paper • 2412.09349 • Published 3 days ago • 5

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

Paper • 2412.08972 • Published 4 days ago • 8

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published 3 days ago • 19