WANG Jiong's picture

3 8 9

WANG Jiong

wjwow

·

wangjiongw

AI & ML interests

None yet

Organizations

None yet

wjwow's activity

upvoted a paper 24 days ago

FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model

Paper • 2410.13925 • Published 29 days ago • 21

upvoted a paper 5 months ago

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12 • 39

upvoted a paper 8 months ago

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Paper • 2403.11703 • Published Mar 18 • 16

upvoted 2 papers 10 months ago

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Paper • 2401.16420 • Published Jan 29 • 55

From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

Paper • 2401.15071 • Published Jan 26 • 35

upvoted a collection 11 months ago

Recent models: last 100 repos, sorted by creation date

The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 505

upvoted 2 papers over 1 year ago

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

Paper • 2308.03729 • Published Aug 7, 2023 • 9

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

Paper • 2307.14620 • Published Jul 27, 2023 • 13