arxiv:2412.10345
Jianwei Yang
jw2yang
AI & ML interests
Computer Vision, Vision and Language, Machine Learning
Recent Activity
authored
a paper
4 days ago
TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for
Generalist Robotic Policies
authored
a paper
6 days ago
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary
Embedding Distillation
authored
a paper
16 days ago
Florence-VL: Enhancing Vision-Language Models with Generative Vision
Encoder and Depth-Breadth Fusion
Organizations
Papers
17
models
None public yet
datasets
None public yet