Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published 30 days ago • 15
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published 30 days ago • 17
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models Paper • 2410.17637 • Published 30 days ago • 34
Scalable Ranked Preference Optimization for Text-to-Image Generation Paper • 2410.18013 • Published 30 days ago • 14
DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes Paper • 2410.18084 • Published 30 days ago • 12
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published about 1 month ago • 24
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Paper • 2410.13924 • Published Oct 17 • 6
TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts Paper • 2410.18071 • Published 30 days ago • 6
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20 • 10
DinoBloom: A Foundation Model for Generalizable Cell Embeddings in Hematology Paper • 2404.05022 • Published Apr 7 • 2
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Paper • 2410.05262 • Published Oct 7 • 9
TLDR: Token-Level Detective Reward Model for Large Vision Language Models Paper • 2410.04734 • Published Oct 7 • 16