ColorFlow: Retrieval-Augmented Image Sequence Colorization Paper β’ 2412.11815 β’ Published 6 days ago β’ 26
BrushEdit: All-In-One Image Inpainting and Editing Paper β’ 2412.10316 β’ Published 9 days ago β’ 33
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Paper β’ 2412.09645 β’ Published 12 days ago β’ 34
Byte Latent Transformer: Patches Scale Better Than Tokens Paper β’ 2412.09871 β’ Published 9 days ago β’ 68
Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models Paper β’ 2412.12606 β’ Published 5 days ago β’ 40
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Paper β’ 2412.13018 β’ Published 5 days ago β’ 39
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Paper β’ 2412.14171 β’ Published 4 days ago β’ 19
AnySat: An Earth Observation Model for Any Resolutions, Scales, and Modalities Paper β’ 2412.14123 β’ Published 4 days ago β’ 11