Q-Ground: Image Quality Grounding with Large Multi-modality Models Paper • 2407.17035 • Published Jul 24 • 1
LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding Paper • 2407.15754 • Published Jul 22 • 19
Iterative Token Evaluation and Refinement for Real-World Super-Resolution Paper • 2312.05616 • Published Dec 9, 2023
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D Paper • 2311.16918 • Published Nov 28, 2023 • 9