MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation Paper β’ 2407.00468 β’ Published Jun 29 β’ 34