Zhaorun committed
Commit b1b188a · verified · 1 Parent(s): 7947d8b

Update README.md

Files changed (1)
  1. README.md +12 -3
README.md CHANGED
@@ -7,7 +7,16 @@ sdk: static
 pinned: false
 ---
 
-# πŸ‘©β€βš–οΈ [**MJ-Bench**: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?](https://mj-bench.github.io/)
+# MJ-Bench Team: Align
+
+
+## 😎 [**MJ-Video**: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation](https://aiming-lab.github.io/MJ-VIDEO.github.io/)
+
+We release MJ-Bench-Video, a comprehensive fine-grained video preference benchmark, and MJ-Video, a powerful MoE-based multi-dimensional video reward model!
+
+
+
+## πŸ‘©β€βš–οΈ [**MJ-Bench**: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?](https://mj-bench.github.io/)
 
 Project page: https://mj-bench.github.io/
 Code repository: https://github.com/MJ-Bench/MJ-Bench
@@ -16,7 +25,7 @@ While text-to-image models like DALLE-3 and Stable Diffusion are rapidly prolife
 
 To address this issue, we introduce MJ-Bench, a novel benchmark which incorporates a comprehensive preference dataset to evaluate multimodal judges in providing feedback for image generation models across four key perspectives: **alignment**, **safety**, **image quality**, and **bias**.
 
-![Dataset Overview](https://raw.githubusercontent.com/MJ-Bench/MJ-Bench.github.io/main/static/images/dataset_overview.png)
+<!-- ![Dataset Overview](https://raw.githubusercontent.com/MJ-Bench/MJ-Bench.github.io/main/static/images/dataset_overview.png) -->
 
 Specifically, we evaluate a large variety of multimodal judges including
 
@@ -24,7 +33,7 @@ Specifically, we evaluate a large variety of multimodal judges including
 - 11 open-source VLMs (e.g., the LLaVA family)
 - 4 closed-source VLMs (e.g., GPT-4o, Claude 3)
 -
-![Radar Plot](https://raw.githubusercontent.com/MJ-Bench/MJ-Bench.github.io/main/static/images/radar_plot.png)
+<!-- ![Radar Plot](https://raw.githubusercontent.com/MJ-Bench/MJ-Bench.github.io/main/static/images/radar_plot.png) -->
 
 
 πŸ”₯πŸ”₯ We are actively updating the [leaderboard](https://mj-bench.github.io/), and you are welcome to submit the evaluation results of your multimodal judge on [our dataset](https://huggingface.co/datasets/MJ-Bench/MJ-Bench) to the [Hugging Face leaderboard](https://huggingface.co/spaces/MJ-Bench/MJ-Bench-Leaderboard).
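
To make the submission workflow above more concrete, here is a minimal sketch of loading the preference data with the Hugging Face `datasets` library and computing a pairwise accuracy for a candidate judge. Only the repository id `MJ-Bench/MJ-Bench` comes from the links above; the config name, split, and column names used below (`caption`, `image0`, `image1`) are illustrative assumptions, not the dataset's documented schema.

```python
# Minimal sketch: score a multimodal judge on MJ-Bench-style image preference pairs.
# Assumptions (not taken from the official docs): the config "alignment", the split
# "test", and the columns "caption", "image0" (preferred), "image1" (rejected)
# are placeholders for illustration only.
from datasets import load_dataset

def pairwise_accuracy(judge_fn, examples):
    """Fraction of pairs where the judge scores the preferred image higher.

    judge_fn(prompt: str, image) -> float is any user-supplied reward/judge model.
    """
    correct = 0
    for ex in examples:
        preferred = judge_fn(ex["caption"], ex["image0"])
        rejected = judge_fn(ex["caption"], ex["image1"])
        correct += preferred > rejected
    return correct / len(examples)

if __name__ == "__main__":
    # Hypothetical config/split names; check the dataset card for the real ones.
    data = load_dataset("MJ-Bench/MJ-Bench", "alignment", split="test")
    # Plug in your own judge here, e.g. a CLIP-style scorer or a VLM prompted to rate the image.
    dummy_judge = lambda prompt, image: 0.0
    print(f"pairwise accuracy: {pairwise_accuracy(dummy_judge, data):.3f}")
```

If the leaderboard expects a different metric or schema, adapt the loop accordingly; this sketch only illustrates the pairwise-preference evaluation idea behind the benchmark.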