A newer version of the Gradio SDK is available:
5.14.0
metadata
title: Evalica
emoji: π
colorFrom: green
colorTo: purple
sdk: gradio
python_version: 3.11
sdk_version: 5.12.0
app_file: app.py
pinned: true
license: apache-2.0
Evalica
Evalica is an easy-to-use tool transforms pairwise comparisons (aka side-by-side) to a meaningful ranking of items.
- Ustalov, D. Reliable, Reproducible, and Really Fast Leaderboards with Evalica. 2025. Proceedings of the 31st International Conference on Computational Linguistics: System Demonstrations. 46β53. arXiv: 2412.11314 [cs.CL].
Chatbot Arena dataset chatbot_arena_20240814.csv
was derived from the clean_battle_20240814_public.json dataset available from https://lmarena.ai/.