JudgeLM: Fine-tuned Large Language Models are Scalable Judges (Paper • 2310.17631 • Published Oct 26, 2023)
Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots