Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

Code for evaluating new models?

#620

by YannDubs - opened Mar 5, 2024

Mar 5, 2024

Hi, is the exact script used to run the open_lm_leaderboard open-sourced? I only found vague commands that suggest using lm_eval.

Thanks!

Open LLM Leaderboard org Mar 5, 2024

Hi!
You can find the precise steps to reproduce our evaluations in the About tab - the Open LLM Leaderboard uses the harness for evaluation.

clefourrier changed discussion status to closed Mar 5, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment