openbmb/UltraLM-65b's score is worse than "meta-llama/Llama-2-70b-hf" but the leaderboard says it is better

#171
by mhemetfaik - opened

openbmb/UltraLM-65b's scores:

67.1 + 85 + 63.5 + 53.5 = 269.1

meta-llama/Llama-2-70b-hf's scores:

67.3 + 87.3 + 69.8 + 44.9 = 269.3
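The sums above translate into averages of about 67.275 vs 67.325, which both display as 67.3 at one decimal place. A quick sketch (assuming the displayed score is the plain mean of the four benchmark numbers quoted above) shows why a sort on the rounded value cannot tell the two models apart:

```python
# Per-benchmark scores quoted in the post.
ultralm_65b = [67.1, 85.0, 63.5, 53.5]   # openbmb/UltraLM-65b
llama2_70b = [67.3, 87.3, 69.8, 44.9]    # meta-llama/Llama-2-70b-hf

ultralm_avg = sum(ultralm_65b) / len(ultralm_65b)  # ~67.275
llama2_avg = sum(llama2_70b) / len(llama2_70b)     # ~67.325

# Llama-2-70b-hf's true average is higher...
print(llama2_avg > ultralm_avg)  # True

# ...but both averages round to the same value at one decimal place,
# so sorting on the rounded value can put them in either order.
print(round(ultralm_avg, 1), round(llama2_avg, 1))  # 67.3 67.3
```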

Leaderboard screenshot: [image attached]

Open LLM Leaderboard org

Hi @mhemetfaik !
That's a very good point! We are sorting after rounding; I'll fix it to prevent these edge cases.

Open LLM Leaderboard org
edited Aug 7, 2023

Hi!
I changed the rounding to 2 decimal points, which should fix most cases.
We'll still get edge cases (hopefully very rarely), as separating the displayed rounding from the underlying numbers is not yet possible in Gradio. I also opened an issue there, and I'll port their fix once it's done.
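The fix can be sketched as: sort on the full-precision average and round only for display. (The model names and scores below are just the two from this thread; the actual leaderboard code is Gradio-based and differs.)

```python
models = {
    "openbmb/UltraLM-65b": (67.1 + 85.0 + 63.5 + 53.5) / 4,
    "meta-llama/Llama-2-70b-hf": (67.3 + 87.3 + 69.8 + 44.9) / 4,
}

# Buggy: sort on the value that is displayed (rounded to 1 decimal).
# Both models round to 67.3, so their relative order is arbitrary.
buggy = sorted(models, key=lambda m: round(models[m], 1), reverse=True)

# Fixed: sort on the full-precision average, round only when printing.
fixed = sorted(models, key=models.get, reverse=True)
for name in fixed:
    print(f"{name}: {models[name]:.2f}")
```

With the fixed ordering, meta-llama/Llama-2-70b-hf comes first, matching the raw sums in the original post. Rounding to two decimals also separates the displayed values here, which is why it fixes most (but not all) such ties.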

Thank you very much for raising!

clefourrier changed discussion status to closed

Thanks for your interest. As you said, this will fix most cases; thank you for fixing it so quickly.
