Spaces:

HuggingFaceH4
/

open_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

Resources

View closed (726)

💬 Discussion thread: Model contamination techniques 💬

#472 opened 5 months ago by

Future feature: system prompt and chat support

#459 opened 5 months ago by

💬 Discussion thread: Model scores and model performances 💬

#265 opened 8 months ago by

💎 Resources and community initiatives around the Leaderboard! 💎

#174 opened 9 months ago by

Models that used Nectar dataset

#749 opened about 2 hours ago by

apply-ruff-black

#748 opened 1 day ago by

Feature Request: Multilingual Evaluations 🌐

#745 opened 4 days ago by

Understanding raw result data files

#729 opened 15 days ago by

TRI-ML/mamba-7b-rw failed

#704 opened 25 days ago by

GPTQ and Mixtral models will need to be relaunched

#692 opened 29 days ago by

ALL Jamba models failing

#690 opened 30 days ago by

No good way to identify number of activated parameters causes MIxtral evaluation failures

#680 opened about 1 month ago by

Crowd-Source Hardware for the LeaderBoard?

#570 opened 4 months ago by

Eval models for data contamination?

#561 opened 4 months ago by

Feature request: Run 100B + models automatically

#434 opened 5 months ago by

Feature Request for Leaderboard: date added to hub

#425 opened 6 months ago by

Feature request: Using weights hash to identify duplicates

#422 opened 6 months ago by

Feature request: Add non AutoModelForCausalLM models

#391 opened 6 months ago by

KnutJaegersberg

Tool: Adding evaluation results to model cards

#370 opened 6 months ago by

Feature suggestion: average of selected (rather than all) columns

#368 opened 6 months ago by

Tool: Open LLM Leaderboard Model Renamer

#310 opened 7 months ago by

Checking for toxicity too

#53 opened 12 months ago by

ronald-d-rogers