Spaces: Running on CPU Upgrade
💬 Discussion thread: Model contamination techniques 💬
pinned
33
#472 opened 5 months ago
by
clefourrier
Future feature: system prompt and chat support
pinned
21
#459 opened 5 months ago
by
clefourrier
💬 Discussion thread: Model scores and model performances 💬
pinned
70
#265 opened 8 months ago
by
clefourrier
💎 Resources and community initiatives around the Leaderboard! 💎
pinned
#174 opened 9 months ago
by
clefourrier
Models that used Nectar dataset
#749 opened about 2 hours ago
by
Stark2008
apply-ruff-black
3
#748 opened 1 day ago
by
alozowski
Feature Request: Multilingual Evaluations 🌐
#745 opened 4 days ago
by
eliot-christon
Understanding raw result data files
3
#729 opened 15 days ago
by
jerome-white
TRI-ML/mamba-7b-rw failed
8
#704 opened 25 days ago
by
devingulliver
GPTQ and Mixtral models will need to be relaunched
6
#692 opened 29 days ago
by
CombinHorizon
ALL Jamba models failing
15
#690 opened 30 days ago
by
devingulliver
No good way to identify number of activated parameters causes Mixtral evaluation failures
25
#680 opened about 1 month ago
by
0-hero
Crowd-Source Hardware for the LeaderBoard?
4
#570 opened 4 months ago
by
ibivibiv
Eval models for data contamination?
2
#561 opened 4 months ago
by
liyucheng
Feature request: Run 100B+ models automatically
12
#434 opened 5 months ago
by
ChuckMcSneed
Feature Request for Leaderboard: date added to hub
2
#425 opened 6 months ago
by
madmaxbr5
Feature request: Using weights hash to identify duplicates
1
#422 opened 6 months ago
by
mrfakename
Feature request: Add non AutoModelForCausalLM models
3
#391 opened 6 months ago
by
KnutJaegersberg
Tool: Adding evaluation results to model cards
46
#370 opened 6 months ago
by
Weyaxi
Feature suggestion: average of selected (rather than all) columns
4
#368 opened 6 months ago
by
Minus0
Tool: Open LLM Leaderboard Model Renamer
31
#310 opened 7 months ago
by
Weyaxi
Checking for toxicity too
9
#53 opened 12 months ago
by
ronald-d-rogers