Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

1078

Models disappearing from eval queue?

#805

by ArkaAbacus - opened Jun 27, 2024

Discussion

ArkaAbacus

Jun 27, 2024

Hello,

We added Smaug-Llama-3-70B-Instruct and Smaug-Qwen2-72B-Instruct to the new LLM leaderboard eval queue yesterday, but it seems they have disappeared today and also not yet turned up on the leaderboard.

Any idea what might have happened? Should we resubmit?

clefourrier

Open LLM Leaderboard org Jun 28, 2024

Hi!
I think you could check our FAQ :)
TLDR: Either we have a problem with the display atm, or they ran but failed. You'll get this info by looking for their request and result files.

nlpguy

Jun 28, 2024

•

edited Jun 28, 2024

@clefourrier From what I've seen the leaderboard does not update until restarted. Whether that is intentional or not, restarts from time to time would be nice until there is a better solution.

clefourrier

Open LLM Leaderboard org Jun 28, 2024

Interesting!
We actually have a new system with webhooks, where the leaderboard should be updated max 10 min after a change on our datasets (redownloaded with every change) - I'll take a look again at this

ArkaAbacus changed discussion status to closed Jun 28, 2024

ArkaAbacus changed discussion status to open Jun 28, 2024

ArkaAbacus

Jun 28, 2024

Thanks for the pointer. I found the status of Smaug-Llama:

{
"model": "abacusai/Smaug-Llama-3-70B-Instruct",
"base_model": "",
"revision": "8f558d6211b9d8f1712b80df40c5b65bea0b56ea",
"precision": "bfloat16",
"params": 70.554,
"architectures": "LlamaForCausalLM",
"weight_type": "Original",
"status": "FAILED",
"submitted_time": "2024-06-26T16:29:36Z",
"model_type": "\ud83d\udd36 : \ud83d\udd36 fine-tuned on domain-specific datasets",
"job_id": "7215733",
"job_start_time": "2024-06-27T00:10:42.571625",
"use_chat_template": true
}

It's not clear what caused the FAILURE - we know the model files are non-corrupt as it worked fine on the old leaderboard. In any case, I've resubmitted for now.

clefourrier

Open LLM Leaderboard org Jun 28, 2024

Hi! Please do not try to resubmit models which failed!
Instead, give us the link to the request file so we can investigate and relaunch if necessary!

ArkaAbacus

Jun 28, 2024

Ah - my apologies, I've already resubmitted. The requests file was originally here: https://huggingface.co/datasets/open-llm-leaderboard/requests/blob/main/abacusai/Smaug-Llama-3-70B-Instruct_eval_request_False_bfloat16_Original.json

although it has now been updated since I have resubmitted.

clefourrier

Open LLM Leaderboard org Jun 28, 2024

•

edited Jun 28, 2024

It should no longer be possible to resubmit a model which was already submitted, so thanks for raising the issue, at least this has been fixed.

Re-Smaug, it got preempted - normally the job should have been rescheduled but apparently was not, tagging @SaylorTwift - note that since it's PENDING again, it will be relaunched soon, when there is enough space on the cluster

clefourrier changed discussion status to closed Jun 28, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment