Spaces:

optimum
/

llm-perf-leaderboard

Running

Thanks for merging @baptistecolle , for a little more context, I just upstreamed one popular quantization flavor in torchao (int4_weight_only) for now to get some initial perf data for now, in the future I also plan to upstream another one called "autoquant" (https://github.com/pytorch/ao/blob/main/torchao/quantization/README.md#autoquantization) that will be able to automatically search through all available quantization flavors in torchao and get the best performing model on the specific hardware, under some accuracy constraint (sqnr).

jerryzh168

Dec 12, 2024

also how is the leaderboard updated? is it documented somewhere?

baptistecolle

Hugging Face Optimum org Dec 13, 2024

I just made the repo for the backend of the leaderboard public (again)
https://github.com/huggingface/llm-perf-backend

The documentation is lacking for now as it is more an internal tool to manage the leaderboard

jerryzh168

Dec 21, 2024

@baptistecolle thanks! when will we be able to see the update to the dashboard itself? Just trying to make sure the changes are reflected in the dashboard

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment