a
fredbarre
AI & ML interests
None yet
Recent Activity
reacted
to
csabakecskemeti's
post
with ๐
2 days ago
I've run the open llm leaderboard evaluations + hellaswag on https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared to https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct and at first glance R1 do not beat Llama overall.
If anyone wants to double check the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results
Am I made some mistake, or (at least this distilled version) not as good/better than the competition?
I'll run the same on the Qwen 7B distilled version too.
liked
a model
about 2 months ago
showlab/ShowUI-2B
liked
a model
3 months ago
Alfitaria/Q25-1.5B-VeoLu
Organizations
None yet
models
None public yet
datasets
None public yet