a's picture

3 12

a

fredbarre

·

AI & ML interests

None yet

Recent Activity

reacted to csabakecskemeti's post with 👀 about 2 months ago

I've run the open llm leaderboard evaluations + hellaswag on https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared to https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct and at first glance R1 do not beat Llama overall. If anyone wants to double check the results are posted here: https://github.com/csabakecskemeti/lm_eval_results Am I made some mistake, or (at least this distilled version) not as good/better than the competition? I'll run the same on the Qwen 7B distilled version too.

liked a model 3 months ago

showlab/ShowUI-2B

liked a model 4 months ago

Alfitaria/Q25-1.5B-VeoLu

View all activity

Organizations

None yet

models

None public yet

datasets

None public yet