fredbarre

AI & ML interests

None yet

Organizations

None yet

fredbarre's activity

reacted to csabakecskemeti's post with 👀 3 days ago

Post

2252

I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results with meta-llama/Llama-3.1-8B-Instruct. At first glance, the R1 distill does not beat Llama overall.

If anyone wants to double-check, the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results

Did I make a mistake, or is this distilled version, at least, simply no better than the competition?

I'll run the same evaluations on the Qwen 7B distilled version too.

7 replies

liked a model about 2 months ago

showlab/ShowUI-2B

Updated about 11 hours ago • 25.3k • 227

liked 3 models 3 months ago

Alfitaria/Q25-1.5B-VeoLu

Updated Dec 9, 2024 • 8

fblgit/TheBeagle-v2beta-32B-MGS

Text Generation • Updated Oct 26, 2024 • 445 • 13

microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 1.61k • 1.54k

liked 2 models 4 months ago

Felladrin/Llama-160M-Chat-v1

Text Generation • Updated Jul 25, 2024 • 655 • 19

kenhktsui/nano-phi-192M-v0.1

Text Generation • Updated May 8, 2024 • 112 • 1

liked a model 5 months ago

google/siglip-so400m-patch14-224

Zero-Shot Image Classification • Updated Aug 23, 2024 • 9.95k • 52

liked a Space 6 months ago

FLUX.1 [Inpainting]

upvoted a paper 6 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

liked a model 6 months ago

Niansuh/Prompt-Guard-86M

Text Classification • Updated Jul 30, 2024 • 657 • 2

upvoted a collection 7 months ago

🇫🇷 Calme-2

New Calme-2 fine-tuned models • 30 items • Updated 29 days ago • 4

liked a Space 7 months ago

Running on Zero

SD3 Long Captioner

upvoted a paper 8 months ago

Depth Anything V2

Paper • 2406.09414 • Published Jun 13, 2024 • 96

liked a Space 8 months ago

Running on A10G

Consistent Character

Create images of a given character in different poses

liked a model 8 months ago

fofr/consistent-character-weights

Updated Jun 5, 2024 • 4