Mert Ege

mertege

AI & ML interests

None yet

mertege's activity

liked a Space 22 days ago

nanotron/ultrascale-playbook

2.24k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 24 days ago

ALLaM-AI/ALLaM-7B-Instruct-preview

Text Generation • Updated 2 days ago • 10.1k • 93

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 18 days ago • 1.6M • 1.26k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 2.52M • 11.3k

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

New activity in kashif/gkd_openassistant-guanaco 5 months ago

Chat template on GKD Trainer

#1 opened 5 months ago by mertege

liked a dataset 5 months ago

abdoelsayed/Open-ArabicaQA

Preview • Updated Mar 27, 2024 • 229 • 5

liked a dataset 6 months ago

BAAI/Infinity-Instruct

Viewer • Updated 17 days ago • 20.4M • 5.73k • 602

liked a model 6 months ago

maywell/Qwen2-7B-Multilingual-RP

Text Generation • Updated Jun 25, 2024 • 2.12k • 55

liked a dataset 6 months ago

macadeliccc/opus_samantha

Viewer • Updated Jun 21, 2024 • 3.19k • 103 • 21

liked 3 models 6 months ago

liked a Space 7 months ago

140

Open Arabic LLM Leaderboard

🏆

Track, rank and evaluate open Arabic LLMs and chatbots

upvoted an article 7 months ago

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Article • Published Jan 19, 2021 • 4

liked a model 8 months ago

haoranxu/ALMA-13B-Pretrain

Text Generation • Updated Oct 5, 2024 • 1.71k • 9

liked a dataset 9 months ago

mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 418k • 209

upvoted a paper 9 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

liked a Space 9 months ago

Magpie

🐦

Generate and rate instruction-response pairs