Ali El Filali

alielfilali01

AI & ML interests

AI Psychometrician? | NLP (mainly for Arabic) | Other interests include Reinforcement Learning and Cognitive Sciences, among others

Recent Activity

upvoted an article 2 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

commented on an article 2 days ago

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

upvoted an article 2 days ago

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

Articles

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

Introducing the Open Arabic LLM Leaderboard

alielfilali01's activity

upvoted 2 articles 2 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

3 days ago

• 98

Article

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

9 days ago

• 14

upvoted a paper 3 days ago

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Paper • 2402.01781 • Published Feb 1, 2024 • 2

upvoted a paper 9 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published 10 days ago • 83

upvoted a paper 15 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 16 days ago • 47

upvoted a paper 16 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 77

upvoted a collection 19 days ago

Deepseek Papers

Deepseek papers collection • 14 items • Updated 19 days ago • 9

upvoted 2 papers 19 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 22 days ago • 23

Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation

Paper • 2412.15255 • Published Dec 15, 2024 • 3

upvoted a paper 29 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 30 days ago • 340

upvoted a collection 30 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 10 days ago • 78

upvoted a collection about 1 month ago

Multilingual LLM Evaluation

Multilingual Evaluation Benchmarks • 6 items • Updated Dec 13, 2024 • 9

upvoted a paper about 1 month ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 103

upvoted 4 collections about 1 month ago

🧪 FineWeb v1 data experiments

Ablation models trained for our data experiments. • 22 items • Updated Jun 12, 2024 • 4

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12, 2024 • 35

AraDICE

AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs • 12 items • Updated Dec 13, 2024 • 4

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 128

upvoted 2 articles about 1 month ago

Article

Rethinking Backpropagation: Thoughts on What's Wrong with Backpropagation

Dec 2, 2024

• 5

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

Dec 8, 2024

• 21

upvoted a collection about 1 month ago

🥂 FineWeb2

3 items • Updated Dec 8, 2024 • 12