Ali El Filali

alielfilali01

AI & ML interests

AI Psychometrician ? | NLP (mainly for Arabic) | Other interests include Reinforcement Learning and Cognitive sciences among others

Recent Activity

updated a dataset about 13 hours ago

OALL/requests

new activity about 20 hours ago

inceptionai/jais-30b-chat-v3:VLLM is not supporting JAISLMHeadModel

new activity about 20 hours ago

inceptionai/jais-30b-chat-v3:Does not stop generation

View all activity

Articles

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

about 1 month ago

• 29

Introducing the Open Arabic LLM Leaderboard

May 14, 2024

• 77

Organizations

alielfilali01's activity

updated a dataset about 13 hours ago

OALL/requests

Updated about 9 hours ago • 9.23k

New activity in inceptionai/jais-30b-chat-v3 about 20 hours ago

VLLM is not supporting JAISLMHeadModel

#3 opened 9 months ago by

MayaBsat02

Does not stop generation

#5 opened 8 months ago by

jeril

Access to Arabic translated evaluation dataset

#8 opened 7 months ago by

xaviermuller

Google Colab pro+ TPU and JAIS 30B

#10 opened 6 months ago by

HanaRasheed

Request: DOI

#12 opened 2 days ago by

HanaRasheed

I need a help to overcome Cuda out of memory

#13 opened 2 days ago by

HanaRasheed

Which Word Embedding to be used along Jais LLM model?

#9 opened 7 months ago by

mhaseeb1604

upvoted a paper about 20 hours ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 25 days ago • 72

reacted to merve's post with ❤️ about 20 hours ago

Post

3268

supercharge your LLM apps with smolagents 🔥

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by Hugging Face to make the LLM write code, do analysis and automate boring stuff!

Here's our blog for you to get started https://huggingface.co/blog/smolagents

updated a dataset 3 days ago

inceptionai/requests-dataset

Viewer • Updated 3 days ago • 54 • 983 • 1

upvoted a collection 4 days ago

Deepseek Papers

Collection

Deepseek papers collection • 14 items • Updated 4 days ago • 8

upvoted a paper 4 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published 7 days ago • 9

reacted to suayptalha's post with ❤️ 4 days ago

Post

1740

🚀 Introducing 𝐅𝐢𝐫𝐬𝐭 𝐇𝐮𝐠𝐠𝐢𝐧𝐠 𝐅𝐚𝐜𝐞 𝐈𝐧𝐭𝐞𝐠𝐫𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐦𝐢𝐧𝐆𝐑𝐔 𝐌𝐨𝐝𝐞𝐥𝐬 from the paper 𝐖𝐞𝐫𝐞 𝐑𝐍𝐍𝐬 𝐀𝐥𝐥 𝐖𝐞 𝐍𝐞𝐞𝐝𝐞𝐝?

🖥 I have integrated 𝐧𝐞𝐱𝐭-𝐠𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐨𝐧 𝐑𝐍𝐍𝐬, specifically minGRU, which offer faster performance compared to Transformer architectures, into HuggingFace. This allows users to leverage the lighter and more efficient minGRU models with the "𝐭𝐫𝐚𝐧𝐬𝐟𝐨𝐫𝐦𝐞𝐫𝐬" 𝐥𝐢𝐛𝐫𝐚𝐫𝐲 for both usage and training.

💻 I integrated two main tasks: 𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐒𝐞𝐪𝐮𝐞𝐧𝐜𝐞𝐂𝐥𝐚𝐬𝐬𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧 and 𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐂𝐚𝐮𝐬𝐚𝐥𝐋𝐌.

𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐒𝐞𝐪𝐮𝐞𝐧𝐜𝐞𝐂𝐥𝐚𝐬𝐬𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧:
You can use this class for 𝐒𝐞𝐪𝐮𝐞𝐧𝐜𝐞 𝐂𝐥𝐚𝐬𝐬𝐢𝐟𝐢𝐜𝐚𝐭𝐢𝐨𝐧 tasks. I also trained a Sentiment Analysis model with stanfordnlp/imdb dataset.

𝐌𝐢𝐧𝐆𝐑𝐔𝐅𝐨𝐫𝐂𝐚𝐮𝐬𝐚𝐥𝐋𝐌:
You can use this class for 𝐂𝐚𝐮𝐬𝐚𝐥 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥 tasks such as GPT, Llama. I also trained an example model with roneneldan/TinyStories dataset. You can fine-tune and use it!

🔗 𝐋𝐢𝐧𝐤𝐬:
Models: suayptalha/mingru-676fe8d90760d01b7955d7ab
GitHub: https://github.com/suayptalha/minGRU-hf
LinkedIn Post: https://www.linkedin.com/posts/suayp-talha-kocabay_mingru-a-suayptalha-collection-activity-7278755484172439552-wNY1

📰 𝐂𝐫𝐞𝐝𝐢𝐭𝐬:
Paper Link: https://arxiv.org/abs/2410.01201

I am thankful to Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio and Hossein Hajimirsadeghi for their papers.

upvoted a paper 4 days ago

Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation

Paper • 2412.15255 • Published 18 days ago • 3

posted an update 4 days ago

Post

1627

~75% on the challenging GPQA with only 40M parameters 🔥🥳

GREAT ACHIEVEMENT ! Or is it ?

This new Work, "Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation", take out the mystery about many models i personally suspected their results. Speacially on leaderboards other than the english one, Like the Open Arabic LLM Leaderbaord OALL/Open-Arabic-LLM-Leaderboard.

The authors of this work, first started by training a model on the GPQA data, which, unsurprisingly, led to the model achieving 100% performance.

Afterward, they trained what they referred to as a 'legitimate' model on legitimate data (MedMCQA). However, they introduced a distillation loss from the earlier, 'cheated' model.

What they discovered was fascinating: the knowledge of GPQA leaked through this distillation loss, even though the legitimate model was never explicitly trained on GPQA during this stage.

This raises important questions about the careful use of distillation in model training, especially when the training data is opaque. As they demonstrated, it’s apparently possible to (intentionally or unintentionally) leak test data through this method.

Find out more: Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation (2412.15255)

1 reply

liked a Space 6 days ago

Running on CPU Upgrade

1.09k

🏢

Anychat

liked 2 models 6 days ago

deepseek-ai/DeepSeek-V3-Base

Updated 4 days ago • 6.64k • 1.1k

deepseek-ai/DeepSeek-V3

Updated 4 days ago • 45.5k • 984

updated a dataset 7 days ago

alielfilali01/fineweb-2-arb_Arab-text-only-2

Viewer • Updated 7 days ago • 57.8M • 39