Simeon Emanuilov (s-emanuilov)

AI & ML interests

Software Engineer & Ph.D. candidate | Specializing in ML/DL system development & applying AI to solve real-world business problems.

Recent Activity

Organizations

AI Lab - Sofia University · Scaleflex · UnfoldAI

s-emanuilov's activity

upvoted 2 articles 16 days ago

PaliGemma 2 Mix - New Instruction Vision Language Models by Google • 65

SigLIP 2: A better multilingual vision language encoder • 130
upvoted an article 28 days ago

Merge Large Language Models with mergekit

By mlabonne • 102
replied to their post 29 days ago

Try reducing gpu_memory_utilization to a lower value.
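This likely refers to vLLM's gpu_memory_utilization parameter (also exposed by Unsloth's fast-inference loader), which sets the fraction of total VRAM vLLM reserves for its memory pool. A rough back-of-the-envelope sketch of why lowering it frees headroom (the VRAM and weight sizes below are illustrative estimates, not measurements from the original post):

```python
# Rough illustration: vLLM reserves total_vram * gpu_memory_utilization
# (default 0.9) for weights plus KV cache. Lowering the fraction leaves
# headroom for other processes or training state at the cost of a
# smaller KV cache.

def kv_cache_budget_gb(total_vram_gb: float,
                       gpu_memory_utilization: float,
                       model_weights_gb: float) -> float:
    """Approximate budget left for the KV cache after loading weights."""
    pool = total_vram_gb * gpu_memory_utilization  # vLLM's reserved pool
    return max(pool - model_weights_gb, 0.0)

# On a 48 GB L40S with an 8B model in bf16 (roughly 16 GB of weights):
high = kv_cache_budget_gb(48, 0.9, 16)  # ~27.2 GB left for KV cache
low = kv_cache_budget_gb(48, 0.6, 16)   # ~12.8 GB, freeing ~14 GB of VRAM
```

Dropping the coefficient from 0.9 to 0.6 trades KV-cache capacity (and thus maximum batch/context size) for free VRAM, which is often what resolves out-of-memory errors during GRPO-style training alongside inference.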

replied to their post about 1 month ago

Thank you.

I’m also a big fan of Qwen models. However, in this case, I don’t think they are appropriate because I’m not entirely confident in their capabilities regarding multilingual contexts. That’s why I chose Llama.

Overall, I agree that the Qwen series is excellent for most tasks.

posted an update about 1 month ago

Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth

I wanted to share my experiment with training reasoning models in languages other than English/Chinese.

Using Llama 3.1 8B as the base model, the GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language the base model has some pre-training coverage of.
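The core idea behind GRPO is the group-relative advantage: sample several completions per prompt, score each with reward functions, and normalize the rewards within the group, so no separate value model is needed. A minimal sketch of that normalization step (the reward values are illustrative; trl's GRPOTrainer computes this internally):

```python
# Sketch of GRPO's group-relative advantage: each completion's reward is
# normalized against the other completions sampled for the same prompt.
# Rewards here are illustrative; in practice they come from reward
# functions (e.g. answer correctness, reasoning format).

def group_advantages(rewards: list[float], eps: float = 1e-4) -> list[float]:
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = var ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# Four completions for one prompt: only the last one got the answer right.
advs = group_advantages([0.0, 0.0, 0.5, 1.0])
# The correct completion receives the largest positive advantage.
```

Because the baseline comes from the group itself, the training signal says "better or worse than your siblings for this prompt," which is what makes relatively cheap single-GPU runs like this feasible.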

Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/
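A GRPO run like this typically combines several reward functions. A hypothetical example (not the reward from the linked tutorial) that nudges completions toward the target script, e.g. Cyrillic for Bulgarian:

```python
# Hypothetical reward: fraction of alphabetic characters that are Cyrillic,
# encouraging the model to reason in the target language (Bulgarian here).
# Illustrative sketch only; the actual rewards are in the linked tutorial.

def cyrillic_ratio_reward(completion: str) -> float:
    letters = [c for c in completion if c.isalpha()]
    if not letters:
        return 0.0
    cyrillic = sum(1 for c in letters if "\u0400" <= c <= "\u04ff")
    return cyrillic / len(letters)

cyrillic_ratio_reward("Отговорът е 42")    # 1.0: all letters are Cyrillic
cyrillic_ratio_reward("The answer is 42")  # 0.0: no Cyrillic letters
```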

The model itself: s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

I hope this helps anyone looking to build reasoning models in their language.