VAGO solutions

company

https://vago-solutions.ai

VAGOsolutions

AI & ML interests

We intend to release the best German large language model on the market.

Recent Activity

DaryoushV updated a collection about 1 month ago

SauerkrautLM-qwen

DavidGF updated a dataset about 1 month ago

VAGOsolutions/SauerkrautLM-Fermented-Irrelevance-GER-DPO

VAGOsolutions's activity

DaryoushV

updated a collection about 1 month ago

SauerkrautLM-qwen

Collection

Find our empowered SauerkrautLM versions of various Qwen models here • 4 items • Updated Nov 9

DavidGF

updated 2 datasets about 1 month ago

VAGOsolutions/SauerkrautLM-Fermented-Irrelevance-GER-DPO

Viewer • Updated Nov 9 • 2k • 70 • 4

VAGOsolutions/SauerkrautLM-Fermented-GER-DPO

Viewer • Updated Nov 9 • 3.31k • 86 • 7

DavidGF

in VAGOsolutions/SauerkrautLM-v2-14b-SFT about 1 month ago

Adding Evaluation Results

#1 opened about 1 month ago by

leaderboard-pr-bot

DavidGF

in VAGOsolutions/SauerkrautLM-v2-14b-DPO about 1 month ago

Adding Evaluation Results

#1 opened about 1 month ago by

leaderboard-pr-bot

DavidGF

updated a Space about 2 months ago

Running

👁

README

DavidGF

posted an update about 2 months ago

Post

🎉 Celebrating One Year of #SauerkrautLM with Two Groundbreaking Releases!

We're thrilled to announce the release of SauerkrautLM-v2-14b in two specialized versions: VAGOsolutions/SauerkrautLM-v2-14b-SFT and VAGOsolutions/SauerkrautLM-v2-14b-DPO. Built on the robust Qwen2.5-14B foundation, these models represent a significant leap forward in multilingual AI capabilities.

🔬 Technical Breakthroughs:
💠 Innovative three-phase fine-tuning approach
💠 Two Spectrum SFT phases followed by one Spectrum DPO phase for enhanced performance
💠 Balance of German and English language capabilities
💠 Advanced function calling - almost on par with Claude-3.5-Sonnet-20240620

🇩🇪 German Language Excellence:
What sets this release apart is our unique achievement in simultaneously improving both German and English capabilities. Through our specialized training approach with over 1.2B tokens across two phases, we've managed to:
💠 Enhance German language understanding and generation (the SFT version is stronger here than the DPO version)
💠 Maintain authentic German linguistic nuances
💠 Improve cross-lingual capabilities
💠 Preserve cultural context awareness

📊 Training Innovation:
Across the three phases we targeted specific shares of the model's layers (15%, 20%, and 25%) and trained them on carefully curated datasets, including:
💠 Mathematics-focused content (proprietary classifier-selected)
💠 High-quality German training data
💠 Specialized function calling datasets
💠 Premium multilingual content
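
To make the layer percentages above concrete, here is a minimal sketch of picking a fixed share of layers to train while the rest stay frozen. The post does not say how layers were ranked or which percentage belongs to which phase, so the function name `select_trainable_layers`, the layer names, and the scores below are all hypothetical placeholders, not the actual SauerkrautLM recipe.

```python
def select_trainable_layers(scores, fraction):
    """Return the names of the top `fraction` of layers, ranked by score.

    `scores` maps a layer name to a ranking score. The scoring criterion is
    an assumption; the post does not describe how layers were ranked.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    # Round to the nearest whole layer, but always keep at least one.
    count = max(1, round(len(scores) * fraction))
    ranked = sorted(scores, key=scores.get, reverse=True)
    return set(ranked[:count])


# Made-up scores for a toy 20-layer model (not real measurements).
toy_scores = {f"layers.{i}.mlp": (i * 7) % 20 for i in range(20)}

for fraction in (0.15, 0.20, 0.25):
    trainable = select_trainable_layers(toy_scores, fraction)
    frozen = len(toy_scores) - len(trainable)
    print(f"{fraction:.0%}: train {len(trainable)} layers, freeze {frozen}")
```

In a real training run, the returned names would decide which parameters receive gradient updates; every other layer would be left unchanged.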

🎁 Community Contribution:
We're also releasing two new datasets in a few days:
1️⃣ SauerkrautLM-Fermented-GER-DPO: 3,300 high-quality German training samples
2️⃣ SauerkrautLM-Fermented-Irrelevance-GER-DPO: 2,000 specialized samples for optimized function call irrelevance handling

Thank you to our incredible community and partners who have supported us throughout this journey. Here's to another year of AI innovation! 🚀