Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages • Paper • 2411.12240 • Published 26 days ago • 6
LLäMmlein: Compact and Competitive German-Only Language Models from Scratch • Paper • 2411.11171 • Published 27 days ago • 8
Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement • Paper • 2412.04003 • Published 10 days ago • 9