Text Generation
Transformers
Safetensors
llama
text-generation-inference
Inference Endpoints

Instruct finetune

#3
by kyynaama - opened

Is there a timeline for chat/instruct finetunes on these models?

LumiOpen org

Unfortunately no timeline to share yet. We're prioritizing and planning that work now.

Any news on an instruct version?

This really needs some effort; there are no capable enough open source translators and also the processor resource needs are out of reach for almost everyone.

If we want some digileap in Finland, we need chat and instruction models now..

This really needs some effort; there are no capable enough open source translators and also the processor resource needs are out of reach for almost everyone.

If we want some digileap in Finland, we need chat and instruction models now..

I would suggest you check out Gemma 3. It's currently probably the best model that is open-weight (that you can actually run on consumer hardware) for Finnish especially. The 27B model at the very least is pretty good at it. 12B model is also not bad, 4B is probably too small and suffers from it. I've somewhat lost hope for anyone training models from scratch to ever match other commercial models like Llama or Gemma/Qwen. (considering the amount of data they are trained on versus models like Viking etc.)

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment