Edit model card

Malaysian SmolLM2-360M Instruct

Continue finetuning https://huggingface.co/HuggingFaceTB/SmolLM2-360M on highly curated 1.5B tokens Malaysian instruction dataset.

Improvement

  1. Support respond in Manglish, Mandarin, Tamil, Jawi, Johor, Kedah, Kelantan, Pahang, Perak, Sabah, Sarawak, Selangor, Negeri Sembilan and Terengganu.
  2. Able to code in Manglish, Mandarin, Tamil, Jawi, Johor, Kedah, Kelantan, Pahang, Perak, Sabah, Sarawak, Selangor, Negeri Sembilan and Terengganu.
  3. Multi-turn Malaysian context such as related to Malaysian Legislation, politics, religions and languages.
  4. Malaysian role-playing.
  5. Standard RAG.

Still on training.

Downloads last month
17
Safetensors
Model size
409M params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for mesolitica/malaysian-SmolLM2-360M-Instruct

Quantizations
1 model