Malaysian SmolLM2-360M Instruct
Continue finetuning https://huggingface.co/HuggingFaceTB/SmolLM2-360M on highly curated 1.5B tokens Malaysian instruction dataset.
Improvement
- Support respond in Manglish, Mandarin, Tamil, Jawi, Johor, Kedah, Kelantan, Pahang, Perak, Sabah, Sarawak, Selangor, Negeri Sembilan and Terengganu.
- Able to code in Manglish, Mandarin, Tamil, Jawi, Johor, Kedah, Kelantan, Pahang, Perak, Sabah, Sarawak, Selangor, Negeri Sembilan and Terengganu.
- Multi-turn Malaysian context such as related to Malaysian Legislation, politics, religions and languages.
- Malaysian role-playing.
- Standard RAG.
Still on training.
- Downloads last month
- 17