satpalsr posted an update on Feb 1, 2024
Introducing Gajendra!

An early release of our 7B Hindi-Hinglish-English Instruction fine-tuned language model.

Model: BhabhaAI/Gajendra-v0.1

We are also exploring ways to identify which examples can be translated from English to Hindi, and we are releasing initial versions of both the dataset and the model for this task.

Model: BhabhaAI/Mistral-translation-classify
Dataset: BhabhaAI/translation-classify

Looking forward to collaborating with the open-source community to accelerate the development and release of Hindi LLMs.

Any plans to make the SFT dataset public?

Yes 💯, we'll make both datasets and the code used for filtering public, though probably only by the end of the month. We are working on scaling the dataset further and then filtering it.