Model Card for Model ID
DPO qlora adapter for Navarna, refer to https://huggingface.co/TokenBender/navarna_hindi_merged for SFT qlora merged model.
And final DPO adapter merged model is - https://huggingface.co/TokenBender/navaran_hindi_dpo_merged
- Downloads last month
- 2
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
HF Inference API was unable to determine this model’s pipeline type.
Model tree for TokenBender/navarna_dpo_qlora
Base model
TokenBender/navarna_hindi_merged