Model Card for TokenBender/navarna_dpo_qlora

DPO QLoRA adapter for Navarna. For the SFT QLoRA merged model, refer to https://huggingface.co/TokenBender/navarna_hindi_merged.

The final model, with the DPO adapter merged in, is available at https://huggingface.co/TokenBender/navaran_hindi_dpo_merged
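
The adapter can in principle be applied to the SFT merged model at load time instead of using the pre-merged checkpoint. The following is a minimal sketch and not a tested recipe from the author. It assumes that the adapter was trained on top of `TokenBender/navarna_hindi_merged`, that the adapter repository is `TokenBender/navarna_dpo_qlora`, and that the base model is a causal language model loadable with `AutoModelForCausalLM`. The helper name `load_dpo_model` is hypothetical.

```python
# Sketch: attach the DPO QLoRA adapter to the SFT merged model with PEFT.
# Assumptions (not confirmed by the model card): the adapter was trained on
# top of BASE_MODEL_ID, and the base is a causal LM.
BASE_MODEL_ID = "TokenBender/navarna_hindi_merged"
ADAPTER_ID = "TokenBender/navarna_dpo_qlora"  # assumed adapter repository


def load_dpo_model(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID, merge=False):
    """Hypothetical helper: load the base model and apply the adapter."""
    # Imported lazily so the module can be imported without these packages.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id)

    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base, adapter_id)

    if merge:
        # Fold the LoRA weights into the base weights and drop the wrapper.
        model = model.merge_and_unload()
    return model, tokenizer
```

Calling `load_dpo_model(merge=True)` downloads both repositories and should produce a model comparable to the pre-merged checkpoint linked above, provided the assumptions about the base model hold.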

