Model Card for TokenBender/navarna_dpo_qlora
DPO QLoRA adapter for Navarna. It was trained on top of the SFT QLoRA merged model: https://huggingface.co/TokenBender/navarna_hindi_merged
The final model, with this DPO adapter merged in, is available at https://huggingface.co/TokenBender/navaran_hindi_dpo_merged
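A minimal sketch of how an adapter like this is typically applied to its base model. It assumes the repository holds a standard PEFT-format LoRA adapter, that the base model loads with `AutoModelForCausalLM`, and that `transformers`, `peft`, `torch`, and `accelerate` are installed. None of this is confirmed by the card. The function name `load_dpo_model`, the `float16` dtype, and `device_map="auto"` are illustrative choices, not the author's settings.

```python
# Repository IDs taken from this card.
BASE_MODEL_ID = "TokenBender/navarna_hindi_merged"
ADAPTER_ID = "TokenBender/navarna_dpo_qlora"


def load_dpo_model(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_ID, merge=False):
    """Load the base model and attach the DPO adapter (hypothetical helper).

    Assumes the adapter repo is in PEFT format; this is not stated on the card.
    """
    # Heavy dependencies are imported lazily so that defining this function
    # does not require them to be installed.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(
        base_id,
        torch_dtype=torch.float16,  # assumption: half precision is acceptable
        device_map="auto",          # requires the accelerate package
    )

    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base, adapter_id)

    if merge:
        # Fold the LoRA weights into the base model and drop the PEFT wrapper.
        model = model.merge_and_unload()

    return model, tokenizer
```

Calling `load_dpo_model()` downloads both repositories and returns the adapted model with its tokenizer. Passing `merge=True` folds the adapter into the base weights, which should roughly correspond to the merged checkpoint linked above, though the card does not describe how that checkpoint was produced.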
Model tree for TokenBender/navarna_dpo_qlora
Base model: TokenBender/navarna_hindi_merged