IlyaGusev
/

vikhr_nemo_orpo_dostoevsky_12b

Model card Files Files and versions Community

vikhr_nemo_orpo_dostoevsky_12b / README.md

IlyaGusev's picture

Update README.md

86ce10d verified 3 months ago

|

history blame contribute delete

600 Bytes

	---
	license: apache-2.0
	datasets:
	- 40umov/dostoevsky
	language:
	- ru
	base_model:
	- Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24
	library_name: unsloth
	---

	Vikhr-Nemo fine-tuned with contrastive Russian literature.

	* Base model: https://huggingface.co/Vikhrmodels/Vikhr-Nemo-12B-Instruct-R-21-09-24
	* Dataset: https://huggingface.co/datasets/40umov/dostoevsky
	* Method: [ORPO](https://arxiv.org/abs/2403.07691)
	* Training config: https://github.com/IlyaGusev/saiga/blob/main/configs/models/doestoevsky_nemo_12b_orpo_m1.json
	* WandB: https://wandb.ai/ilyagusev/rulm_self_instruct/runs/4v4pcgej