PJMixers-Archive/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B

---
datasets:
- PJMixers/SmallPrefMix-PreferenceShareGPT
---
![train/rewards](https://huggingface.co/PJMixers/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B-QDoRA/resolve/main/images/rewards.png)
![train/logits](https://huggingface.co/PJMixers/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B-QDoRA/resolve/main/images/logits.png)
![train/logps](https://huggingface.co/PJMixers/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B-QDoRA/resolve/main/images/logps.png)
![train](https://huggingface.co/PJMixers/LLaMa-3-Instruct-SmallPrefMix-ORPO-8B-QDoRA/resolve/main/images/train.png)