Dumpling-Qwen2.5-32B
nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B finetuned on:
- nbeerbower/GreatFirewall-DPO
- nbeerbower/Schule-DPO
- nbeerbower/Purpura-DPO
- nbeerbower/Arkhaios-DPO
- jondurbin/truthy-dpo-v0.1
- antiven0m/physical-reasoning-dpo
- flammenai/Date-DPO-NoAsterisks
- flammenai/Prude-Phi3-DPO
- Atsunori/HelpSteer2-DPO (1,000 samples)
- jondurbin/gutenberg-dpo-v0.1
- nbeerbower/gutenberg2-dpo
- nbeerbower/gutenberg-moderne-dpo.
Method
QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
- Downloads last month
- 22
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.
Model tree for nbeerbower/Dumpling-Qwen2.5-1.5B
Base model
nbeerbower/EVA-abliterated-TIES-Qwen2.5-1.5B