bhavinjawade commited on
Commit
d22aa6e
1 Parent(s): 6a27cf0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -10,6 +10,8 @@ datasets:
10
  This model card is instruction finetuned version of `upstage/SOLAR-10.7B-Instruct-v1.0` model. Trained on the Intel DPO Orca dataset using LoRA. Though it should be noted SOLAR-10.7B paper states that the
11
  original model for alignment was trained on Intel ORCA DPO pairs. Retraining using DPO and LoRA shows slight (<1%) improvement on OpenLLM Leaderboard benchmarks against `SOLAR 10.7B-Instruct` and significant over `SOLAR 10.7B`
12
 
 
 
13
  ## How to Use This Model
14
 
15
  To use the model `bhavinjawade/SOLAR-10B-OrcaDPO-Jawade`, follow these steps:
 
10
  This model card is instruction finetuned version of `upstage/SOLAR-10.7B-Instruct-v1.0` model. Trained on the Intel DPO Orca dataset using LoRA. Though it should be noted SOLAR-10.7B paper states that the
11
  original model for alignment was trained on Intel ORCA DPO pairs. Retraining using DPO and LoRA shows slight (<1%) improvement on OpenLLM Leaderboard benchmarks against `SOLAR 10.7B-Instruct` and significant over `SOLAR 10.7B`
12
 
13
+ ![model_card_image](SOLAR_ORCA.png)
14
+
15
  ## How to Use This Model
16
 
17
  To use the model `bhavinjawade/SOLAR-10B-OrcaDPO-Jawade`, follow these steps: