Youliang/llama3-70b-lora-derta

Generated from Trainer


Youliang committed on Jul 20

Commit 56164c2 • 1 Parent(s): 0c266cb

Update README.md

Files changed (1)

README.md +22 -1


@@ -17,7 +17,7 @@ This model is a fine-tuned version of [meta-llama/Meta-Llama-3-70B](https://hugg
 ## Model description
-More information needed
+Please refer to the paper [Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training](https://arxiv.org/abs/2407.09121) and GitHub [DeRTa](https://github.com/RobustNLP/DeRTa).
 ## Intended uses & limitations
@@ -45,6 +45,27 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: cosine
 - num_epochs: 2.0
+The LoRA config is:
+```
+{
+  "lora_r": 96,
+  "lora_alpha": 16,
+  "lora_dropout": 0.05,
+  "lora_target_modules": [
+    "q_proj",
+    "v_proj",
+    "k_proj",
+    "o_proj",
+    "gate_proj",
+    "down_proj",
+    "up_proj",
+    "w1",
+    "w2",
+    "w3"
+  ]
+}
+```
 ### Training results
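
For readers who want to reuse the adapter settings added in this commit, the following is a minimal sketch, not the authors' training code. It parses the same JSON, renames the keys to the keyword names accepted by `peft.LoraConfig` (`r`, `lora_alpha`, `lora_dropout`, `target_modules`), and derives the scaling factor under the standard LoRA convention of `lora_alpha / lora_r`. The helper name `to_peft_kwargs` is hypothetical, and the README does not confirm that the config is consumed by the `peft` library; both are assumptions made for illustration.

```python
import json

# LoRA settings copied verbatim from the README diff above.
LORA_CONFIG_JSON = """
{
  "lora_r": 96,
  "lora_alpha": 16,
  "lora_dropout": 0.05,
  "lora_target_modules": [
    "q_proj", "v_proj", "k_proj", "o_proj",
    "gate_proj", "down_proj", "up_proj",
    "w1", "w2", "w3"
  ]
}
"""


def to_peft_kwargs(cfg: dict) -> dict:
    """Hypothetical helper: rename README keys to peft.LoraConfig keyword names."""
    return {
        "r": cfg["lora_r"],
        "lora_alpha": cfg["lora_alpha"],
        "lora_dropout": cfg["lora_dropout"],
        "target_modules": list(cfg["lora_target_modules"]),
    }


cfg = json.loads(LORA_CONFIG_JSON)
kwargs = to_peft_kwargs(cfg)

# Assumption: standard LoRA scaling (alpha / r), not a rank-stabilized variant.
scaling = cfg["lora_alpha"] / cfg["lora_r"]

print(kwargs)
print(f"scaling = {scaling:.4f}")

# If peft is installed, the kwargs could be passed on like this (untested here):
#   from peft import LoraConfig
#   lora_config = LoraConfig(task_type="CAUSAL_LM", **kwargs)
```

Under that assumption, a rank of 96 with an alpha of 16 gives a scaling factor of 16 / 96, so the low-rank update is weighted at roughly one sixth before being added to the frozen weights.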