athirdpath
/

Orca-2-13b-Alpaca-Uncensored

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

athirdpath commited on Nov 27, 2023

Commit

7297758

·

1 Parent(s): 7a246af

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -4,5 +4,7 @@ license: other
 license_name: microsoft-research-license
 ---
 This model is a fine-tuned version of microsoft/Orca-2-13b on a subset of the Vezora/Mini_Orca_Uncencored_Alpaca dataset, with some particularly spicy prompts added to reduce the risk of rejections.
-Only the q_proj and k_proj modules were targeted and a low rank (8) was used, in hopes of containing the adjustments to the prompt format and alignment.
 I'll test it tomorrow when my GGUF quants are done.

 license_name: microsoft-research-license
 ---
 This model is a fine-tuned version of microsoft/Orca-2-13b on a subset of the Vezora/Mini_Orca_Uncencored_Alpaca dataset, with some particularly spicy prompts added to reduce the risk of rejections.
+Only the q_proj and k_proj modules were targeted and a low rank (8) was used, in hopes of containing the adjustments to the prompt format and alignment. This is promising on paper, with the training's per-step loss averaging <0.9 for the last third of the run.
 I'll test it tomorrow when my GGUF quants are done.