jukofyork
/

creative-writer-v0.1-alfa-35b

Text Generation

creative-writing

creative-writer

multiplicative-lora

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jukofyork commited on Oct 27, 2024

Commit

5571b53

·

verified ·

1 Parent(s): 08b8aec

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ Other experimental models which attempt to encourage more diverse/creative text
 - [creative-writer-v0.1-bravo-35b](https://huggingface.co/jukofyork/creative-writer-v0.1-bravo-35b) - Scaled the pre-softmax logits by `1.1` during training (and then reset after training).
 - [creative-writer-v0.1-charlie-35b](https://huggingface.co/jukofyork/creative-writer-v0.1-charlie-35b) - Scaled the pre-softmax logits by `0.9` during training (and didn't reset after training).
-<details> <summary>Click to see some (brief) tests on effect of these changes</summary>
 #### Using `command-r-3-2024` with `temperature = 1` and `min-p = 0.01`:
@@ -76,6 +76,14 @@ Other experimental models which attempt to encourage more diverse/creative text
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/65995c45539c808e84c38bf1/PwnDkctZ273zMHC-Ta_YU.png)
 </details>
 ---

 - [creative-writer-v0.1-bravo-35b](https://huggingface.co/jukofyork/creative-writer-v0.1-bravo-35b) - Scaled the pre-softmax logits by `1.1` during training (and then reset after training).
 - [creative-writer-v0.1-charlie-35b](https://huggingface.co/jukofyork/creative-writer-v0.1-charlie-35b) - Scaled the pre-softmax logits by `0.9` during training (and didn't reset after training).
+<details> <summary>Click to see some (brief) tests on the effect of these changes</summary>
 #### Using `command-r-3-2024` with `temperature = 1` and `min-p = 0.01`:
 ![image.png](https://cdn-uploads.huggingface.co/production/uploads/65995c45539c808e84c38bf1/PwnDkctZ273zMHC-Ta_YU.png)
+---
+**Observations**:
+- Up-scaling of the pre-softmax logits during training used by `creative-writer-v0.1-bravo:35b` looks the most promising.
+- Down-scaling of the pre-softmax logits during training used by `creative-writer-v0.1-charlie:35b` looks to be very similar to inference-time temperature adjustment.
+- It may be better to just leave the pre-softmax logits up-scaled after training and then let the user perform inference-time temperature adjustment.
 </details>
 ---