Update README.md
README.md
CHANGED
@@ -2,6 +2,7 @@
 base_model:
 - maldv/badger-lambda-llama-3-8b
 - maldv/llama-3-fantasy-writer-8b
+- dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5
 library_name: transformers
 tags:
 - fourier
@@ -14,6 +15,8 @@ license: cc-by-nc-4.0
 
 Badger Writer is a *normalized fourier task addition* of maldv/badger-lambda-llama-3-8b and maldv/llama-3-fantasy-writer-8b.
 
+I also used the first and last layer directly from dreamgen-preview/opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5 due to the obvious advantages. I didn't train either the lm_head or embed_token layers on the fantasy-writer, but opus is part of lambda, so they all fit nicely together.
+
 Rep-pen 1.1 ; Min-p 0.01 ; Temp 0.7 ; Dynatemp 0.4 ; 32k context ; llama 3 instruct template
 
 ```
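The merge method named in the card, *normalized fourier task addition*, is maldv's own technique and its real implementation is not shown here. As a rough, hypothetical sketch of the general idea — combine per-model task vectors (fine-tune minus base) in the frequency domain, normalize each so no single model dominates, then add the result back onto the base — something like the following could apply (the function name, FFT choice, and normalization scheme are all assumptions, not the actual merge code):

```python
import numpy as np

def fourier_task_addition(base, finetunes):
    """Hypothetical sketch of a normalized fourier task-addition merge
    for a single weight matrix. Not maldv's actual implementation.

    base:      base-model weight matrix (2-D ndarray)
    finetunes: list of fine-tuned weight matrices of the same shape
    """
    merged_spec = np.zeros(base.shape, dtype=complex)
    for ft in finetunes:
        task = ft - base                  # task vector for this fine-tune
        spec = np.fft.fft2(task)          # move to the frequency domain
        norm = np.linalg.norm(spec)
        if norm > 0:
            spec = spec / norm            # normalize each task's contribution
        merged_spec += spec

    # back to weight space; rescale to the average task-vector magnitude
    merged = np.fft.ifft2(merged_spec).real
    avg_norm = np.mean([np.linalg.norm(ft - base) for ft in finetunes])
    merged *= avg_norm / max(np.linalg.norm(merged), 1e-12)
    return base + merged
```

In a real merge this would run per-layer over the checkpoints, with the first/last layers (lm_head, embed_tokens) copied directly from the donor model as the card describes rather than merged.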