Tags: Text Generation · Transformers · PyTorch · English · llama · text-generation-inference
Committed by TheBloke
Commit 50bfc35 · Parent: 555d5ab

Update README.md

Files changed (1): README.md (+1 -1)
README.md CHANGED
@@ -21,7 +21,7 @@ license: other
 
 These files are fp16 model files for [Allen AI's Tulu 30B](https://huggingface.co/allenai/tulu-30b) merged with [Kaio Ken's SuperHOT 30B 8K LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test) to produce a model capable of 8K context.
 
-[Kaio Ken's SuperHOT 30B LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test) is merged on to the base model to produce a model capable of 8K context, via the provided monkey patch (`llama_rope_scaled_monkey_patch.py`)
+[Kaio Ken's SuperHOT 30B LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test) is merged on to the base model, and then 8K context can be achieved during inference by using the provided monkey patch (`llama_rope_scaled_monkey_patch.py`)
 
 Alternatively, `config.json` can be modified to allow the monkey patch to load via trust_remote_code=True. I plan to update this repo shortly to include that method.
 
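For anyone wondering what "using the provided monkey patch" looks like in practice, here is a rough sketch. It is not the repo's `llama_rope_scaled_monkey_patch.py` verbatim: the class internals, the 0.25 scaling factor, and the repo id are assumptions based on kaiokendev's published RoPE position-compression approach, and the `LlamaRotaryEmbedding` interface shown matches the transformers releases of that period, not current ones.

```python
import torch
import transformers

class ScaledLlamaRotaryEmbedding(torch.nn.Module):
    """Rotary embeddings with positions compressed 4x, so a model trained
    at 2048 tokens can address 8192 positions. Sketch, not the actual patch."""

    def __init__(self, dim, max_position_embeddings=2048, base=10000, device=None):
        super().__init__()
        inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2, device=device).float() / dim))
        # Cache positions out to 8K, squeezed into the 0-2048 range the model
        # was trained on (scale = 2048 / 8192 = 0.25; factor assumed here).
        t = torch.arange(8192, device=device, dtype=inv_freq.dtype) * 0.25
        freqs = torch.einsum("i,j->ij", t, inv_freq)
        emb = torch.cat((freqs, freqs), dim=-1)
        self.register_buffer("cos_cached", emb.cos()[None, None, :, :], persistent=False)
        self.register_buffer("sin_cached", emb.sin()[None, None, :, :], persistent=False)

    def forward(self, x, seq_len=None):
        # Same return contract as the original class in that transformers era.
        return (
            self.cos_cached[:, :, :seq_len, ...].to(x.dtype),
            self.sin_cached[:, :, :seq_len, ...].to(x.dtype),
        )

# The patch must be applied before the model is built so every layer uses it.
transformers.models.llama.modeling_llama.LlamaRotaryEmbedding = ScaledLlamaRotaryEmbedding

model = transformers.AutoModelForCausalLM.from_pretrained(
    "TheBloke/Tulu-30B-SuperHOT-8K-fp16",  # repo id assumed for illustration
    torch_dtype=torch.float16,
    device_map="auto",
)
```

The trust_remote_code alternative mentioned in the README would instead point `config.json` at the patch file, so a plain `from_pretrained(..., trust_remote_code=True)` call picks up the scaled embeddings without any manual patching.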