This is a frankenmerge of Mihaiii/Pallas-0.5, created with mergekit.
It works well with long system prompts.
It is not a general-purpose model: it is intended for reasoning and text comprehension, not for tasks such as storytelling.
This model is trained on a private dataset.
Prompt Format:
SYSTEM: <ANY SYSTEM CONTEXT>
USER:
ASSISTANT:
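A minimal sketch of assembling a prompt in the format above. The helper name `build_prompt` is hypothetical, and the exact whitespace (one space after each label, newline between turns, nothing after `ASSISTANT:`) is an assumption, since the card only shows the three labels.

```python
def build_prompt(system: str, user: str) -> str:
    """Assemble a SYSTEM/USER/ASSISTANT prompt.

    Assumption: turns are separated by a single newline and the prompt
    ends right after "ASSISTANT:" so the model continues from there.
    """
    return f"SYSTEM: {system}\nUSER: {user}\nASSISTANT:"


if __name__ == "__main__":
    prompt = build_prompt(
        system="Answer using only the context provided.",
        user="What is the main claim of the text?",
    )
    print(prompt)
```

The returned string can be passed as-is to whichever inference backend is in use; if outputs look off, the whitespace assumption is the first thing to check.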
Merge config:
slices:
  - sources:
    - model: "Mihaiii/Pallas-0.5"
      layer_range: [0, 60]
  - sources:
    - model: "Mihaiii/Pallas-0.5"
      layer_range: [58, 60]
  - sources:
    - model: "Mihaiii/Pallas-0.5"
      layer_range: [55, 56]
merge_method: passthrough
dtype: bfloat16
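To see what this config stacks, the short sketch below expands the three slices into the order of source layers in the merged model. It assumes that `layer_range` is half-open (`[start, end)`, like Python's `range`), which is consistent with `[0, 60]` covering a 60-layer base model; treat that as an assumption rather than a statement about mergekit internals. The function name `stacked_layers` is hypothetical.

```python
# Slices copied from the merge config, as (start, end) pairs.
SLICES = [(0, 60), (58, 60), (55, 56)]


def stacked_layers(slices):
    """Return source-layer indices in merged order.

    Assumption: each range is half-open, i.e. `end` is excluded.
    """
    order = []
    for start, end in slices:
        order.extend(range(start, end))
    return order


if __name__ == "__main__":
    layers = stacked_layers(SLICES)
    print(len(layers))   # total layers in the merged model
    print(layers[-3:])   # the layers appended after the full base stack
```

Under that assumption, the merged model is the full base stack followed by repeated copies of layers 58, 59, and 55.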
Quants:
TheBloke/Pallas-0.5-frankenmerge-GGUF