Text Generation
GGUF
English
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prose
vivid writing
fiction
roleplaying
bfloat16
swearing
rp
llama3
enhanced quants
max quants
maxcpu quants
horror
mergekit
Inference Endpoints
conversational
Update README.md
Browse files
README.md
CHANGED
@@ -74,7 +74,7 @@ dictate they have their own repos.
|
|
74 |
The Imatrix versions of this model have even lower perplexity (1/2 level of magnitude lower than this model, 1 full level of magnitude
|
75 |
lower than LLama3 Instruct) then both this model and Llama3 Instruct and enhanced output.
|
76 |
|
77 |
-
<B>QUANT Updates Dec 21 2024: Refreshed, Upgraded and
|
78 |
|
79 |
- All quants have been "refreshed", quanted with the lastest LLAMACPP improvements : Better instruction following, output generation across all quants.
|
80 |
- All quants have also been upgraded with "more bits" for output tensor (all set at Q8_0) and embed for better performance (this is in addition to the "refresh")
|
|
|
74 |
The Imatrix versions of this model have even lower perplexity (1/2 level of magnitude lower than this model, 1 full level of magnitude
|
75 |
lower than LLama3 Instruct) then both this model and Llama3 Instruct and enhanced output.
|
76 |
|
77 |
+
<B>QUANT Updates Dec 21 2024: Refreshed, Upgraded and New quants:</B>
|
78 |
|
79 |
- All quants have been "refreshed", quanted with the lastest LLAMACPP improvements : Better instruction following, output generation across all quants.
|
80 |
- All quants have also been upgraded with "more bits" for output tensor (all set at Q8_0) and embed for better performance (this is in addition to the "refresh")
|