aifeifei798 committed on
Commit
3fdc11b
1 Parent(s): 218834d

Upload README.md

Files changed (1)
  1. README.md +3 -7
README.md CHANGED
@@ -14,11 +14,7 @@ tags:
 - Lewdiculous's superb gguf version, thank you for your conscientious and responsible dedication.
 - https://huggingface.co/LWDCLS/llama3-8B-DarkIdol-2.2-Uncensored-1048K-GGUF-IQ-Imatrix-Request
 
-# fast quantizations
-- The difference with normal quantizations is that I quantize the output and embed tensors to f16.and the other tensors to 15_k,q6_k or q8_0.This creates models that are little or not degraded at all and have a smaller size.They run at about 3-6 t/sec on CPU only using llama.cpp And obviously faster on computers with potent GPUs
-- https://huggingface.co/ZeroWw/llama3-8B-DarkIdol-2.2-Uncensored-1048K-GGUF
-- More models here: https://huggingface.co/RobertSinclair
-
+
 ## Why 1048K?
 Due to the optimization of the preferred model, its performance is excellent across the range of 2000-1048K. Personal usage scenarios, such as 8186, 32K, etc., are insufficient. My primary role involves managing virtual idol Twitter accounts and assisting with singing, etc. A good conversation can be very lengthy, and sometimes even 32K is not enough. Imagine having a heated chat with your virtual girlfriend, only for it to abruptly cut off—that feeling is too painful.
 
@@ -39,8 +35,8 @@ The module combination has been readjusted to better fulfill various roles and h
 - DarkIdol:Roles that you can imagine and those that you cannot imagine.
 - Roleplay
 - Specialized in various role-playing scenarios
-- more look at test role. (https://huggingface.co/aifeifei798/llama3-8B-DarkIdol-1.2/resolve/main/test)
-- more look at LM Studio presets (https://huggingface.co/aifeifei798/llama3-8B-DarkIdol-1.2/resolve/main/config-presets)
+- more look at test role. (https://huggingface.co/aifeifei798/llama3-8B-DarkIdol-1.2/tree/main/test)
+- more look at LM Studio presets (https://huggingface.co/aifeifei798/llama3-8B-DarkIdol-1.2/tree/main/config-presets)
 
 ![image/png](https://huggingface.co/aifeifei798/llama3-8B-DarkIdol-2.2-Uncensored-1048K/resolve/main/llama3-8B-DarkIdol-2.2-Uncensored-1048K.png)
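
The removed "fast quantizations" note describes a mixed-precision GGUF scheme: the output and token-embedding tensors are kept at f16 while the remaining tensors are quantized to q5_k, q6_k, or q8_0 (the note's "15_k" is presumably a typo for q5_k). A minimal sketch of how such a mix could be produced with llama.cpp's `llama-quantize` tool (tool name and flags taken from upstream llama.cpp; the input/output filenames are hypothetical):

```shell
# Sketch only: mixed-precision quantization with llama.cpp's llama-quantize.
# Keeps the output and token-embedding tensors at f16 and quantizes the
# remaining tensors to Q6_K, as described in the removed note.
llama-quantize \
  --output-tensor-type f16 \
  --token-embedding-type f16 \
  llama3-8B-DarkIdol-2.2-f16.gguf \
  llama3-8B-DarkIdol-2.2-q6_k-f16-mix.gguf \
  Q6_K
```

Per the removed note, models quantized this way are little or not at all degraded relative to the f16 original while being considerably smaller, running at roughly 3-6 t/sec on CPU-only llama.cpp.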