Updating model summary
Browse files
README.md
CHANGED
@@ -3,6 +3,7 @@ base_model: alpindale/Mistral-7B-v0.2-hf
|
|
3 |
library_name: transformers
|
4 |
language:
|
5 |
- en
|
|
|
6 |
tags:
|
7 |
- generated_from_trainer
|
8 |
- quantized
|
@@ -12,6 +13,14 @@ tags:
|
|
12 |
- endpoints_compatible
|
13 |
- text-generation-inference
|
14 |
- chatml
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
model-index:
|
16 |
- name: workspace/dolphin-2.8-mistral-7b
|
17 |
results: []
|
@@ -33,16 +42,29 @@ prompt_template: '<|im_start|>system
|
|
33 |
|
34 |
'
|
35 |
---
|
36 |
-
# cognitivecomputations/dolphin-2.8-mistral-7b-v02 AWQ
|
37 |
|
38 |
- Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
|
39 |
- Original model: [dolphin-2.8-mistral-7b-v02](https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02)
|
40 |
|
|
|
|
|
41 |
## Model Summary
|
42 |
|
43 |
-
|
44 |
-
|
45 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
46 |
|
47 |
## How to use
|
48 |
|
|
|
3 |
library_name: transformers
|
4 |
language:
|
5 |
- en
|
6 |
+
license: apache-2.0
|
7 |
tags:
|
8 |
- generated_from_trainer
|
9 |
- quantized
|
|
|
13 |
- endpoints_compatible
|
14 |
- text-generation-inference
|
15 |
- chatml
|
16 |
+
datasets:
|
17 |
+
- cognitivecomputations/dolphin
|
18 |
+
- cognitivecomputations/dolphin-coder
|
19 |
+
- cognitivecomputations/samantha-data
|
20 |
+
- jondurbin/airoboros-2.2.1
|
21 |
+
- teknium/openhermes-2.5
|
22 |
+
- m-a-p/Code-Feedback
|
23 |
+
- m-a-p/CodeFeedback-Filtered-Instruction
|
24 |
model-index:
|
25 |
- name: workspace/dolphin-2.8-mistral-7b
|
26 |
results: []
|
|
|
42 |
|
43 |
'
|
44 |
---
|
45 |
+
# cognitivecomputations/dolphin-2.8-mistral-7b-v02 🐬 AWQ
|
46 |
|
47 |
- Model creator: [cognitivecomputations](https://huggingface.co/cognitivecomputations)
|
48 |
- Original model: [dolphin-2.8-mistral-7b-v02](https://huggingface.co/cognitivecomputations/dolphin-2.8-mistral-7b-v02)
|
49 |
|
50 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/63111b2d88942700629f5771/ldkN1J0WIDQwU4vutGYiD.png" width="600" />
|
51 |
+
|
52 |
## Model Summary
|
53 |
|
54 |
+
My appreciation for the sponsors of Dolphin 2.8:
|
55 |
+
- [Crusoe Cloud](https://crusoe.ai/) - provided excellent on-demand 10xL40S node
|
56 |
+
- [Winston Sou](https://twitter.com/WinsonDabbles) - Along with a generous anonymous sponsor, donated a massive personally owned compute resource!
|
57 |
+
- [Abacus AI](https://abacus.ai/) - my employer and partner in many things.
|
58 |
+
|
59 |
+
This model is based on [Mistral-7b-v0.2](https://huggingface.co/alpindale/Mistral-7B-v0.2-hf) a new base model released by MistralAI on March 23, 2024 but they have not yet published on HuggingFace. Thanks to @alpindale for converting / publishing.
|
60 |
+
|
61 |
+
The base model has 32k context, and the full-weights fine-tune was with 16k sequence lengths.
|
62 |
+
|
63 |
+
It took 3 days on 10x L40S provided by [Crusoe Cloud](https://crusoe.ai/)
|
64 |
+
|
65 |
+
Dolphin-2.8 has a variety of instruction, conversational, and coding skills.
|
66 |
+
|
67 |
+
This model is uncensored. I have filtered the dataset to remove alignment and bias. This makes the model more compliant. You are advised to implement your own alignment layer before exposing the model as a service. It will be highly compliant to any requests, even unethical ones. Please read my blog post about uncensored models. https://erichartford.com/uncensored-models You are responsible for any content you create using this model. Enjoy responsibly.
|
68 |
|
69 |
## How to use
|
70 |
|