InferenceIllusionist
commited on
Commit
•
01bcde9
1
Parent(s):
1932bfa
Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,7 @@ tags:
|
|
9 |
- mistral
|
10 |
- trl
|
11 |
- sft
|
|
|
12 |
base_model: unsloth/mistral-7b-v0.2-bnb-4bit
|
13 |
datasets:
|
14 |
- visheratin/realworldqa
|
@@ -18,19 +19,47 @@ datasets:
|
|
18 |
<img src="https://i.imgur.com/Pf53ms5.jpeg" width="400"/>
|
19 |
|
20 |
|
21 |
-
An experiment with the goal of reducing
|
22 |
-
|
|
|
|
|
|
|
23 |
|
24 |
-
Release Notes:
|
25 |
* v0.1 - Initial Release
|
26 |
-
* v0.2 (Current)- Updating base model to official Mistral-7b fp16 release, refinements to dataset and instruction formating
|
27 |
|
|
|
28 |
|
29 |
Mistral-7b-02 base model was fine-tuned using the [RealWorldQA dataset](https://huggingface.co/datasets/visheratin/realworldqa), originally provided by the X.Ai Team here: https://x.ai/blog/grok-1.5v
|
30 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
31 |
- **Developed by:** InferenceIllusionist
|
32 |
- **License:** apache-2.0
|
33 |
- **Finetuned from model :** mistral-community/Mistral-7B-v0.2
|
34 |
|
|
|
35 |
|
36 |
-
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|
|
|
9 |
- mistral
|
10 |
- trl
|
11 |
- sft
|
12 |
+
- gguf
|
13 |
base_model: unsloth/mistral-7b-v0.2-bnb-4bit
|
14 |
datasets:
|
15 |
- visheratin/realworldqa
|
|
|
19 |
<img src="https://i.imgur.com/Pf53ms5.jpeg" width="400"/>
|
20 |
|
21 |
|
22 |
+
An experiment with the goal of reducing hallucinations in [VQA](https://huggingface.co/tasks/visual-question-answering)
|
23 |
+
|
24 |
+
First in a series of experiments centering around fine-tuning for image captioning.
|
25 |
+
|
26 |
+
<h1>Release Notes</h1>
|
27 |
|
|
|
28 |
* v0.1 - Initial Release
|
29 |
+
* <b>v0.2</b> (Current)- Updating base model to official Mistral-7b fp16 release, refinements to dataset and instruction formating
|
30 |
|
31 |
+
<h2>Background & Methodology</h2>
|
32 |
|
33 |
Mistral-7b-02 base model was fine-tuned using the [RealWorldQA dataset](https://huggingface.co/datasets/visheratin/realworldqa), originally provided by the X.Ai Team here: https://x.ai/blog/grok-1.5v
|
34 |
|
35 |
+
<h1>Vision Results</h1>
|
36 |
+
|
37 |
+
Example 1
|
38 |
+
<img src="https://i.imgur.com/E9mS4Xb.jpeg" width="400"/>
|
39 |
+
Example 2
|
40 |
+
<img src="https://i.imgur.com/SmTz1Yd.jpeg" width="400"/>
|
41 |
+
|
42 |
+
* Experiment yielded model that provides shorter, less verbose output for questions about pictures
|
43 |
+
* The likelihood of hallucinations in output has decreased, however, the model can still be easily influenced to be inaccurate by the user
|
44 |
+
* Best suited for captioning use cases that require concise descriptions and low token counts
|
45 |
+
* This model lacks the conversational prose of Excalibur-7b-DPO and is much "drier" in tone
|
46 |
+
|
47 |
+
<b>Requires additional mmproj file. You have two options for vision functionality (available inside this repo):</b>
|
48 |
+
1. [Quantized - Limited VRAM Option (197mb)](https://huggingface.co/InferenceIllusionist/Mistral-RealworldQA-v0.2-7b-SFT/resolve/main/mistral-7b-mmproj-v1.5-Q4_1.gguf?download=true)
|
49 |
+
2. [Unquantized - Premium Option / Best Quality (596mb)](https://huggingface.co/InferenceIllusionist/Mistral-RealworldQA-v0.2-7b-SFT/resolve/main/mmproj-model-f16.gguf?download=true)
|
50 |
+
|
51 |
+
Select the gguf file of your choice in [Koboldcpp](https://github.com/LostRuins/koboldcpp/releases/) as usual, then make sure to choose the mmproj file above in the LLaVA mmproj field of the model submenu:
|
52 |
+
<img src="https://i.imgur.com/x8vqH29.png" width="425"/>
|
53 |
+
|
54 |
+
## Prompt Format
|
55 |
+
Use Alpaca for best results.
|
56 |
+
|
57 |
+
|
58 |
+
## Other info
|
59 |
- **Developed by:** InferenceIllusionist
|
60 |
- **License:** apache-2.0
|
61 |
- **Finetuned from model :** mistral-community/Mistral-7B-v0.2
|
62 |
|
63 |
+
This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
|
64 |
|
65 |
+
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
|