InferenceIllusionist commited on
Commit
01bcde9
1 Parent(s): 1932bfa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -5
README.md CHANGED
@@ -9,6 +9,7 @@ tags:
9
  - mistral
10
  - trl
11
  - sft
 
12
  base_model: unsloth/mistral-7b-v0.2-bnb-4bit
13
  datasets:
14
  - visheratin/realworldqa
@@ -18,19 +19,47 @@ datasets:
18
  <img src="https://i.imgur.com/Pf53ms5.jpeg" width="400"/>
19
 
20
 
21
- An experiment with the goal of reducing halucinations in [VQA](https://huggingface.co/tasks/visual-question-answering)
22
- First in of a series of projects centering around fine-tuning for image captioning.
 
 
 
23
 
24
- Release Notes:
25
  * v0.1 - Initial Release
26
- * v0.2 (Current)- Updating base model to official Mistral-7b fp16 release, refinements to dataset and instruction formating
27
 
 
28
 
29
  Mistral-7b-02 base model was fine-tuned using the [RealWorldQA dataset](https://huggingface.co/datasets/visheratin/realworldqa), originally provided by the X.Ai Team here: https://x.ai/blog/grok-1.5v
30
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31
  - **Developed by:** InferenceIllusionist
32
  - **License:** apache-2.0
33
  - **Finetuned from model :** mistral-community/Mistral-7B-v0.2
34
 
 
35
 
36
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
9
  - mistral
10
  - trl
11
  - sft
12
+ - gguf
13
  base_model: unsloth/mistral-7b-v0.2-bnb-4bit
14
  datasets:
15
  - visheratin/realworldqa
 
19
  <img src="https://i.imgur.com/Pf53ms5.jpeg" width="400"/>
20
 
21
 
22
+ An experiment with the goal of reducing hallucinations in [VQA](https://huggingface.co/tasks/visual-question-answering)
23
+
24
+ First in a series of experiments centering around fine-tuning for image captioning.
25
+
26
+ <h1>Release Notes</h1>
27
 
 
28
  * v0.1 - Initial Release
29
+ * <b>v0.2</b> (Current)- Updating base model to official Mistral-7b fp16 release, refinements to dataset and instruction formating
30
 
31
+ <h2>Background & Methodology</h2>
32
 
33
  Mistral-7b-02 base model was fine-tuned using the [RealWorldQA dataset](https://huggingface.co/datasets/visheratin/realworldqa), originally provided by the X.Ai Team here: https://x.ai/blog/grok-1.5v
34
 
35
+ <h1>Vision Results</h1>
36
+
37
+ Example 1
38
+ <img src="https://i.imgur.com/E9mS4Xb.jpeg" width="400"/>
39
+ Example 2
40
+ <img src="https://i.imgur.com/SmTz1Yd.jpeg" width="400"/>
41
+
42
+ * Experiment yielded model that provides shorter, less verbose output for questions about pictures
43
+ * The likelihood of hallucinations in output has decreased, however, the model can still be easily influenced to be inaccurate by the user
44
+ * Best suited for captioning use cases that require concise descriptions and low token counts
45
+ * This model lacks the conversational prose of Excalibur-7b-DPO and is much "drier" in tone
46
+
47
+ <b>Requires additional mmproj file. You have two options for vision functionality (available inside this repo):</b>
48
+ 1. [Quantized - Limited VRAM Option (197mb)](https://huggingface.co/InferenceIllusionist/Mistral-RealworldQA-v0.2-7b-SFT/resolve/main/mistral-7b-mmproj-v1.5-Q4_1.gguf?download=true)
49
+ 2. [Unquantized - Premium Option / Best Quality (596mb)](https://huggingface.co/InferenceIllusionist/Mistral-RealworldQA-v0.2-7b-SFT/resolve/main/mmproj-model-f16.gguf?download=true)
50
+
51
+ Select the gguf file of your choice in [Koboldcpp](https://github.com/LostRuins/koboldcpp/releases/) as usual, then make sure to choose the mmproj file above in the LLaVA mmproj field of the model submenu:
52
+ <img src="https://i.imgur.com/x8vqH29.png" width="425"/>
53
+
54
+ ## Prompt Format
55
+ Use Alpaca for best results.
56
+
57
+
58
+ ## Other info
59
  - **Developed by:** InferenceIllusionist
60
  - **License:** apache-2.0
61
  - **Finetuned from model :** mistral-community/Mistral-7B-v0.2
62
 
63
+ This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
64
 
65
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)