metadata
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- mistral
- trl
- sft
base_model: unsloth/mistral-7b-v0.2-bnb-4bit
datasets:
- visheratin/realworldqa
Mistral-RealworldQA-v0.2-7b SFT GGUF
An experiment with the goal of reducing halucinations in VQA First in of a series of projects centering around fine-tuning for image captioning.
Release Notes:
- v0.1 - Initial Release
- v0.2 (Current)- Updating base model to official Mistral-7b fp16 release, refinements to dataset and instruction formating
Mistral-7b-02 base model was fine-tuned using the RealWorldQA dataset, originally provided by the X.Ai Team here: https://x.ai/blog/grok-1.5v
- Developed by: InferenceIllusionist
- License: apache-2.0
- Finetuned from model : mistral-community/Mistral-7B-v0.2