emre570
/

gemma-7b-us-minecraft

Model card Files Files and versions Community

emre570 commited on Jun 2, 2024

Commit

3416abf

·

verified ·

1 Parent(s): cdf7bea

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -9,6 +9,8 @@ datasets:
 This model is fine-tuned from [Unsloth's Gemma 1.1 7B Instruct quantized model](https://huggingface.co/unsloth/gemma-1.1-7b-it-bnb-4bit) with [naklecha's Minecraft Question-Answer dataset](https://huggingface.co/datasets/naklecha/minecraft-question-answer-700k).
 Fine-tuned with first 100k rows from dataset with 1 epoch, it took around 2 hours 20 minutes with NVIDIA RTX 4090.
 ## Important Notes
 - Model sometimes generates answers with no meanings. I am currently investigating this. This process can be long since I am a beginner in this field. If you have any suggestions, feel free to say it on model's Community page.
 - Model is using bitsandbytes so use it with a CUDA supported GPU.

 This model is fine-tuned from [Unsloth's Gemma 1.1 7B Instruct quantized model](https://huggingface.co/unsloth/gemma-1.1-7b-it-bnb-4bit) with [naklecha's Minecraft Question-Answer dataset](https://huggingface.co/datasets/naklecha/minecraft-question-answer-700k).
 Fine-tuned with first 100k rows from dataset with 1 epoch, it took around 2 hours 20 minutes with NVIDIA RTX 4090.
+Model can now generate some good answers. But sometimes it can generate inappropriate answers. I think this problem is based on lack of data.
 ## Important Notes
 - Model sometimes generates answers with no meanings. I am currently investigating this. This process can be long since I am a beginner in this field. If you have any suggestions, feel free to say it on model's Community page.
 - Model is using bitsandbytes so use it with a CUDA supported GPU.