metadata
base_model: unsloth/Llama-3.2-1B-Instruct-bnb-4bit
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
datasets:
- yahma/alpaca-cleaned
Description
- Motivation: Fine-tuned version of Llama 3.2 model to generate a quiz for a given context. This version is just poorly trained on the Alpaca version (Chatbot usecase). Also this dataset only have 80 data points on json task [https://huggingface.co/datasets/yahma/alpaca-cleaned/viewer/default/train?q=json] so the model also weak on structured json output.
Uploaded model
- Developed by: vietphuon
- License: apache-2.0
- Finetuned from model : unsloth/Llama-3.2-1B-Instruct-bnb-4bit
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.