NotASI
/

FineTome-Llama3.2-1B-0929

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

FineTome-Llama3.2-1B-0929 / README.md

NotASI's picture

Update README.md

8704c6f verified 4 months ago

|

867 Bytes

	---
	base_model: unsloth/Llama-3.2-1B-Instruct-bnb-4bit
	language:
	- en
	license: llama3.2
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- llama-3
	- trl
	- sft
	datasets:
	- mlabonne/FineTome-100k
	---

	# Uploaded model

	- Developed by: NotASI
	- License: apache-2.0
	- Finetuned from model : unsloth/Llama-3.2-1B-Instruct-bnb-4bit

	# Details

	This model was trained on mlabonne/FineTome-100k for 2 epochs with rslora + qlora, and achieve the final training loss: 0.796700.

	This model follows the same chat template as the base model one.

	This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)