Half of the training data was geared towards better reasoning (EvolKit-20k and reasoning-base-20k); the other half was chosen to help de-censor the model (WizardLM dataset).
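The half-and-half split described above can be illustrated with a minimal sketch. It assumes each pool is already loaded as a list of rows; the function name `mix_half_and_half`, the toy rows, and the seed are all hypothetical and are not taken from the actual training pipeline, which is not documented here.

```python
import random


def mix_half_and_half(reasoning_rows, uncensored_rows, seed=0):
    """Return a shuffled list with an equal number of rows from each pool.

    Hypothetical helper: the real ReWiz data preparation may differ.
    """
    rng = random.Random(seed)
    # Use the size of the smaller pool so both halves are the same length.
    n = min(len(reasoning_rows), len(uncensored_rows))
    picked = rng.sample(reasoning_rows, n) + rng.sample(uncensored_rows, n)
    rng.shuffle(picked)
    return picked


# Toy stand-ins for the real datasets (made-up rows, not actual data).
reasoning = [{"source": "reasoning", "id": i} for i in range(6)]
uncensored = [{"source": "wizardlm", "id": i} for i in range(10)]

mixed = mix_half_and_half(reasoning, uncensored, seed=42)

# Count how many rows came from each pool.
counts = {}
for row in mixed:
    counts[row["source"]] = counts.get(row["source"], 0) + 1
print(counts)
```

Sampling down to the smaller pool keeps the two halves balanced at the cost of discarding rows from the larger one; whether the original mix was balanced this way is an assumption.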

Uploaded model

Developed by: theprint
License: apache-2.0
Finetuned from model: unsloth/meta-llama-3.1-8b-bnb-4bit

This Llama model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Format: Safetensors
Model size: 8.03B params
Tensor type: BF16

Collection including theprint/ReWiz-Llama-3.1-8B: ReWiz

The ReWiz series is based on a subset of data from three different datasets, which was used for fine-tuning.