Provide learning code

by gihong99 - opened Dec 21, 2023

Dec 21, 2023

I'm a student who learns and touches the LLM model. Can I get the information on the number of data and the learning code separately?

Sao10K

Owner Dec 21, 2023

Hi, what exactly do you mean for the learning code?

If it is hyperparameters, this is it:

qLoRA finetune.
2 Epochs - Batch size of 1 on 4x 4090s.
Learning Rate - 0.00035
Scheduler - Cosine
Optimizer - adamw_bnb_8bit
LoRA rank - 16 | LoRA alpha - 32
lora_target_modules - [ - gate_proj - down_proj - up_proj - q_proj - v_proj - k_proj - o_proj]

For the dataset itself, that is private. Number of samples would be based on the model card, so 10% is 5.2K, total 52K. I listed the composition there.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment