avrecum/mistral7b-v0.3-alpaca-cleaned


avrecum committed on May 31, 2024

Commit d647b79 · verified · 1 Parent(s): 237c45c

Update README.md

Files changed (1)

README.md +11 -0


@@ -7,3 +7,14 @@ pipeline_tag: text-generation
 <!-- Provide a quick summary of what the model is/does. -->
 Mistral 7B v0.3 finetuned on cleaned Stanford Alpaca dataset using LoRA

+The model was finetuned for 1 epoch using the paged_adamw_8bit optimizer with these parameters:
+per_device_train_batch_size = 10,
+gradient_accumulation_steps = 4,
+warmup_steps = 5,
+num_train_epochs=1,
+learning_rate = 2e-4,
+optim = "paged_adamw_8bit",
+weight_decay = 0.01,
+lr_scheduler_type = "linear",
+seed = 3407
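
As a rough illustration of what these settings imply, the sketch below computes the effective batch size per optimizer step and the learning rate under a linear schedule with warmup. The values in `hparams` are copied from the list above; the helper names `effective_batch_size` and `linear_lr`, the `num_devices` argument, and the `total_steps` value are assumptions made for illustration only, since the README states neither the number of GPUs nor the total step count.

```python
# Hyperparameters copied from the README diff above.
hparams = {
    "per_device_train_batch_size": 10,
    "gradient_accumulation_steps": 4,
    "warmup_steps": 5,
    "num_train_epochs": 1,
    "learning_rate": 2e-4,
    "optim": "paged_adamw_8bit",
    "weight_decay": 0.01,
    "lr_scheduler_type": "linear",
    "seed": 3407,
}


def effective_batch_size(hp, num_devices=1):
    """Examples consumed per optimizer step.

    num_devices is an assumption; the README does not say how many GPUs were used.
    """
    return (
        hp["per_device_train_batch_size"]
        * hp["gradient_accumulation_steps"]
        * num_devices
    )


def linear_lr(step, total_steps, hp):
    """Learning rate at `step` for linear warmup followed by linear decay to zero.

    total_steps is hypothetical; it depends on dataset size, which is not given here.
    """
    warmup = hp["warmup_steps"]
    base = hp["learning_rate"]
    if step < warmup:
        # Ramp up linearly from 0 to the base learning rate.
        return base * step / max(1, warmup)
    # Decay linearly from the base learning rate to 0 at total_steps.
    return base * max(0.0, (total_steps - step) / max(1, total_steps - warmup))


if __name__ == "__main__":
    print("effective batch size:", effective_batch_size(hparams))
    for s in (0, 5, 50, 100):
        print(f"step {s}: lr = {linear_lr(s, total_steps=100, hp=hparams):.6f}")
```

With a single device, each optimizer step would see 10 × 4 = 40 examples, and the learning rate would reach its peak of 2e-4 after the 5 warmup steps before decaying linearly.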