zgce committed
Commit 1117552 · verified · 1 Parent(s): ad54932

Update README.md

Files changed (1)
  1. README.md +9 -15
README.md CHANGED
@@ -8,20 +8,6 @@ license: mit
 
 This model is a fine-tuned version of [/root/LLaMA-Factory/models/Qwen2.5-14B-Instruct-GPTQ-Int8](https://huggingface.co//root/LLaMA-Factory/models/Qwen2.5-14B-Instruct-GPTQ-Int8) on the airboros-31_en and the airboros-31_zh datasets.
 
-## Model description
-
-More information needed
-
-## Intended uses & limitations
-
-More information needed
-
-## Training and evaluation data
-
-More information needed
-
-## Training procedure
-
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
@@ -37,7 +23,15 @@ The following hyperparameters were used during training:
 - mixed_precision_training: Native AMP
 
 ### Training results
-
+{
+"epoch": 0.9997864616698697,
+"num_input_tokens_seen": 74083488,
+"total_flos": 4.6864422704480256e+17,
+"train_loss": 0.692321076499046,
+"train_runtime": 65496.9949,
+"train_samples_per_second": 1.144,
+"train_steps_per_second": 0.036
+}
 
 
 ### Framework versions
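
As a rough cross-check of the trainer summary added in the diff above, the reported rates and runtime are mutually consistent; the sketch below is plain arithmetic on the JSON values (variable names simply mirror the keys) and derives the approximate step count, sample count, and token throughput.

```python
# Back-of-the-envelope figures derived from the reported trainer summary.
train_runtime = 65496.9949           # seconds, roughly 18.2 hours
train_samples_per_second = 1.144
train_steps_per_second = 0.036
num_input_tokens_seen = 74_083_488

print(f"approx. optimizer steps:   {train_steps_per_second * train_runtime:,.0f}")       # ~2,358
print(f"approx. samples processed: {train_samples_per_second * train_runtime:,.0f}")     # ~74,929
print(f"approx. token throughput:  {num_input_tokens_seen / train_runtime:,.0f} tok/s")  # ~1,131
```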
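
A minimal inference sketch with Hugging Face Transformers follows, assuming the fine-tuned weights were released as a full (merged) model: the repository id used here is a placeholder, not the confirmed name, and loading the GPTQ-Int8 base additionally requires a GPTQ-capable backend (e.g. optimum) to be installed. If only a LoRA adapter was published, it would instead be attached to Qwen2.5-14B-Instruct-GPTQ-Int8 with PEFT.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder repository id; the actual repo name is not stated in this card.
model_id = "zgce/Qwen2.5-14B-airoboros"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" spreads the quantized layers over the available GPUs.
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Qwen2.5-Instruct models are chat-tuned, so build the prompt via the chat template.
messages = [{"role": "user", "content": "Summarize the airoboros dataset in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```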