esfrankel17/llama3_8b_baseline_instructskillmix



esfrankel17 committed 26 days ago

Commit b8bb140 (1 parent: 1a6bef9)

Model save

Files changed (1)

README.md +3 -3


@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 # llama3_8b_baseline_instructskillmix
-This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the PrincetonPLI/Instruct-SkillMix-SDD dataset.
+This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 1.7085
@@ -56,8 +56,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.5333 | 1    | 1.8346          |
-| No log        | 1.6    | 3    | 1.7085          |
+| 1.9079        | 0.5333 | 1    | 1.8346          |
+| 1.7235        | 1.6    | 3    | 1.7085          |
 ### Framework versions