jtatman/llama-3.2-1b-trismegistus


jtatman committed 6 days ago

Commit 9cfdabd • 1 parent: 79afa21

Update README.md

Files changed (1)

README.md +10 -18

@@ -92,31 +92,23 @@ Use the code below to get started with the model.
 #### Training Hyperparameters
-- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
 #### Speeds, Sizes, Times [optional]
-<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
-[More Information Needed]
-## Evaluation
-<!-- This section describes the evaluation protocols and provides the results. -->
-### Testing Data, Factors & Metrics
-#### Testing Data
-<!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
-#### Factors
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-[More Information Needed]
 #### Metrics

 #### Training Hyperparameters
+- lora 4bit peft
 #### Speeds, Sizes, Times [optional]
+- global_step=16905
+- training_loss=1.169401215731269
+- train_runtime: 21882.4747
+- train_samples_per_second: 3.09
+- train_steps_per_second: 0.773
+- total_flos: 4.437195883099177e+17
+- train_loss': 1.169401215731269
+- epoch: 5.0
+## Evaluation and Metrics
 #### Metrics
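
The figures added under "Speeds, Sizes, Times" can be cross-checked against each other. The following is a minimal sketch, not part of the commit: it assumes `train_runtime` is in seconds and that the ratio of samples per second to steps per second approximates the number of samples consumed per optimizer step (the README does not state the batch size or gradient accumulation settings). The function name `derived_stats` is hypothetical.

```python
# Values copied from the lines added in this commit.
GLOBAL_STEP = 16905
TRAIN_RUNTIME_S = 21882.4747      # assumed to be seconds
SAMPLES_PER_SECOND = 3.09
STEPS_PER_SECOND = 0.773
EPOCHS = 5.0


def derived_stats(global_step, runtime_s, samples_per_s, steps_per_s, epochs):
    """Derive a few sanity-check quantities from the logged training metrics."""
    return {
        # Wall-clock training time in hours.
        "runtime_hours": runtime_s / 3600.0,
        # Recomputed throughput; should agree with the logged steps per second.
        "steps_per_second": global_step / runtime_s,
        # Samples per optimizer step (an estimate, see assumptions above).
        "effective_batch_size": samples_per_s / steps_per_s,
        # Optimizer steps in one pass over the training data.
        "steps_per_epoch": global_step / epochs,
    }


stats = derived_stats(
    GLOBAL_STEP, TRAIN_RUNTIME_S, SAMPLES_PER_SECOND, STEPS_PER_SECOND, EPOCHS
)

for name, value in stats.items():
    print(f"{name}: {value:.4f}")
```

Under these assumptions the run took roughly six hours, the recomputed step rate rounds to the logged 0.773, and each optimizer step consumed about four samples.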