metadata

license: apache-2.0
base_model: distilbert/distilgpt2
tags:
  - generated_from_trainer
model-index:
  - name: GSM8K_Main_DistilGPT2
    results: []

GSM8K_Main_DistilGPT2

This model is a fine-tuned version of distilbert/distilgpt2 on an unknown dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
No log	1.0	1	3.2749
No log	2.0	2	3.2384
No log	3.0	3	3.2112
No log	4.0	4	3.1927
No log	5.0	5	3.1837