jtatman committed on
Commit
9cfdabd
1 Parent(s): 79afa21

Update README.md

Files changed (1)
  1. README.md +10 -18
README.md CHANGED
@@ -92,31 +92,23 @@ Use the code below to get started with the model.
 
 #### Training Hyperparameters
 
-- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+- lora 4bit peft
 
 #### Speeds, Sizes, Times [optional]
 
-<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+- global_step=16905
+- training_loss=1.169401215731269
+- train_runtime: 21882.4747
+- train_samples_per_second: 3.09
+- train_steps_per_second: 0.773
+- total_flos: 4.437195883099177e+17
+- train_loss': 1.169401215731269
+- epoch: 5.0
 
-[More Information Needed]
-
-## Evaluation
-
-<!-- This section describes the evaluation protocols and provides the results. -->
+## Evaluation and Metrics
 
-### Testing Data, Factors & Metrics
 
-#### Testing Data
 
-<!-- This should link to a Dataset Card if possible. -->
-
-[More Information Needed]
-
-#### Factors
-
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-
-[More Information Needed]
 
 #### Metrics
 
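
The throughput figures added under "Speeds, Sizes, Times" can be cross-checked against each other. The sketch below is a reader-side sanity check, not part of the commit. It assumes `train_runtime` is in seconds, which the commit does not state but which matches how the Hugging Face `Trainer` reports it. The effective batch size and steps per epoch it derives are inferred values and do not appear in the README.

```python
# Values copied from the lines added in this commit.
global_step = 16905
train_runtime_s = 21882.4747      # assumption: seconds
train_samples_per_second = 3.09
train_steps_per_second = 0.773
epochs = 5.0

# Steps per second should equal total optimizer steps over wall-clock runtime.
derived_steps_per_second = global_step / train_runtime_s

# Samples consumed per optimizer step (effective batch size).
# Inferred from the two reported rates; not stated in the commit.
implied_batch_size = train_samples_per_second / train_steps_per_second

# Optimizer steps per epoch and total runtime in hours, for readability.
steps_per_epoch = global_step / epochs
runtime_hours = train_runtime_s / 3600

print(f"derived steps/s:     {derived_steps_per_second:.3f}")
print(f"implied batch size:  {implied_batch_size:.2f}")
print(f"steps per epoch:     {steps_per_epoch:.0f}")
print(f"runtime (hours):     {runtime_hours:.2f}")
```

If the derived steps-per-second value agrees with the reported `train_steps_per_second` after rounding, the logged figures are internally consistent.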