satyathakur committed
Commit 7264a6b
Parent: a9c93bf

End of training

Files changed (1): README.md (+6 −5)
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
- license: llama2
- base_model: TheBloke/Xwin-LM-7B-V0.1-GPTQ
+ license: apache-2.0
+ base_model: TheBloke/Mistral-7B-Instruct-v0.1-GPTQ
  tags:
  - generated_from_trainer
  model-index:
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->

  # Mistral

- This model is a fine-tuned version of [TheBloke/Xwin-LM-7B-V0.1-GPTQ](https://huggingface.co/TheBloke/Xwin-LM-7B-V0.1-GPTQ) on the None dataset.
+ This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.1-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GPTQ) on the None dataset.

  ## Model description

@@ -39,6 +39,7 @@ The following hyperparameters were used during training:
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: cosine
  - training_steps: 250
+ - mixed_precision_training: Native AMP

  ### Training results

@@ -46,7 +47,7 @@ The following hyperparameters were used during training:

  ### Framework versions

- - Transformers 4.34.1
- - Pytorch 2.0.1+cu118
+ - Transformers 4.35.0.dev0
+ - Pytorch 2.1.0+cu118
  - Datasets 2.14.5
  - Tokenizers 0.14.1
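
The updated card records just enough of the setup to sketch how the run might be reproduced. The code below does not ship with this commit and is an assumption throughout: the toy dataset stands in for the unnamed ("None") training data, the LoRA adapter settings are hypothetical (GPTQ base weights are frozen, so some adapter is presumably how the fine-tune was trained), and loading a GPTQ checkpoint through transformers requires the optimum and auto-gptq packages. Only the hyperparameters that actually appear in the card (Adam betas and epsilon, cosine schedule, 250 steps, Native AMP) are passed through unchanged.

```python
# Hypothetical reconstruction of the training setup described in the card.
# Requires: transformers, peft, datasets, optimum, auto-gptq.
from datasets import Dataset
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments)

base = "TheBloke/Mistral-7B-Instruct-v0.1-GPTQ"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base, device_map="auto")

# GPTQ weights stay frozen, so training is assumed to go through a LoRA
# adapter; rank, alpha, and target modules here are placeholders.
model = prepare_model_for_kbit_training(model)
model = get_peft_model(
    model,
    LoraConfig(task_type="CAUSAL_LM", r=8, lora_alpha=16,
               target_modules=["q_proj", "v_proj"]),
)

# Placeholder dataset; the card does not name the real training data.
def tokenize(example):
    enc = tokenizer(example["text"], truncation=True, max_length=512)
    enc["labels"] = enc["input_ids"].copy()  # causal-LM labels = inputs
    return enc

train_dataset = Dataset.from_dict(
    {"text": ["Example instruction and response."]}
).map(tokenize, remove_columns=["text"])

# Values taken from the card's hyperparameter list; fp16=True is the
# Trainer's "Native AMP" mixed-precision mode the new card records.
args = TrainingArguments(
    output_dir="Mistral",
    max_steps=250,
    lr_scheduler_type="cosine",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    fp16=True,
    per_device_train_batch_size=1,
)

trainer = Trainer(model=model, args=args, train_dataset=train_dataset)
trainer.train()
```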