Jaykumaran17
committed on
Commit · 0cb9e1b
1 Parent(s): 91cde3f
End of training
README.md
ADDED
@@ -0,0 +1,74 @@
---
license: mit
library_name: peft
tags:
- generated_from_trainer
base_model: HuggingFaceH4/zephyr-7b-beta
model-index:
- name: Zephyr-Try2-17-12
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# Zephyr-Try2-17-12

This model is a fine-tuned version of [HuggingFaceH4/zephyr-7b-beta](https://huggingface.co/HuggingFaceH4/zephyr-7b-beta) on an unspecified dataset.

## Model description

More information needed

## Intended uses & limitations

More information needed
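
Since this repository ships a PEFT (LoRA) adapter rather than full model weights, using it means pairing the adapter with the 4-bit quantized base model. Below is a minimal loading sketch, assuming the adapter is published at `Jaykumaran17/Zephyr-Try2-17-12` (a hub path inferred from this commit's author and model name, not confirmed by the card) and reusing the settings listed under "Quantization config" further down.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel

# Re-create the 4-bit setup listed in this card's quantization config.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

base = AutoModelForCausalLM.from_pretrained(
    "HuggingFaceH4/zephyr-7b-beta",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")

# Attach the trained LoRA adapter on top of the frozen base model.
# The repo id below is an assumption (see lead-in), not from the card.
model = PeftModel.from_pretrained(base, "Jaykumaran17/Zephyr-Try2-17-12")
model.eval()

# Zephyr-style chat prompt; generation settings are illustrative only.
prompt = "<|user|>\nWhat is parameter-efficient fine-tuning?</s>\n<|assistant|>\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```

The same adapter can also be loaded in one step with `peft.AutoPeftModelForCausalLM`, which reads the base model path from the adapter config.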

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a hedged `TrainingArguments` sketch follows the list):
- learning_rate: 0.0003
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 150
- num_epochs: 2
- mixed_precision_training: Native AMP
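
These values map onto `transformers.TrainingArguments` roughly as below; this is a reconstruction, not the author's training script. In particular, `output_dir` is a placeholder, and the card's `train_batch_size` is assumed to be per device.

```python
from transformers import TrainingArguments

# Hedged reconstruction of the listed hyperparameters. The default AdamW
# betas (0.9, 0.999) and epsilon 1e-8 already match the card's optimizer line.
training_args = TrainingArguments(
    output_dir="Zephyr-Try2-17-12",      # placeholder, not from the card
    learning_rate=3e-4,
    per_device_train_batch_size=16,      # assumes "train_batch_size" is per device
    per_device_eval_batch_size=16,
    seed=42,
    lr_scheduler_type="cosine",
    warmup_steps=150,
    num_train_epochs=2,
    fp16=True,                           # "Native AMP" mixed precision
)
```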

### Training results

More information needed

### Framework versions

- PEFT 0.6.2
- Transformers 4.36.1
- Pytorch 2.0.0
- Datasets 2.15.0
- Tokenizers 0.15.0

### Quantization config

The following `bitsandbytes` quantization config was used during training (a hedged `BitsAndBytesConfig` sketch follows the list):
- quant_method: bitsandbytes
- load_in_8bit: False
- load_in_4bit: True
- llm_int8_threshold: 6.0
- llm_int8_skip_modules: None
- llm_int8_enable_fp32_cpu_offload: False
- llm_int8_has_fp16_weight: False
- bnb_4bit_quant_type: nf4
- bnb_4bit_use_double_quant: True
- bnb_4bit_compute_dtype: bfloat16
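
For training-time use, these fields correspond one-to-one to `transformers.BitsAndBytesConfig`; a sketch of how the base model might have been prepared for k-bit (QLoRA-style) fine-tuning follows. This is an illustration under those assumptions, not the author's actual script.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import prepare_model_for_kbit_training

# One-to-one mapping of the listed bitsandbytes fields.
quant_config = BitsAndBytesConfig(
    load_in_8bit=False,
    load_in_4bit=True,
    llm_int8_threshold=6.0,
    llm_int8_skip_modules=None,
    llm_int8_enable_fp32_cpu_offload=False,
    llm_int8_has_fp16_weight=False,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "HuggingFaceH4/zephyr-7b-beta",
    quantization_config=quant_config,
    device_map="auto",
)
# Casts selected modules to fp32 and enables input gradients so that a
# LoRA adapter can be trained on top of the frozen 4-bit base.
model = prepare_model_for_kbit_training(model)
```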
adapter_model.bin
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:338b655f203dec30ddb81a568864f44696b5627692e82032db5793439a304856
size 113314765
adapter_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:34219819c440bba175b392a94731b266a2c00144993a2c707e6554d1ded861df
 size 113271504
runs/Dec17_09-54-11_c514349ff8e8/events.out.tfevents.1702806873.c514349ff8e8.42.0
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:6c87bae8511a927da37f195a39704961a2827d5dcae83b7669f1e4888352ce02
+size 13765