saminyeasar/sft-pythia-1b-deduped-tldr-preference-sft-trl

+---
+library_name: transformers
+license: apache-2.0
+base_model: EleutherAI/pythia-1b-deduped
+tags:
+- trl
+- sft
+- generated_from_trainer
+model-index:
+- name: sft-pythia-1b-deduped-tldr-preference-sft-trl-style-20241031-202104
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# sft-pythia-1b-deduped-tldr-preference-sft-trl-style-20241031-202104
+This model is a fine-tuned version of [EleutherAI/pythia-1b-deduped](https://huggingface.co/EleutherAI/pythia-1b-deduped) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.4606
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1.41e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 16
+- total_train_batch_size: 128
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 1.0
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss |
+|:-------------:|:------:|:----:|:---------------:|
+| 2.5542        | 0.0221 | 20   | 2.5166          |
+| 2.5232        | 0.0441 | 40   | 2.4990          |
+| 2.5299        | 0.0662 | 60   | 2.4913          |
+| 2.5028        | 0.0883 | 80   | 2.4865          |
+| 2.4449        | 0.1104 | 100  | 2.4829          |
+| 2.4787        | 0.1324 | 120  | 2.4799          |
+| 2.4927        | 0.1545 | 140  | 2.4778          |
+| 2.4421        | 0.1766 | 160  | 2.4759          |
+| 2.4821        | 0.1986 | 180  | 2.4743          |
+| 2.4504        | 0.2207 | 200  | 2.4728          |
+| 2.4284        | 0.2428 | 220  | 2.4716          |
+| 2.4677        | 0.2649 | 240  | 2.4704          |
+| 2.4817        | 0.2869 | 260  | 2.4692          |
+| 2.4607        | 0.3090 | 280  | 2.4683          |
+| 2.4314        | 0.3311 | 300  | 2.4675          |
+| 2.4429        | 0.3532 | 320  | 2.4668          |
+| 2.4618        | 0.3752 | 340  | 2.4661          |
+| 2.4372        | 0.3973 | 360  | 2.4655          |
+| 2.444         | 0.4194 | 380  | 2.4649          |
+| 2.467         | 0.4414 | 400  | 2.4644          |
+| 2.4718        | 0.4635 | 420  | 2.4640          |
+| 2.452         | 0.4856 | 440  | 2.4635          |
+| 2.4283        | 0.5077 | 460  | 2.4632          |
+| 2.4349        | 0.5297 | 480  | 2.4628          |
+| 2.42          | 0.5518 | 500  | 2.4626          |
+| 2.4291        | 0.5739 | 520  | 2.4623          |
+| 2.4285        | 0.5959 | 540  | 2.4620          |
+| 2.4648        | 0.6180 | 560  | 2.4618          |
+| 2.4195        | 0.6401 | 580  | 2.4616          |
+| 2.4958        | 0.6622 | 600  | 2.4614          |
+| 2.4378        | 0.6842 | 620  | 2.4613          |
+| 2.4647        | 0.7063 | 640  | 2.4611          |
+| 2.4334        | 0.7284 | 660  | 2.4611          |
+| 2.4399        | 0.7504 | 680  | 2.4609          |
+| 2.4598        | 0.7725 | 700  | 2.4608          |
+| 2.4149        | 0.7946 | 720  | 2.4607          |
+| 2.4319        | 0.8167 | 740  | 2.4607          |
+| 2.4581        | 0.8387 | 760  | 2.4606          |
+| 2.4439        | 0.8608 | 780  | 2.4606          |
+| 2.4645        | 0.8829 | 800  | 2.4606          |
+| 2.5149        | 0.9050 | 820  | 2.4606          |
+| 2.482         | 0.9270 | 840  | 2.4606          |
+| 2.473         | 0.9491 | 860  | 2.4605          |
+| 2.4648        | 0.9712 | 880  | 2.4605          |
+| 2.4516        | 0.9932 | 900  | 2.4606          |
+### Framework versions
+- Transformers 4.44.2
+- Pytorch 2.4.0+cu121
+- Datasets 2.14.6
+- Tokenizers 0.19.1

generation_config.json ADDED

@@ -0,0 +1,7 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 0,
+  "eos_token_id": 0,
+  "transformers_version": "4.44.2",
+  "use_cache": false
+}
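Review note: two things stand out in this file. `bos_token_id` and `eos_token_id` share the same id, and `use_cache` is `false`, which may have been carried over from a training-time setting (an assumption; the commit does not say). The sketch below parses the file contents with the standard library only and builds a copy with caching switched on for inference. The helper name `inference_overrides` is hypothetical and not part of any library.

```python
import json

# File contents copied from the diff above.
GENERATION_CONFIG = """
{
  "_from_model_config": true,
  "bos_token_id": 0,
  "eos_token_id": 0,
  "transformers_version": "4.44.2",
  "use_cache": false
}
"""


def inference_overrides(config: dict) -> dict:
    """Return a copy of the config with KV caching enabled (hypothetical helper)."""
    updated = dict(config)
    updated["use_cache"] = True
    return updated


config = json.loads(GENERATION_CONFIG)

# True when the beginning-of-sequence and end-of-sequence ids coincide.
shares_bos_eos = config["bos_token_id"] == config["eos_token_id"]

# The original dict is left untouched; only the copy is changed.
inference_config = inference_overrides(config)

print(shares_bos_eos, inference_config["use_cache"])
```

With Transformers itself, passing `use_cache=True` to `generate` should act as a per-call override without editing the committed file.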

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8e4f5bee8fbdcf0ade00a76731350bb4f0d8e10a9e6deddfa2e9fa7c7ba76936
+oid sha256:08ef93deda297b8777c00b9abc6a1106420cea6cd9ab21029beee21e0275841d
 size 2023586384
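Review note: only the `oid` changes in this pointer while `size` stays at 2023586384 bytes, so the weights were replaced by a file of identical length. A downloaded `model.safetensors` can be checked against the new pointer with a sketch like the one below. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code assumes the pointer uses a hash algorithm that `hashlib` knows by name (here `sha256`).

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Check a local file's size and digest against a parsed LFS pointer."""
    algo, _, expected = pointer["oid"].partition(":")
    # Cheap check first: a size mismatch makes hashing unnecessary.
    if path.stat().st_size != int(pointer["size"]):
        return False
    digest = hashlib.new(algo)
    with path.open("rb") as f:
        # Stream in chunks so a multi-gigabyte file is never held in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# New pointer contents copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:08ef93deda297b8777c00b9abc6a1106420cea6cd9ab21029beee21e0275841d
size 2023586384
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

Usage would be `matches_pointer(Path("model.safetensors"), pointer)`, where the path is wherever the file was downloaded locally.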