chain-texts-0.1-mixtral-8x7b

Browse files

Files changed (4) hide show

README.md +0 -24
adapter_model.safetensors +2 -2
runs/Apr25_16-29-44_5a64c3d72f47/events.out.tfevents.1714062584.5a64c3d72f47.797.1 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -19,8 +19,6 @@ should probably proofread and complete it, then remove this comment. -->
 # mistral_instruct_generation
 This model is a fine-tuned version of [cognitivecomputations/dolphin-2.2.1-mistral-7b](https://huggingface.co/cognitivecomputations/dolphin-2.2.1-mistral-7b) on the generator dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.5008
 ## Model description
@@ -48,28 +46,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 0.03
 - num_epochs: 3
-### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 1.9388        | 0.1869 | 20   | 1.8326          |
-| 1.7574        | 0.3738 | 40   | 1.6963          |
-| 1.6736        | 0.5607 | 60   | 1.6514          |
-| 1.6703        | 0.7477 | 80   | 1.6321          |
-| 1.6708        | 0.9346 | 100  | 1.6113          |
-| 1.6049        | 1.1215 | 120  | 1.5912          |
-| 1.6446        | 1.3084 | 140  | 1.5891          |
-| 1.5222        | 1.4953 | 160  | 1.5691          |
-| 1.6022        | 1.6822 | 180  | 1.5599          |
-| 1.4326        | 1.8692 | 200  | 1.5512          |
-| 1.4876        | 2.0561 | 220  | 1.5548          |
-| 1.4821        | 2.2430 | 240  | 1.5379          |
-| 1.3868        | 2.4299 | 260  | 1.5326          |
-| 1.5277        | 2.6168 | 280  | 1.5286          |
-| 1.5205        | 2.8037 | 300  | 1.5084          |
-| 1.4125        | 2.9907 | 320  | 1.5008          |
 ### Framework versions
 - PEFT 0.10.0

 # mistral_instruct_generation
 This model is a fine-tuned version of [cognitivecomputations/dolphin-2.2.1-mistral-7b](https://huggingface.co/cognitivecomputations/dolphin-2.2.1-mistral-7b) on the generator dataset.
 ## Model description
 - lr_scheduler_warmup_steps: 0.03
 - num_epochs: 3
 ### Framework versions
 - PEFT 0.10.0

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e44ce263e6fd885f50d82ca515b9325375b43ee36ededb75acf161ce88bc2e41
-size 48

 version https://git-lfs.github.com/spec/v1
+oid sha256:3e93218612a35c41f3f154c7607a8f27c50bfcf507bc1c115991ba8d602604af
+size 109069176

runs/Apr25_16-29-44_5a64c3d72f47/events.out.tfevents.1714062584.5a64c3d72f47.797.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:20573f80d39a4591ab730ecf7bbe35b55f08f4c34be6ff4db51c6484327c2f78
+size 4184

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c6bb4d527fa0e60d02ccf1135f2075a099775812fcf7c96cb53a9792bf417e0
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:b4de7e0bc9eb6478f2441b556f02b1149fd84c0ae240ec8ac1e66d6966fdb56f
 size 4984