aashish1904 commited on
Commit
06b97cc
1 Parent(s): c8de753

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +70 -0
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ license: apache-2.0
5
+ base_model: distilgpt2
6
+ tags:
7
+ - generated_from_trainer
8
+ model-index:
9
+ - name: distilgpt2-finetuned-databricks
10
+ results: []
11
+
12
+ ---
13
+
14
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
15
+
16
+
17
+ # QuantFactory/distilgpt2-finetuned-databricks-GGUF
18
+ This is quantized version of [Vishaltiwari2019/distilgpt2-finetuned-databricks](https://huggingface.co/Vishaltiwari2019/distilgpt2-finetuned-databricks) created using llama.cpp
19
+
20
+ # Original Model Card
21
+
22
+
23
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
24
+ should probably proofread and complete it, then remove this comment. -->
25
+
26
+ # distilgpt2-finetuned-databricks
27
+
28
+ This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
29
+ It achieves the following results on the evaluation set:
30
+ - Loss: 3.2376
31
+
32
+ ## Model description
33
+
34
+ More information needed
35
+
36
+ ## Intended uses & limitations
37
+
38
+ More information needed
39
+
40
+ ## Training and evaluation data
41
+
42
+ More information needed
43
+
44
+ ## Training procedure
45
+
46
+ ### Training hyperparameters
47
+
48
+ The following hyperparameters were used during training:
49
+ - learning_rate: 2e-05
50
+ - train_batch_size: 8
51
+ - eval_batch_size: 8
52
+ - seed: 42
53
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
54
+ - lr_scheduler_type: linear
55
+ - num_epochs: 0.6
56
+
57
+ ### Training results
58
+
59
+ | Training Loss | Epoch | Step | Validation Loss |
60
+ |:-------------:|:-----:|:----:|:---------------:|
61
+ | 3.4404 | 0.6 | 543 | 3.2376 |
62
+
63
+
64
+ ### Framework versions
65
+
66
+ - Transformers 4.39.3
67
+ - Pytorch 2.2.1+cu121
68
+ - Datasets 2.18.0
69
+ - Tokenizers 0.15.2
70
+