Sansa committed on
Commit
7d726bb
·
verified ·
1 Parent(s): 925f0cd

update readme

Files changed (1)
  1. README.md +7 -24
README.md CHANGED
@@ -5,10 +5,14 @@ base_model: gpt2-medium
  tags:
  - llama-factory
  - full
- - generated_from_trainer
+ - diffusion
  model-index:
  - name: diffugpt-m
  results: []
+ datasets:
+ - HuggingFaceFW/fineweb
+ language:
+ - en
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -20,32 +24,11 @@ This model is a fine-tuned version of [gpt2-medium](https://huggingface.co/gpt2-
 
  ## Model description
 
- More information needed
-
- ## Intended uses & limitations
-
- More information needed
-
- ## Training and evaluation data
-
- More information needed
-
- ## Training procedure
-
- ### Training hyperparameters
-
- The following hyperparameters were used during training:
- - learning_rate: 5e-05
- - train_batch_size: 8
- - eval_batch_size: 8
- - seed: 42
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - num_epochs: 3.0
+ Details and model loading can be seen [https://github.com/HKUNLP/DiffuLLaMA](https://github.com/HKUNLP/DiffuLLaMA).
 
  ### Framework versions
 
  - Transformers 4.44.2
  - Pytorch 2.1.1+cu121
  - Datasets 2.21.0
- - Tokenizers 0.19.1
+ - Tokenizers 0.19.1