Sansa commited on
Commit
0b399ea
1 Parent(s): 01ea216

update readme

Browse files
Files changed (1) hide show
  1. README.md +10 -25
README.md CHANGED
@@ -1,13 +1,18 @@
1
  ---
2
  library_name: transformers
3
- base_model: output/llama2-ddm
 
4
  tags:
5
  - llama-factory
6
  - full
7
- - generated_from_trainer
8
  model-index:
9
  - name: diffullama
10
  results: []
 
 
 
 
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,36 +20,16 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # diffullama
17
 
18
- This model is a fine-tuned version of [output/llama2-ddm](https://huggingface.co/output/llama2-ddm) on an unknown dataset.
19
 
20
  ## Model description
21
 
22
- More information needed
23
 
24
- ## Intended uses & limitations
25
-
26
- More information needed
27
-
28
- ## Training and evaluation data
29
-
30
- More information needed
31
-
32
- ## Training procedure
33
-
34
- ### Training hyperparameters
35
-
36
- The following hyperparameters were used during training:
37
- - learning_rate: 5e-05
38
- - train_batch_size: 8
39
- - eval_batch_size: 8
40
- - seed: 42
41
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
- - lr_scheduler_type: linear
43
- - num_epochs: 3.0
44
 
45
  ### Framework versions
46
 
47
  - Transformers 4.44.2
48
  - Pytorch 2.1.1+cu121
49
  - Datasets 2.21.0
50
- - Tokenizers 0.19.1
 
1
  ---
2
  library_name: transformers
3
+ base_model:
4
+ - meta-llama/Llama-2-7b-hf
5
  tags:
6
  - llama-factory
7
  - full
8
+ - diffusion
9
  model-index:
10
  - name: diffullama
11
  results: []
12
+ license: mit
13
+ datasets:
14
+ - bigcode/starcoderdata
15
+ - cerebras/SlimPajama-627B
16
  ---
17
 
18
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
20
 
21
  # diffullama
22
 
23
+ This model is a fine-tuned version of [llama2].
24
 
25
  ## Model description
26
 
27
+ Details and model loading can be seen [https://github.com/HKUNLP/DiffuLLaMA](https://github.com/HKUNLP/DiffuLLaMA).
28
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
 
30
  ### Framework versions
31
 
32
  - Transformers 4.44.2
33
  - Pytorch 2.1.1+cu121
34
  - Datasets 2.21.0
35
+ - Tokenizers 0.19.1