.gitattributes CHANGED
@@ -33,4 +33,3 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
- tokenizer.json filter=lfs diff=lfs merge=lfs -text
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
README.md CHANGED
@@ -1,78 +1,3 @@
1
  ---
2
- language:
3
- - zh
4
- - en
5
- tags:
6
- - translation
7
- - gpt-style
8
- - chinese
9
- - english
10
- license: "bigscience-bloom-rail-1.0"
11
  ---
12
-
13
-
14
-
15
- ## English:
16
-
17
- ### ImmersiveL Model on Hugging Face
18
-
19
- This model, available on Hugging Face under `funstoryai/immersiveL-exp`, is a GPT-like model designed specifically for English-Chinese and Chinese-English translations.
20
-
21
- **Recommended Prompts:**
22
-
23
- For English to Chinese:
24
- ```
25
- 下面是一段英文文本,请将它翻译成中文。
26
- {terms}
27
- #英文文本:
28
- {input}
29
-
30
- #中文翻译:
31
- ```
32
-
33
- For Chinese to English:
34
- ```
35
- 下面是一段中文文本,请将它翻译成英文。
36
- {terms}
37
- #中文文本:
38
- {input}
39
-
40
- #英文翻译:
41
- ```
42
-
43
- For the corresponding GitHub project, please visit: [ImmersiveL on GitHub](https://github.com/immersive-translate/ImmersiveL).
44
- <https://github.com/immersive-translate/ImmersiveL>
45
- ---
46
-
47
- ## 中文:
48
-
49
- ### Hugging Face 上的 ImmersiveL 模型
50
-
51
- 此模型在 Hugging Face 的 `funstoryai/immersiveL-exp` 下可用,是专为英汉和汉英翻译设计的类GPT模型。
52
-
53
- **推荐提示词:**
54
-
55
- 英译中:
56
- ```
57
- 下面是一段英文文本,请将它翻译成中文。
58
- {terms}
59
- #英文文本:
60
- {input}
61
-
62
- #中文翻译:
63
- ```
64
-
65
- 中译英:
66
- ```
67
- 下面是一段中文文本,请将它翻译成英文。
68
- {terms}
69
- #中文文本:
70
- {input}
71
-
72
- #英文翻译:
73
- ```
74
-
75
- 对应的 GitHub 项目地址为: [ImmersiveL on GitHub](https://github.com/immersive-translate/ImmersiveL).
76
- <https://github.com/immersive-translate/ImmersiveL>
77
-
78
-
 
1
  ---
2
+ license: bigscience-bloom-rail-1.0
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
config.json DELETED
@@ -1,33 +0,0 @@
1
- {
2
- "_name_or_path": "bigscience/bloomz-1b1",
3
- "apply_residual_connection_post_layernorm": false,
4
- "architectures": [
5
- "BloomForCausalLM"
6
- ],
7
- "attention_dropout": 0.0,
8
- "attention_softmax_in_fp32": true,
9
- "bias_dropout_fusion": true,
10
- "bos_token_id": 1,
11
- "eos_token_id": 2,
12
- "hidden_dropout": 0.0,
13
- "hidden_size": 1536,
14
- "initializer_range": 0.02,
15
- "layer_norm_epsilon": 1e-05,
16
- "masked_softmax_fusion": true,
17
- "model_type": "bloom",
18
- "n_head": 16,
19
- "n_inner": null,
20
- "n_layer": 24,
21
- "offset_alibi": 100,
22
- "pad_token_id": 3,
23
- "pretraining_tp": 1,
24
- "seq_length": 2048,
25
- "skip_bias_add": true,
26
- "skip_bias_add_qkv": false,
27
- "slow_but_exact": false,
28
- "torch_dtype": "float32",
29
- "transformers_version": "4.29.0",
30
- "unk_token_id": 0,
31
- "use_cache": true,
32
- "vocab_size": 250880
33
- }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
generation_config.json DELETED
@@ -1,7 +0,0 @@
1
- {
2
- "_from_model_config": true,
3
- "bos_token_id": 1,
4
- "eos_token_id": 2,
5
- "pad_token_id": 3,
6
- "transformers_version": "4.29.0"
7
- }
 
 
 
 
 
 
 
 
model.safetensors DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:d5fa281125ecb310989edf7edc31b438e85da24b597f78bdb9cbcf5f341a3702
3
- size 5802698512
 
 
 
 
pytorch_model.bin DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:c6f8a8e8476d2dab3966aac90d62157d9c53cbdd44070a7e03639629a54506ca
3
- size 5802770517
 
 
 
 
special_tokens_map.json DELETED
@@ -1,6 +0,0 @@
1
- {
2
- "bos_token": "<s>",
3
- "eos_token": "</s>",
4
- "pad_token": "<pad>",
5
- "unk_token": "<unk>"
6
- }
 
 
 
 
 
 
 
tokenizer.json DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:17a208233d2ee8d8c83b23bc214df737c44806a1919f444e89b31e586cd956ba
3
- size 14500471
 
 
 
 
tokenizer_config.json DELETED
@@ -1,11 +0,0 @@
1
- {
2
- "add_prefix_space": false,
3
- "bos_token": "<s>",
4
- "clean_up_tokenization_spaces": false,
5
- "eos_token": "</s>",
6
- "model_max_length": 800,
7
- "pad_token": "<pad>",
8
- "padding_side": "left",
9
- "tokenizer_class": "BloomTokenizer",
10
- "unk_token": "<unk>"
11
- }