khalidalt committed on
Commit
69b42b6
1 Parent(s): 218763d

Update README.md

Files changed (1)
  1. README.md +68 -0
README.md CHANGED
@@ -1,3 +1,71 @@
  ---
+ language:
+ - en
+ - zh
+ - de
+ - fr
+ - es
+ - pt
+ - ru
+ - it
+ - ja
+ - ko
+ - vi
+ - ar
+ tags:
+ - pytorch
+ - text-generation
+ - causal-lm
+ - rwkv
  license: apache-2.0
+ datasets:
+ - khalidalt/Joud
  ---
+
+ # RWKV-4-World-7b-Arabic
+
+ ## Model Description
+
+ RWKV-4-World-7b-Arabic is a version of the pretrained RWKV-4-World model that has been fine-tuned on Arabic datasets.
+
+ RWKV-4
+
+ How to use:
+ * Use https://github.com/josStorer/RWKV-Runner for a GUI.
+ * Use the latest rwkv pip package (0.8.0+).
+ * Use https://github.com/BlinkDL/ChatRWKV/blob/main/v2/benchmark_world.py and https://github.com/BlinkDL/ChatRWKV/blob/main/API_DEMO_WORLD.py to test it.
+
+ The differences between World & Raven:
+ * Set pipeline = PIPELINE(model, "rwkv_vocab_v20230424") instead of 20B_tokenizer.json (EXACTLY AS WRITTEN HERE; "rwkv_vocab_v20230424" is included in rwkv 0.7.4+).
+ * Use Question/Answer, User/AI, or Human/Bot for chat. **DO NOT USE Bob/Alice or Q/A.**
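
The pipeline setting above can be sketched with the rwkv pip package roughly as follows. This is a minimal sketch, not the author's script: the helper names `load_world_pipeline` and `generate` are hypothetical, the sampling values are arbitrary assumptions, and the rwkv imports sit inside the functions so the snippet can be imported on a machine without the package or a GPU.

```python
# Vocabulary name for World models; it must be passed exactly as written.
WORLD_VOCAB = "rwkv_vocab_v20230424"


def load_world_pipeline(model_path, strategy="cuda fp16"):
    """Load a World checkpoint and wrap it in a PIPELINE with the World vocab.

    model_path is a placeholder for wherever the checkpoint was downloaded.
    """
    # Imported lazily so this module loads even when rwkv is not installed.
    from rwkv.model import RWKV
    from rwkv.utils import PIPELINE

    model = RWKV(model=model_path, strategy=strategy)
    return PIPELINE(model, WORLD_VOCAB)


def generate(pipeline, prompt, max_tokens=200):
    """Generate a completion; temperature and top_p are illustrative values."""
    from rwkv.utils import PIPELINE_ARGS

    args = PIPELINE_ARGS(temperature=1.0, top_p=0.3)
    return pipeline.generate(prompt, token_count=max_tokens, args=args)


# Usage (requires the rwkv package and a downloaded checkpoint):
# pipeline = load_world_pipeline("/path/to/checkpoint")  # hypothetical path
# print(generate(pipeline, "Question: hi\n\nAnswer:"))
```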
+
+ For 0.1/0.4/1.5B models, use **fp32** for the first layer (fp16 currently overflows there; this may be fixed in the future), or use bf16 if you have a 30xx/40xx GPU. Example strategy: cuda fp32 *1 -> cuda fp16
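
One way to encode this rule is a small helper that returns a strategy string. The function name `choose_strategy` is hypothetical, and the plain "cuda fp16" fallback for larger models is an assumption, since the note above only covers the 0.1/0.4/1.5B sizes.

```python
# Model sizes (in billions of parameters) that the note above singles out.
SMALL_MODELS_B = {0.1, 0.4, 1.5}


def choose_strategy(params_b, has_bf16=False):
    """Return an rwkv strategy string for a model of the given size.

    has_bf16 should be True on GPUs with bf16 support (e.g. 30xx/40xx).
    """
    if params_b in SMALL_MODELS_B:
        if has_bf16:
            return "cuda bf16"
        # Keep the first layer in fp32 to avoid the fp16 overflow.
        return "cuda fp32 *1 -> cuda fp16"
    # Assumption: larger models are not covered by the note; use plain fp16.
    return "cuda fp16"


print(choose_strategy(1.5))
print(choose_strategy(0.4, has_bf16=True))
```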
+
+ NOTE: the new greedy tokenizer (https://github.com/BlinkDL/ChatRWKV/blob/main/tokenizer/rwkv_tokenizer.py) tokenizes '\n\n' as a single token instead of ['\n','\n'].
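
To illustrate why '\n\n' comes out as one token, here is a toy greedy longest-match tokenizer. It is only a sketch of the general idea: the real tokenizer linked above has its own implementation and vocabulary, and `toy_vocab` is an invented four-entry vocabulary.

```python
def greedy_tokenize(text, vocab):
    """Greedy longest-match: at each position take the longest vocab entry."""
    max_len = max(len(tok) for tok in vocab)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for length in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + length]
            if piece in vocab:
                tokens.append(piece)
                i += length
                break
        else:
            raise ValueError(f"no vocabulary entry matches at position {i}")
    return tokens


# Invented vocabulary that contains both a single and a double newline.
toy_vocab = {"\n", "\n\n", "a", "b"}
print(greedy_tokenize("a\n\nb", toy_vocab))  # ['a', '\n\n', 'b']
```

Because the double newline is in the vocabulary and the match is greedy, it is never split into two single-newline tokens, which is why the prompts below treat '\n\n' as a separator.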
+
+ QA prompt (replace \n\n in xxx with \n):
+ ```
+ Question: xxx
+
+ Answer:
+ ```
+ and
+ ```
+ Instruction: xxx
+
+ Input: xxx
+
+ Response:
+ ```
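
The two templates above can be filled in with small helpers like these. The function names are hypothetical, and collapsing every run of blank lines with a regular expression is one assumed way to apply the "replace \n\n" rule so that three or more consecutive newlines are also handled.

```python
import re


def squash_blank_lines(text):
    """Collapse runs of two or more newlines into a single newline."""
    return re.sub(r"\n{2,}", "\n", text.strip())


def qa_prompt(question):
    """Build the Question/Answer prompt."""
    return f"Question: {squash_blank_lines(question)}\n\nAnswer:"


def instruction_prompt(instruction, input_text):
    """Build the Instruction/Input/Response prompt."""
    return (
        f"Instruction: {squash_blank_lines(instruction)}\n\n"
        f"Input: {squash_blank_lines(input_text)}\n\n"
        "Response:"
    )


print(qa_prompt("What is RWKV?\n\nExplain briefly."))
```

Only the user-supplied text is cleaned; the double newlines between the fields are kept because they act as separators.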
+
+ A good chat prompt (replace \n\n in xxx with \n):
+ ```
+ User: hi
+
+ Assistant: Hi. I am your assistant and I will provide expert full response in full details. Please feel free to ask any question and I will always answer it.
+
+ User: xxx
+
+ Assistant:
+ ```
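
A chat prompt in this shape could be assembled as sketched below. The `chat_prompt` helper and its optional `history` argument are assumptions added for illustration; the template above only shows a single user turn after the greeting.

```python
import re

# Opening exchange copied from the chat prompt above.
GREETING = (
    "User: hi\n\n"
    "Assistant: Hi. I am your assistant and I will provide expert full "
    "response in full details. Please feel free to ask any question and "
    "I will always answer it.\n\n"
)


def _clean(text):
    """Collapse blank lines so the text never contains a double newline."""
    return re.sub(r"\n{2,}", "\n", text.strip())


def chat_prompt(user_message, history=()):
    """Build a chat prompt; history is a sequence of (user, assistant) pairs."""
    parts = [GREETING]
    for user, assistant in history:
        parts.append(f"User: {_clean(user)}\n\nAssistant: {_clean(assistant)}\n\n")
    # End with an open Assistant turn for the model to complete.
    parts.append(f"User: {_clean(user_message)}\n\nAssistant:")
    return "".join(parts)


print(chat_prompt("Tell me about Riyadh.\n\nKeep it short."))
```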