stvlynn commited on
Commit
31bad26
1 Parent(s): d9d20f3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -1
README.md CHANGED
@@ -9,4 +9,40 @@ tags:
9
  - Cantonese
10
  - 廣東話
11
  - 粤语
12
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  - Cantonese
10
  - 廣東話
11
  - 粤语
12
+ ---
13
+
14
+ # Qwen-7B-Chat-Cantonese
15
+ ## Intro
16
+ Qwen-7B-Chat-Cantonese is a fine-tuned version based on Qwen-7B-Chat, trained on a substantial amount of Cantonese language data.
17
+ Qwen-7B-Chat-Cantonese係基於Qwen-7B-Chat嘅微調版本,基於大量粵語數據進行訓練。
18
+
19
+ ## Training Parameters
20
+ | Parameter | Description | Value |
21
+ |-----------------|----------------------------------------|--------|
22
+ | Learning Rate | AdamW optimizer learning rate | 7e-5 |
23
+ | Weight Decay | Regularization strength | 0.8 |
24
+ | Gamma | Learning rate decay factor | 1.0 |
25
+ | Batch Size | Number of samples per batch | 1000 |
26
+ | Precision | Floating point precision | fp16 |
27
+ | Learning Policy | Learning rate adjustment policy | cosine |
28
+ | Warmup Steps | Initial steps without learning rate adjustment | 0 |
29
+ | Total Steps | Total training steps | 1024 |
30
+ | GPUs | Number of GPUs used | 2 |
31
+ | Gradient Accumulation Steps | Number of steps to accumulate gradients before updating | 8 |
32
+
33
+ ![loss](https://cdn.statically.io/gh/stvlynn/cloudimg@master/blog/2310/image.q9v1ak08ljk.webp)
34
+
35
+ ## Demo
36
+ ![深水埗有哪些美食](https://cdn.statically.io/gh/stvlynn/cloudimg@master/blog/2310/截屏2024-05-04-11.59.27.2bea6k113e68.webp)
37
+
38
+ ![鲁迅为什么打周树人](https://cdn.statically.io/gh/stvlynn/cloudimg@master/blog/2310/截屏2024-05-04-11.56.46.72tt5czl2gw0.webp)
39
+
40
+ ![树上几只鸟](https://cdn.statically.io/gh/stvlynn/cloudimg@master/blog/2310/截屏2024-05-04-12.00.38.267hvmc3z3c0.webp)
41
+
42
+ ##Special Note
43
+
44
+ This is my first fine-tuning LLM project. Pls forgive me if there's anything wrong.
45
+ If you have any questions or suggestions, feel free to contact me.
46
+ [Twitter @stv_lynn](https://x.com/stv_lynn)
47
+ [Telegram @stvlynn](https://t.me/stvlynn)
48
+ [email i@stv.pm](mailto://i@stv.pm)