Qwen-7B-Chat-Cantonese is a fine-tuned version of Qwen-7B-Chat, trained on a large amount of Cantonese data.
## Usage
### Requirements
* Python 3.8 or above
* PyTorch 1.12 or above (2.0 or above recommended)
* CUDA 11.4 or above recommended (for GPU and flash-attention users)
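
To verify that the environment meets these requirements, a quick check such as the following can help. This is a minimal sketch, assuming PyTorch is already installed; it only prints versions and does not install anything.

```python
# Quick environment check (minimal sketch; assumes PyTorch is installed).
import sys

import torch

assert sys.version_info >= (3, 8), "Python 3.8 or above is required"
print("PyTorch:", torch.__version__)            # 1.12+ required, 2.0+ recommended
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("CUDA version:", torch.version.cuda)  # 11.4+ recommended
```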
### Dependencies

To run Qwen-7B-Chat-Cantonese, make sure you meet the requirements above, then run the following pip command to install the dependent libraries:

```bash
pip install transformers==4.32.0 accelerate tiktoken einops scipy transformers_stream_generator==0.0.4 peft deepspeed
```

In addition, it is recommended to install the `flash-attention` library (**flash attention 2 is now supported**) for higher efficiency and lower memory usage:

```bash
git clone https://github.com/Dao-AILab/flash-attention
cd flash-attention && pip install .
```
### Quickstart
Please refer to the QwenLM/Qwen [Quickstart](https://github.com/QwenLM/Qwen?tab=readme-ov-file#quickstart) for full usage details.
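
For reference, below is a minimal loading sketch adapted from the upstream Qwen quickstart. The model ID is a placeholder assumption (replace it with the actual Hugging Face repository ID of Qwen-7B-Chat-Cantonese); `trust_remote_code=True` is needed because Qwen models ship their own modeling code, including the `chat()` helper used here.

```python
# Minimal chat sketch adapted from the upstream Qwen quickstart.
# NOTE: the model ID below is a placeholder; substitute the real repository ID.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your-namespace/Qwen-7B-Chat-Cantonese"  # placeholder repo ID
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", trust_remote_code=True
).eval()

# Qwen's remote code exposes a chat() helper for multi-turn dialogue.
response, history = model.chat(tokenizer, "你好!", history=None)
print(response)
```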
## Training Parameters
| Parameter | Description | Value |