Update README.md
README.md CHANGED
@@ -29,7 +29,7 @@ Qwen1.5-MoE employs Mixture of Experts (MoE) architecture, where the models are
 We pretrained the models with a large amount of data, and we post-trained the models with both supervised finetuning and direct preference optimization. However, DPO leads to improvements in human preference evaluation but degradation in benchmark evaluation. In the very near future, we will fix both problems.
 
 ## Requirements
-The code of Qwen1.5-MoE has been in the latest Hugging face transformers and we advise you to install
+The code of Qwen1.5-MoE has been in the latest Hugging face transformers and we advise you to build from source with command `pip install git+https://github.com/huggingface/transformers`, or you might encounter the following error:
 ```
 KeyError: 'qwen2_moe'.
 ```
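After installing transformers from source as the changed line advises, a quick way to confirm that the `qwen2_moe` architecture is registered is to resolve the model config and check its `model_type`. A minimal sketch, assuming the `Qwen/Qwen1.5-MoE-A2.7B-Chat` checkpoint ID, which is not named in this commit:

```python
# Sanity check for a source build of transformers: if the qwen2_moe
# architecture is not registered, AutoConfig raises KeyError: 'qwen2_moe'.
from transformers import AutoConfig, AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-MoE-A2.7B-Chat"  # assumed checkpoint name

config = AutoConfig.from_pretrained(model_id)
print(config.model_type)  # expected: "qwen2_moe"

# Once the config resolves, the tokenizer and model load as usual.
# (device_map="auto" assumes the accelerate package is installed.)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)
```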