Commit · 33f9c37
1 Parent(s): cea4c12
Update README.md
README.md
CHANGED
@@ -11,14 +11,14 @@ Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on
 
 quantized version of [Salesforce/codet5p-770m](https://huggingface.co/Salesforce/codet5p-770m)
 ```bash
-pip install hf-hub-ctranslate2>=2.0.
+pip install hf-hub-ctranslate2>=2.0.8
 ```
 Converted on 2023-05-20 using
 ```
 ct2-transformers-converter --model Salesforce/codet5p-770m --output_dir /home/michael/tmp-ct2fast-codet5p-770m --force --copy_files merges.txt README.md tokenizer_config.json vocab.json special_tokens_map.json added_tokens.json .gitattributes --quantization float16
 ```
 
-Checkpoint compatible to [ctranslate2>=3.13.0](https://github.com/OpenNMT/CTranslate2) and [hf-hub-ctranslate2>=2.0.
+Checkpoint compatible to [ctranslate2>=3.13.0](https://github.com/OpenNMT/CTranslate2) and [hf-hub-ctranslate2>=2.0.8](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8` for `device="cpu"`
 
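The device-to-`compute_type` pairing listed in the README could be applied along these lines. This is a minimal sketch, not the author's code: `pick_compute_type` and `load_model` are hypothetical helpers, `repo_id` is a placeholder for whichever repository hosts the converted checkpoint, and the `TranslatorCT2fromHfHub` class with its keyword arguments is assumed from the hf-hub-ctranslate2 package rather than shown in this diff.

```python
def pick_compute_type(device: str) -> str:
    """Return the compute_type the README pairs with a device."""
    # Pairing taken from the README lines in the diff above.
    recommended = {"cuda": "int8_float16", "cpu": "int8"}
    try:
        return recommended[device]
    except KeyError:
        raise ValueError(f"no recommendation for device {device!r}") from None


def load_model(device: str, repo_id: str):
    """Load the converted checkpoint (assumed hf-hub-ctranslate2 API)."""
    # Imported lazily so pick_compute_type works without these packages.
    from hf_hub_ctranslate2 import TranslatorCT2fromHfHub
    from transformers import AutoTokenizer

    return TranslatorCT2fromHfHub(
        model_name_or_path=repo_id,  # placeholder, not given in the diff
        device=device,
        compute_type=pick_compute_type(device),
        # Tokenizer of the original model named in the README.
        tokenizer=AutoTokenizer.from_pretrained("Salesforce/codet5p-770m"),
    )
```

When running the install line in a shell, quoting the requirement, as in `pip install "hf-hub-ctranslate2>=2.0.8"`, keeps bash from treating `>` as an output redirection.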