michaelfeil committed on
Commit 33f9c37 · 1 Parent(s): cea4c12

Update README.md

  1. README.md +2 -2
README.md CHANGED
@@ -11,14 +11,14 @@ Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on
 
  quantized version of [Salesforce/codet5p-770m](https://huggingface.co/Salesforce/codet5p-770m)
  ```bash
- pip install hf-hub-ctranslate2>=2.0.6
+ pip install hf-hub-ctranslate2>=2.0.8
  ```
  Converted on 2023-05-20 using
  ```
  ct2-transformers-converter --model Salesforce/codet5p-770m --output_dir /home/michael/tmp-ct2fast-codet5p-770m --force --copy_files merges.txt README.md tokenizer_config.json vocab.json special_tokens_map.json added_tokens.json .gitattributes --quantization float16
  ```
 
- Checkpoint compatible to [ctranslate2>=3.13.0](https://github.com/OpenNMT/CTranslate2) and [hf-hub-ctranslate2>=2.0.6](https://github.com/michaelfeil/hf-hub-ctranslate2)
+ Checkpoint compatible to [ctranslate2>=3.13.0](https://github.com/OpenNMT/CTranslate2) and [hf-hub-ctranslate2>=2.0.8](https://github.com/michaelfeil/hf-hub-ctranslate2)
  - `compute_type=int8_float16` for `device="cuda"`
  - `compute_type=int8` for `device="cpu"`
 
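
The two `compute_type` bullets in the hunk pair each device with one recommended setting. A minimal sketch of that pairing as a lookup is shown below. The helper name `pick_compute_type` and the dictionary are hypothetical and not part of hf-hub-ctranslate2 or CTranslate2; only the two device/compute-type pairs come from the README text.

```python
# Hypothetical helper: the names are illustrative, not a library API.
# The two device -> compute_type pairs are copied from the README bullets.
COMPUTE_TYPE_BY_DEVICE = {
    "cuda": "int8_float16",
    "cpu": "int8",
}


def pick_compute_type(device: str) -> str:
    """Return the README-recommended compute_type for a device string."""
    try:
        return COMPUTE_TYPE_BY_DEVICE[device]
    except KeyError:
        # The README only covers "cuda" and "cpu"; anything else is rejected
        # here instead of guessing a setting.
        raise ValueError(f"no recommended compute_type for device {device!r}") from None


if __name__ == "__main__":
    for dev in ("cuda", "cpu"):
        print(dev, "->", pick_compute_type(dev))
```

The returned string would then be passed as the `compute_type` argument when loading the checkpoint. Separately, in most shells the unquoted `>=` in the `pip install` line is parsed as an output redirection, so quoting the requirement (for example `pip install "hf-hub-ctranslate2>=2.0.8"`) is likely the safer form.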