michaelfeil
/

ct2fast-codet5p-770m

Inference Endpoints

Model card Files Files and versions Community

michaelfeil commited on May 20, 2023

Commit

33f9c37

·

1 Parent(s): cea4c12

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -11,14 +11,14 @@ Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on
 quantized version of [Salesforce/codet5p-770m](https://huggingface.co/Salesforce/codet5p-770m)
 ```bash
-pip install hf-hub-ctranslate2>=2.0.6
 ```
 Converted on 2023-05-20 using
 ```
 ct2-transformers-converter --model Salesforce/codet5p-770m --output_dir /home/michael/tmp-ct2fast-codet5p-770m --force --copy_files merges.txt README.md tokenizer_config.json vocab.json special_tokens_map.json added_tokens.json .gitattributes --quantization float16
 ```
-Checkpoint compatible to [ctranslate2>=3.13.0](https://github.com/OpenNMT/CTranslate2) and [hf-hub-ctranslate2>=2.0.6](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8`  for `device="cpu"`

 quantized version of [Salesforce/codet5p-770m](https://huggingface.co/Salesforce/codet5p-770m)
 ```bash
+pip install hf-hub-ctranslate2>=2.0.8
 ```
 Converted on 2023-05-20 using
 ```
 ct2-transformers-converter --model Salesforce/codet5p-770m --output_dir /home/michael/tmp-ct2fast-codet5p-770m --force --copy_files merges.txt README.md tokenizer_config.json vocab.json special_tokens_map.json added_tokens.json .gitattributes --quantization float16
 ```
+Checkpoint compatible to [ctranslate2>=3.13.0](https://github.com/OpenNMT/CTranslate2) and [hf-hub-ctranslate2>=2.0.8](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8`  for `device="cpu"`