violetch24 committed
Commit e064e32 (1 parent: cc68488)

Update README.md

Files changed (1): README.md (+5 −5)

README.md CHANGED

````diff
@@ -12,7 +12,7 @@ datasets:
 metrics:
 - f1
 model-index:
-- name: bart-large-mrpc-int8-static
+- name: bart-large-mrpc-int8-dynamic
   results:
   - task:
       name: Text Classification
@@ -30,7 +30,7 @@ model-index:
 
 ### Post-training dynamic quantization
 
-This is an INT8 PyTorch model quantized with [Intel® Neural Compressor](https://github.com/intel/neural-compressor).
+This is an INT8 PyTorch model quantized with [huggingface/optimum-intel](https://github.com/huggingface/optimum-intel) through the usage of [Intel® Neural Compressor](https://github.com/intel/neural-compressor).
 
 The original fp32 model comes from the fine-tuned model [bart-large-mrpc](https://huggingface.co/Intel/bart-large-mrpc).
 
@@ -41,11 +41,11 @@ The original fp32 model comes from the fine-tuned model [bart-large-mrpc](https://huggingface.co/Intel/bart-large-mrpc).
 | **Accuracy (eval-f1)** |0.9051|0.9120|
 | **Model size (MB)** |547|1556.48|
 
-### Load with Intel® Neural Compressor:
+### Load with optimum:
 
 ```python
-from neural_compressor.utils.load_huggingface import OptimizedModel
-int8_model = OptimizedModel.from_pretrained(
+from optimum.intel.neural_compressor.quantization import IncQuantizedModelForSequenceClassification
+int8_model = IncQuantizedModelForSequenceClassification.from_pretrained(
     'Intel/bart-large-mrpc-int8-dynamic',
 )
 ```
````
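For context, a minimal inference sketch using the new loading path is shown below. It is not part of this commit: the example sentence pair, the use of `AutoTokenizer`, and the assumption that the repository ships a tokenizer and that the loaded INT8 model returns the usual transformers `logits` output are illustrative assumptions, not something the diff guarantees.

```python
# Illustrative sketch (not part of this commit): classify an MRPC-style
# sentence pair with the dynamically quantized INT8 model.
import torch
from transformers import AutoTokenizer
from optimum.intel.neural_compressor.quantization import IncQuantizedModelForSequenceClassification

model_id = "Intel/bart-large-mrpc-int8-dynamic"

# Assumes the model repository also provides a tokenizer.
tokenizer = AutoTokenizer.from_pretrained(model_id)
int8_model = IncQuantizedModelForSequenceClassification.from_pretrained(model_id)

sentence1 = "The company posted record profits this quarter."
sentence2 = "Record profits were reported by the company this quarter."

inputs = tokenizer(sentence1, sentence2, return_tensors="pt")
with torch.no_grad():
    # Assumes the loaded object keeps the standard sequence-classification
    # forward signature and output fields.
    logits = int8_model(**inputs).logits

# MRPC labels: 1 = paraphrase (equivalent), 0 = not a paraphrase.
print("paraphrase" if logits.argmax(dim=-1).item() == 1 else "not a paraphrase")
```

The switch to the `optimum.intel` entry point keeps loading consistent with other Hugging Face `from_pretrained` workflows, while Intel® Neural Compressor still performs the underlying dynamic quantization, as the updated README text states.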