1pikachu1111 commited on
Commit
b7c4022
1 Parent(s): 3518c77

update int8 onnx model and readme

Browse files

Signed-off-by: dujun <jun.du@intel.com>

Files changed (2) hide show
  1. README.md +3 -3
  2. model.onnx +2 -2
README.md CHANGED
@@ -63,14 +63,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
63
 
64
  The original fp32 model comes from the fine-tuned model [electra-small-discriminator-mrpc](https://huggingface.co/Intel/electra-small-discriminator-mrpc).
65
 
66
- The calibration dataloader is the eval dataloader. The default calibration sampling size 100 isn't divisible exactly by batch size 8. So the real sampling size is 104.
67
 
68
  #### Test result
69
 
70
  | |INT8|FP32|
71
  |---|:---:|:---:|
72
- | **Accuracy (eval-f1)** |0.8993|0.8983|
73
- | **Model size (MB)** |32|52|
74
 
75
 
76
  #### Load ONNX model:
 
63
 
64
  The original fp32 model comes from the fine-tuned model [electra-small-discriminator-mrpc](https://huggingface.co/Intel/electra-small-discriminator-mrpc).
65
 
66
+ The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
67
 
68
  #### Test result
69
 
70
  | |INT8|FP32|
71
  |---|:---:|:---:|
72
+ | **Accuracy (eval-f1)** |0.8957|0.8983|
73
+ | **Model size (MB)** |16|52|
74
 
75
 
76
  #### Load ONNX model:
model.onnx CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9aaab0ea74e1aba289dae90f053c4d7dbdb9ebc577100b77cdd8736cee3f8683
3
- size 32868991
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:11550dc03b7a071d26cd3b5191c9df3a83113227bb82a6e78f28e6c2ca59e6b3
3
+ size 16314577