comaniac commited on
Commit
5a3007e
1 Parent(s): c5597b6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -1,7 +1,7 @@
1
- ## Llama-3-70B-Instruct-FP8-v1
2
 
3
  * Weights and activations are per-tensor quantized to float8_e4m3.
4
- * Quantization with AutoFP8.
5
  * Calibration dataset: Ultrachat (mgoin/ultrachat_2k)
6
  * Samples: 1024
7
  * Sequence length: 4096
 
1
+ ## Llama-3-70B-Instruct-FP8-v2
2
 
3
  * Weights and activations are per-tensor quantized to float8_e4m3.
4
+ * Quantization with AutoFP8 with the updated activation scaling factor names.
5
  * Calibration dataset: Ultrachat (mgoin/ultrachat_2k)
6
  * Samples: 1024
7
  * Sequence length: 4096