Upload q4f16 decoder ONNX weights w/ float32 inputs_embeds
onnx/decoder_model_merged_q4f16.onnx
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:2d74ec46083829ddb18f58fcceb358d2ba58d2a1320bdab431c32e4d2896981d
+size 965031477
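The changed file is a Git LFS pointer, so the new `oid` and `size` values can be used to check a downloaded copy of the weights. Below is a minimal sketch, assuming the file has already been fetched locally; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this repository or of any Git LFS tooling.

```python
import hashlib
import os

# New pointer contents, copied from the "+" side of the diff above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:2d74ec46083829ddb18f58fcceb358d2ba58d2a1320bdab431c32e4d2896981d
size 965031477
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the pointer's size and digest."""
    # Cheap check first: a size mismatch avoids hashing a ~1 GB file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Read in chunks so the whole file never has to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("onnx/decoder_model_merged_q4f16.onnx", parse_lfs_pointer(POINTER_TEXT))` should return `True` for an intact download of this revision, assuming the file sits at that relative path.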