ggml files of bge-large-en

You can use these ggml files with bert.cpp: https://github.com/skeskinen/bert.cpp

bge-large-en

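| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|-----------|--------------|-----------|-----------------------|-----------|
| f32 | 0.8807 | 129.10 | 0.5715 | 202.67 |
| f16 | 0.8807 | 107.80 | 0.5712 | 177.37 |
| q4_0 | 0.8798 | 81.91 | 0.5689 | 159.30 |
| q4_1 | 0.8792 | 91.66 | 0.5709 | 164.45 |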
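
To make the trade-off in the bge-large-en figures easier to read, the following is a minimal sketch that computes, for each data type, the speedup and the STSBenchmark score change relative to f32. The numbers are copied from the table above; the helper name `compare_to_baseline` is hypothetical, and the sketch assumes that a lower eval time means faster evaluation (the unit is not stated in the table, but the ratio does not depend on it).

```python
# Values copied from the bge-large-en table: (STSBenchmark score, eval time).
# Assumption: lower eval time is better; the time unit is not given in the source.
results = {
    "f32": (0.8807, 129.10),
    "f16": (0.8807, 107.80),
    "q4_0": (0.8798, 81.91),
    "q4_1": (0.8792, 91.66),
}


def compare_to_baseline(results, baseline="f32"):
    """Return speedup and score delta of every data type relative to `baseline`."""
    base_score, base_time = results[baseline]
    summary = {}
    for dtype, (score, eval_time) in results.items():
        summary[dtype] = {
            "speedup": base_time / eval_time,    # > 1 means faster than baseline
            "score_delta": score - base_score,   # < 0 means lower score than baseline
        }
    return summary


summary = compare_to_baseline(results)
for dtype, row in summary.items():
    print(f"{dtype:5s} speedup x{row['speedup']:.2f}  score delta {row['score_delta']:+.4f}")
```

The same helper can be applied to the EmotionClassification columns, or to the other models' tables, by swapping in the corresponding values.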

all-MiniLM-L12-v2

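| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|-----------|--------------|-----------|-----------------------|-----------|
| f32 | 0.8306 | 13.36 | 0.4117 | 21.23 |
| f16 | 0.8306 | 11.51 | 0.4119 | 20.08 |
| q4_0 | 0.8310 | 11.27 | 0.4183 | 20.81 |
| q4_1 | 0.8325 | 12.37 | 0.4093 | 19.38 |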

all-MiniLM-L6-v2

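| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|-----------|--------------|-----------|-----------------------|-----------|
| f32 | 0.8201 | 6.83 | 0.4082 | 11.34 |
| f16 | 0.8201 | 6.17 | 0.4085 | 10.28 |
| q4_0 | 0.8175 | 5.45 | 0.3911 | 10.63 |
| q4_1 | 0.8223 | 6.79 | 0.4027 | 11.41 |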

bert-base-uncased

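| Data Type | STSBenchmark | eval time | EmotionClassification | eval time |
|-----------|--------------|-----------|-----------------------|-----------|
| f32 | 0.4738 | 52.38 | 0.3361 | 88.56 |
| f16 | 0.4739 | 33.24 | 0.3361 | 55.86 |
| q4_0 | 0.4940 | 33.93 | 0.3375 | 57.82 |
| q4_1 | 0.4612 | 36.86 | 0.3318 | 59.63 |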