sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • Updated 24 days ago • 95.7M • • 2.57k
bartowski/Ministral-8B-Instruct-2410-HF-GGUF-TEST Text Generation • Updated Oct 16 • 9.73k • 16
view post Post 1788 Reply Excited to announce the release of our high-quality Llama-3.1 8B 4-bit HQQ/calibrated quantized model! Achieving an impressive 99.3% relative performance to FP16, it also delivers the fastest inference speed for transformers. mobiuslabsgmbh/Llama-3.1-8b-instruct_4bitgs64_hqq_calib 1 reply · 🔥 9 9 +