Llama 3.1 8B f16 looks better than Q3_KM
#3 · pinned · opened by gopi87
Hi guys, I just tested some coding tasks with Llama 3.1 8B, and it looks better than some 70B quants. I will check other quants.
MaziyarPanahi
Hi @gopi87
Thank you for testing these quants! This is a very nice direct comparison for deciding when to choose the 8B over the 70B within the same VRAM requirements.