MLX Format and Quantizations for Unslop Nemo 12b v4
Quantized to 8-bit precision and tested using the mlx_lm utility on a 64GiB URAM M1 Max.
See original model for further details.
Quantized to 8-bit precision and tested using the mlx_lm utility on a 64GiB URAM M1 Max.
See original model for further details.