MLX quants

#19
by ehartford - opened

Thanks to @awni for converting it to MLX!

It runs over 30 TPS on M2 ultra 192gb!

https://huggingface.co/mlx-community/Hunyuan-A52B-Instruct-3bit

Sign up or log in to comment