MLX quants
#19, opened by ehartford
Thanks to @awni for converting it to MLX!
It runs at over 30 tokens per second on an M2 Ultra with 192 GB!
https://huggingface.co/mlx-community/Hunyuan-A52B-Instruct-3bit