File size: 182 Bytes
0ea83af |
1 2 3 4 |
Model quantized using a modified [EETQ](https://github.com/NetEase-FuXi/EETQ) repo. Currently working on
decoupling its kernels from CUTLASS to make this a bit easier to use.
8bits. |
0ea83af |
1 2 3 4 |
Model quantized using a modified [EETQ](https://github.com/NetEase-FuXi/EETQ) repo. Currently working on
decoupling its kernels from CUTLASS to make this a bit easier to use.
8bits. |