arxiv:2407.02068
Kaixin Xu
kartmannXu
AI & ML interests
Neural Network Compression, Efficient AI
Organizations
None yet
Papers
3
models
11
kartmannXu/MiniCPM-2B-128k-q4f16_1_mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q0f16-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q0f16-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q3f16_1-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.3-q4f16_2-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q3f16_1-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-ch-0.25-q4f16_2-mlc
Updated
kartmannXu/MiniCPM-2B-128k-prune-bl-0.3-q3f16_1-mlc
Updated
kartmannXu/MiniCPM-2B-128k-q4f16_2_mlc
Updated
kartmannXu/MiniCPM-2B-128k-q3f16_1_mlc_reduced
Updated
datasets
None public yet