llm AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 8 shenzhi-wang/Llama3-8B-Chinese-Chat Text Generation • Updated Jul 4 • 5.92k • 651
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 8