wenhua cheng

wenhuach · wenhuach21

AI & ML interests

Model Compression, CV

Recent Activity

replied to their post about 1 month ago

Are we the only providers of INT4 quantized models for Llama 3.2 VL? https://huggingface.co/OPEA/Llama-3.2-90B-Vision-Instruct-int4-sym-inc https://huggingface.co/OPEA/Llama-3.2-11B-Vision-Instruct-int4-sym-inc

reacted to their post with 🚀 about 2 months ago

posted an update about 2 months ago

wenhuach's activity

New activity in OPEA/glm-4-9b-chat-int4-sym-inc 2 months ago

Update README.md

#1 opened 2 months ago by

New activity in kaitchup/Qwen2.5-1.5B-Instruct-AutoRound-GPTQ-asym-4bit 3 months ago

AutoRound now supports full range sym

#1 opened 3 months ago by

New activity in Intel/Meta-Llama-3.1-8B-Instruct-int4-inc 6 months ago

Model files missing?

#1 opened 6 months ago by

New activity in Intel/low_bit_open_llm_leaderboard 9 months ago

Add support for AQLM

#1 opened 9 months ago by

New activity in 01-ai/Yi-6B-Chat 9 months ago

Introducing AutoRound INT4 algorithm

#4 opened 10 months ago by

commented a paper 10 months ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22, 2024 • 45

New activity in baichuan-inc/Baichuan2-7B-Chat 10 months ago

Introducing AutoRound int4 algorithm

#14 opened 10 months ago by

New activity in Qwen/Qwen1.5-7B-Chat 11 months ago

Introducing AutoRound INT4 Algorithm

#12 opened 11 months ago by

commented a paper 11 months ago

Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

Paper • 2309.05516 • Published Sep 11, 2023 • 9