wenhua cheng

wenhuach

wenhuach21

AI & ML interests

Model Compression, CV

Recent Activity

reacted to their post with 🚀 6 days ago

Check out the [DeepSeek-R1 INT2 model](https://huggingface.co/OPEA/DeepSeek-R1-int2-mixed-sym-inc). This 200GB DeepSeek-R1 model shows only about a 2% drop in MMLU, though it's quite slow due to a kernel issue.

| Task          | BF16   | INT2-mixed |
| ------------- | ------ | ---------- |
| mmlu          | 0.8514 | 0.8302     |
| hellaswag     | 0.6935 | 0.6657     |
| winogrande    | 0.7932 | 0.7940     |
| arc_challenge | 0.6212 | 0.6084     |

posted an update 7 days ago

posted an update 17 days ago

OPEA Space has released several quantized DeepSeek models, including INT2. Explore them here: https://huggingface.co/collections/OPEA/deepseek-6784a012d91191015587584a

Posts 8

Post

Check out the [DeepSeek-R1 INT2 model](https://huggingface.co/OPEA/DeepSeek-R1-int2-mixed-sym-inc). This 200GB DeepSeek-R1 model shows only about a 2% drop in MMLU, though it's quite slow due to a kernel issue.

| Task | BF16 | INT2-mixed |
| ------------- | ------ | ---------- |
| mmlu | 0.8514 | 0.8302 |
| hellaswag | 0.6935 | 0.6657 |
| winogrande | 0.7932 | 0.7940 |
| arc_challenge | 0.6212 | 0.6084 |
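
To make the "about a 2% drop" figure concrete, here is a minimal sketch that computes the absolute and relative drop for each task from the scores in the table above. The helper name `accuracy_drop` is hypothetical and only for illustration; the numbers are copied from the table, not re-measured.

```python
# Scores copied from the table above: task -> (BF16, INT2-mixed).
scores = {
    "mmlu": (0.8514, 0.8302),
    "hellaswag": (0.6935, 0.6657),
    "winogrande": (0.7932, 0.7940),
    "arc_challenge": (0.6212, 0.6084),
}


def accuracy_drop(bf16: float, int2: float) -> tuple[float, float]:
    """Hypothetical helper: return (absolute drop in points, relative drop in percent)."""
    absolute = (bf16 - int2) * 100          # percentage points lost
    relative = (bf16 - int2) / bf16 * 100   # loss as a share of the BF16 score
    return absolute, relative


for task, (bf16, int2) in scores.items():
    absolute, relative = accuracy_drop(bf16, int2)
    # A negative value means the INT2-mixed model scored slightly higher.
    print(f"{task:<14} {absolute:+.2f} pts  {relative:+.2f}%")
```

For MMLU this works out to roughly 2.1 points absolute, or about 2.5% relative to the BF16 score. On winogrande the drop is slightly negative, meaning the INT2-mixed model scored marginally higher there.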

Post

OPEA Space has released several quantized DeepSeek models, including INT2. Explore them here: https://huggingface.co/collections/OPEA/deepseek-6784a012d91191015587584a

Papers 2

arxiv:2310.10944

arxiv:2309.05516

Models

None public yet

Datasets

None public yet