Fanqi Wan

Wanfq

https://fanqiwan.github.io/

AI & ML interests

Large Language Models, Model Fusion, Self-Improving, Instruction-Tuning, Hallucination Mitigation, Dialogue Systems

Wanfq's activity

New activity in FuseAI/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview 17 days ago

Model Issue

#1 opened 26 days ago by

YOYO-AI

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview 18 days ago

Temperature's effect on the performance of long chain reasoning models. Why was 0.7 used for the evals?

#6 opened 18 days ago by

j456

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview 20 days ago

DeepSeek-R1-UD-IQ1_S merge

#3 opened 20 days ago by

heroOfOrion

Tool use

#4 opened 20 days ago by

valoomba

New activity in FuseAI/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview 20 days ago

Broken template for this version?

#2 opened 21 days ago by

pipilok

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview 21 days ago

Question about replicating the merges

#2 opened 21 days ago by

xi0v

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-Flash-32B-Preview 24 days ago

Flash

#1 opened 25 days ago by

Mushoz

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview 24 days ago

License of your model

#4 opened 24 days ago by

chewkokwah

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview 25 days ago

Evaluation

#3 opened 26 days ago by

PSM24

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B 26 days ago

System Prompt

#3 opened 28 days ago by

Wanfq

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-14B 26 days ago

System Prompt

#2 opened 28 days ago by

Wanfq

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B 26 days ago

System Prompt

#2 opened 29 days ago by

Wanfq

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-7B 26 days ago

System Prompt

#2 opened 28 days ago by

Wanfq

New activity in FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview 26 days ago

Merge with 32b coder?

#2 opened 28 days ago by

RDson

New activity in FuseAI/FuseChat-Llama-3.1-8B-Instruct about 1 month ago

Adding Evaluation Results

#1 opened about 1 month ago by

T145

commented a paper 3 months ago

Weighted-Reward Preference Optimization for Implicit Model Fusion

Paper • 2412.03187 • Published Dec 4, 2024

commented a paper 6 months ago

FuseChat: Knowledge Fusion of Chat Models

Paper • 2408.07990 • Published Aug 15, 2024

New activity in abacusai/Llama-3-Smaug-8B 10 months ago

What datasets were these trained on?

#2 opened 10 months ago by

rombodawg

New activity in mistral-community/Mixtral-8x22B-v0.1 10 months ago

Benchmarks are here!

#4 opened 10 months ago by

0-hero

New activity in Wanfq/FuseLLM-7B 11 months ago

Adding `safetensors` variant of this model

#4 opened 11 months ago by

SFconvertbot