weishen's picture

8 7 27

weishen

fakerbaby

·

fakerbaby

AI & ML interests

NLP, alignment, LLM

Recent Activity

liked a model 20 days ago

Qwen/QwQ-32B-Preview

liked a dataset 20 days ago

HPAI-BSC/Aloe-Beta-Medical-Collection

upvoted a collection 24 days ago

Medical QA Datasets

View all activity

Organizations

fakerbaby's activity

liked a model 20 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated 24 days ago • 119k • • 1.38k

liked a dataset 20 days ago

HPAI-BSC/Aloe-Beta-Medical-Collection

Viewer • Updated Nov 4 • 102k • 54 • 3

upvoted a collection 24 days ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 20 items • Updated 28 days ago • 27

liked 5 datasets about 2 months ago

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1 • 467k • 403 • 16

nvidia/OpenMathInstruct-2

Viewer • Updated 27 days ago • 22M • 10.2k • 135

KbsdJames/Omni-MATH

Viewer • Updated Oct 12 • 4.43k • 984 • 61

Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25 • 77k • 1.04k • 24

AI-MO/aimo-validation-aime

Viewer • Updated Jul 10 • 90 • 1.5k • 16

reacted to onekq's post with 👍 3 months ago

Post

2556

Here is my latest study on OpenAI🍓o1🍓.
A Case Study of Web App Coding with OpenAI Reasoning Models (2409.13773)

I wrote an easy-to-read blogpost to explain finding.
https://huggingface.co/blog/onekq/daily-software-engineering-work-reasoning-models

INSTRUCTION FOLLOWING is the key.

100% instruction following + Reasoning = new SOTA

But if the model misses or misunderstands one instruction, it can perform far worse than non-reasoning models.

upvoted a collection 3 months ago

Infinity Instruct

16 items • Updated 1 day ago • 6

liked 3 datasets 3 months ago

Magpie-Align/MagpieLM-SFT-Data-v0.1

Viewer • Updated 13 days ago • 550k • 112 • 15

MARIO-Math-Reasoning/Gaokao2023-Math-En

Viewer • Updated Jun 1 • 385 • 85 • 6

hfl/stem_zh_instruction

Viewer • Updated May 13 • 256k • 263 • 22

liked 2 Spaces 3 months ago

Qwen2.5

Chat-with-OpenAI-o1

upvoted a collection 3 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5 • 81

liked a Space 3 months ago

Big Code Models Leaderboard

liked 3 datasets 4 months ago

BAAI/TACO

Updated Jun 19 • 1.39k • 76

BAAI/Infinity-Preference

Viewer • Updated Aug 30 • 59.4k • 110 • 65

argilla/magpie-ultra-v0.1

Viewer • Updated 27 days ago • 50k • 335 • 218