Wei Xiong

weqweasdas

https://weixiongust.github.io/WeiXiongUST/index.html

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset about 16 hours ago

selfcorrexp2/llama3_sft_lesscorr_norr

updated a dataset about 16 hours ago

selfcorrexp2/llama3_sft_balanced_norr

updated a dataset 1 day ago

mytestdpo/llama3_sft_gsm8k_first_corr_first_corr_prompt

weqweasdas's activity

liked a dataset about 2 months ago

RLHFlow/RLHFlow-SFT-Dataset-ver2

Viewer • Updated Nov 2, 2024 • 2.32M • 69 • 4

liked a model about 2 months ago

RLHFlow/Llama3.1-8B-PRM-Mistral-Data

Text Generation • Updated Nov 9, 2024 • 2.26k • 7

liked 2 models 5 months ago

NCSOFT/Llama-3-OffsetBias-RM-8B

Text Classification • Updated Sep 6, 2024 • 849 • 22

RLHFlow/LLaMA3-SFT

Text Generation • Updated Nov 3, 2024 • 4.73k • 8

liked a model 7 months ago

RLHFlow/LLaMA3-iterative-DPO-final

Text Generation • Updated Oct 14, 2024 • 7.06k • 40

liked 5 models 8 months ago

liked 2 models 9 months ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • Updated Oct 14, 2024 • 6.16k • 52

sfairXC/FsfairX-Zephyr-Chat-v0.1

Text Generation • Updated Apr 24, 2024 • 33 • 8

liked a model 10 months ago

weqweasdas/RM-Mistral-7B

Text Classification • Updated Mar 31, 2024 • 1.81k • 22

liked a Space 10 months ago

Reward Bench Leaderboard

Space • Running • 309

liked 2 models 11 months ago

weqweasdas/RM-Gemma-7B

Text Classification • Updated Mar 22, 2024 • 178 • 8

weqweasdas/RM-Gemma-2B

Text Classification • Updated Mar 22, 2024 • 1.33k • 18

liked a model over 1 year ago

weqweasdas/hh_rlhf_rm_open_llama_3b

Text Classification • Updated Feb 25, 2024 • 203 • 17

liked a Space over 1 year ago

Runtime error

🔥