Wei Xiong

weqweasdas

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset about 1 hour ago
selfcorrexp2/llama31_sft_non_delete_300k
updated a dataset about 1 hour ago
selfcorrexp2/llama31_sft_non_delete_full
updated a dataset about 1 hour ago
selfcorrexp2/llama31_rr_4_star_selfreward_format

Organizations

reward modeling
raft_study
Directional Preference Alignment
RLHFlow
RRLHF
TIRData
feedbackagent
selfrew
myselfrew
selfcorrexp
selfcorrexp2