weishen's picture

8 7 27

weishen

fakerbaby

·

fakerbaby

AI & ML interests

NLP, alignment, LLM

Recent Activity

liked a model 20 days ago

Qwen/QwQ-32B-Preview

liked a dataset 20 days ago

HPAI-BSC/Aloe-Beta-Medical-Collection

upvoted a collection 24 days ago

Medical QA Datasets

View all activity

Organizations

fakerbaby's activity

upvoted a collection 24 days ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 20 items • Updated 28 days ago • 27

upvoted 2 collections 3 months ago

Infinity Instruct

16 items • Updated 1 day ago • 6

DeepSeekCoder-V2

6 items • Updated Sep 5 • 81

upvoted a paper 6 months ago

Secrets of RLHF in Large Language Models Part I: PPO

Paper • 2307.04964 • Published Jul 11, 2023 • 28

upvoted 2 collections 7 months ago

MoEs papers reading list

60 items • Updated Nov 4 • 136

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 218

upvoted a paper about 1 year ago

Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback

Paper • 2310.05199 • Published Oct 8, 2023 • 1