Prakhar Dixit

pdx97

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

pdx97/Schema_Based_Instruction_Dataset

Organizations

pdx97's activity

upvoted 2 papers 8 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 34

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 50

upvoted an article 9 months ago

Preference Tuning LLMs with Direct Preference Optimization Methods

Article • Jan 18, 2024 • 41