xirena

AI & ML interests

DPO, PPO, pre-training, fine-tuning, and RLHF training.

Organizations

Newstar Research ASIA

models

None public yet

datasets

None public yet