Bram Vanroy PRO

BramVanroy

AI & ML interests

Artificial intelligence, natural language processing, computational linguistics

Posts 11

Post
The InstructGPT paper mentions that they mix in 10% pretraining data during SFT, which they find helps the later PPO stage (IIUC). Has anyone run follow-up ablations on this since? I've only seen the inverse suggested: mixing SFT data into pretraining.
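For anyone who wants to try such an ablation, here is a minimal sketch of the mixing step. It is only an illustration: the function name `mix_pretraining_into_sft`, the `source` field, and the choice to sample pretraining examples once up front are all assumptions, not the recipe used in the paper.

```python
import random


def mix_pretraining_into_sft(sft_examples, pretrain_examples,
                             pretrain_fraction=0.10, seed=0):
    """Return a shuffled list in which about `pretrain_fraction` of the
    items come from `pretrain_examples` (hypothetical helper)."""
    if not 0.0 <= pretrain_fraction < 1.0:
        raise ValueError("pretrain_fraction must be in [0, 1)")
    rng = random.Random(seed)
    # Solve n_pt / (n_sft + n_pt) = fraction for n_pt.
    n_pt = round(len(sft_examples) * pretrain_fraction
                 / (1.0 - pretrain_fraction))
    # Never ask for more pretraining samples than are available.
    n_pt = min(n_pt, len(pretrain_examples))
    mixed = list(sft_examples) + rng.sample(list(pretrain_examples), n_pt)
    rng.shuffle(mixed)
    return mixed


# Toy data: 90 SFT examples and a larger pool of pretraining text.
sft = [{"source": "sft", "text": f"instruction {i}"} for i in range(90)]
pretrain = [{"source": "pretrain", "text": f"document {i}"} for i in range(1000)]
mixed = mix_pretraining_into_sft(sft, pretrain, pretrain_fraction=0.10)
```

Running the same SFT setup with `pretrain_fraction=0.0` gives the baseline to compare against.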
Post
All my models seem to be plagued by infinite lists. When you ask a question that requires the model to write a list, it usually keeps adding bullet points or enumerated items without stopping. I am wondering whether this is a result of using chatty GPT-4 outputs as the preferred responses in DPO. Any thoughts?
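One rough way to probe this hypothesis is to check whether the chosen responses in the preference data contain systematically more list items than the rejected ones. The sketch below assumes the data is available as `(chosen, rejected)` string pairs; the helper names and the regex for what counts as a list item are assumptions, and a skew here would only be suggestive, not proof of the cause.

```python
import re

# A line counts as a list item if it starts with a bullet (-, *, •)
# or a number followed by "." or ")", then whitespace.
LIST_ITEM = re.compile(r"^[ \t]*(?:[-*•]|\d+[.)])\s+", re.MULTILINE)


def count_list_items(text):
    """Count bullet or enumerated lines in `text`."""
    return len(LIST_ITEM.findall(text))


def list_bias(pairs):
    """Summarize how often the chosen response has more list items
    than the rejected one, over (chosen, rejected) string pairs."""
    pairs = list(pairs)
    if not pairs:
        raise ValueError("need at least one preference pair")
    diffs = [count_list_items(c) - count_list_items(r) for c, r in pairs]
    return {
        "chosen_more_items_rate": sum(d > 0 for d in diffs) / len(diffs),
        "mean_item_diff": sum(diffs) / len(diffs),
    }


# Toy preference pairs, purely illustrative.
pairs = [
    ("- a\n- b\n- c", "a single sentence"),
    ("1. x\n2. y", "- z"),
    ("plain answer", "plain answer"),
]
stats = list_bias(pairs)
```

A high `chosen_more_items_rate` would suggest the preference signal rewards longer lists, which is worth ruling out before looking at other causes such as EOS handling.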