Noah Lee's picture

3 1 4

Noah Lee

nlee-208

·

https://nlee-208.github.io/

AI & ML interests

LLM, Human Alignment, Uncertainty

Recent Activity

updated a dataset 4 days ago

iqwiki-kor/korean-lmsys-1M

updated a dataset 7 days ago

linkedin-xfact/Qwen2.5-3B-SFT-pairwise-L_RM

updated a dataset 10 days ago

nlee-208/persona_zero-janus-dpo-7b

View all activity

Organizations

nlee-208's activity

updated a dataset 4 days ago

iqwiki-kor/korean-lmsys-1M

Viewer • Updated 4 days ago • 1M • 2

updated a dataset 7 days ago

linkedin-xfact/Qwen2.5-3B-SFT-pairwise-L_RM

Viewer • Updated 7 days ago • 60.9k • 7

updated a dataset 10 days ago

nlee-208/persona_zero-janus-dpo-7b

Viewer • Updated 10 days ago • 500 • 17

updated a dataset 16 days ago

nlee-208/gqa_xling

Viewer • Updated 16 days ago • 3.41k • 47

updated 2 datasets 28 days ago

iqwiki-kor/wDPO-it-final1

Viewer • Updated 28 days ago • 10k • 38

iqwiki-kor/wDPO-ko

Viewer • Updated 28 days ago • 10k • 34

updated 2 models about 1 month ago

iqwiki-kor/Qwen2.5-3B-MP-RM

Text Classification • Updated Nov 19 • 1.22k

iqwiki-kor/Llama3.2-3B-MP-RM

Updated Nov 19 • 952

authored a paper about 2 months ago

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Paper • 2410.18027 • Published Oct 23

updated a collection about 2 months ago

Cross-lingual Transfer of Reward Models

This is the collection of synthetic preference data and trained reward models in "Cross-lingual Transfer of Reward Models in Multilingual Alignment". • 5 items • Updated Oct 31

updated 6 datasets about 2 months ago

iqwiki-kor/uf-g4o_translated-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed8049

Viewer • Updated Oct 30 • 56.8k • 33

iqwiki-kor/khs-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed6247

Viewer • Updated Oct 29 • 10.2k • 34

iqwiki-kor/khs-Qwen2.5-7B-distill-SFT-DPO-beta0.1-seed1903

Viewer • Updated Oct 29 • 10.2k • 33

iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.1-op-samp4-seed6247

Viewer • Updated Oct 29 • 10.2k • 33

iqwiki-kor/Q2.5-7B-dist-op-pref-seed2938

Viewer • Updated Oct 29 • 56.8k • 31

iqwiki-kor/Qwen2.5-7B-distill-SFT-DPO-beta0.1-op-samp4-seed42

Viewer • Updated Oct 29 • 10.2k • 29