Jiwoo Hong's picture

Jiwoo Hong

JW17

·

https://jiwooya1000.github.io/

AI & ML interests

NLP, LLM, and any related topics

Recent Activity

published a dataset 2 days ago

JW17/UI-tagged-v0.2

updated a dataset 3 days ago

JW17/UI-tagged-v0.2

updated a model 11 days ago

Cambridge-KAIST2/SmolLM-14m-Dolma-v0.4-zloss

View all activity

Organizations

JW17's activity

upvoted a collection 5 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 509

upvoted an article 5 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 306

upvoted an article 8 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 73

upvoted a paper 8 months ago

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Paper • 2406.06424 • Published Jun 10, 2024 • 13

upvoted a collection 8 months ago

MaPO

This collection includes the models and datasets as a part of the MaPO release. • 9 items • Updated Jun 12, 2024 • 5

upvoted a paper 8 months ago

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29, 2024 • 10

upvoted 2 articles 10 months ago

Article

How to Finetune phi-3 on MacBook Pro

By

•

Apr 24, 2024

• 65

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 233

upvoted 2 collections 10 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 711

Zephyr ORPO

Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12, 2024 • 17

upvoted a paper 10 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 64

upvoted a collection 11 months ago

ORPO

This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model". • 5 items • Updated Apr 12, 2024 • 11