Juyoung Suk's picture

Juyoung Suk

scottsuk0306

·

https://scottsuk0306.github.io/

AI & ML interests

LLM

Recent Activity

new activity 6 days ago

scottsuk0306/Massive-Preferences-10K:Librarian Bot: Add language metadata for dataset

new activity 6 days ago

scottsuk0306/DepthQA:[bot] Conversion to Parquet

new activity 6 days ago

scottsuk0306/DepthQA:Librarian Bot: Add language metadata for dataset

View all activity

Organizations

scottsuk0306's activity

upvoted a collection 11 days ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated 22 days ago • 83

upvoted a paper 24 days ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 27 days ago • 43

upvoted 2 papers 29 days ago

Yi-Lightning Technical Report

Paper • 2412.01253 • Published 30 days ago • 25

Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling

Paper • 2411.18664 • Published Nov 27 • 23

upvoted 2 collections 4 months ago

Critique-out-Loud Reward Models

Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud • 7 items • Updated Sep 5 • 3

Tulu V2.5 Suite

A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! • 44 items • Updated Nov 27 • 14

upvoted a paper 7 months ago

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9 • 2

upvoted a collection 7 months ago

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 66

upvoted a collection 8 months ago

Prometheus 2

Quantized versions of Prometheus 2 - an alternative of GPT-4 evaluation when doing fine-grained evaluation of an underlying LLM. • 2 items • Updated May 8 • 1

upvoted a paper 8 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 119

upvoted a collection 9 months ago

The Feedback Collection

Dataset and Model for "Prometheus: Inducing Fine-grained Evaluation Capability in Language Models" • 6 items • Updated Nov 12, 2023 • 3

upvoted a paper 9 months ago

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

Paper • 2403.06412 • Published Mar 11 • 3

upvoted a paper about 1 year ago

FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Paper • 2307.10928 • Published Jul 20, 2023 • 12