HAE-RAE

non-profit

Recent Activity

amphora new activity 1 day ago

HAERAE-HUB/HAE_RAE_BENCH_1.1: date understanding query issue

Cartinoe5930 updated a dataset 3 days ago

HAERAE-HUB/HRM8K

Dasool updated a dataset 11 days ago

HAERAE-HUB/butterflies_and_moths_vqa

HAERAE-HUB's activity

amphora

in HAERAE-HUB/HAE_RAE_BENCH_1.1 1 day ago

date understanding query issue

#3 opened 20 days ago by

Cartinoe5930

updated a dataset 3 days ago

HAERAE-HUB/HRM8K

Viewer • Updated 3 days ago • 8.01k • 395 • 10

Dasool

updated a dataset 11 days ago

HAERAE-HUB/butterflies_and_moths_vqa

Viewer • Updated 11 days ago • 400 • 13

Dasool

published a dataset 11 days ago

HAERAE-HUB/butterflies_and_moths_vqa

Viewer • Updated 11 days ago • 400 • 13

amphora

updated a dataset 12 days ago

HAERAE-HUB/hret_agent_idavidrein_gpqa_diamond_translated

Viewer • Updated 12 days ago • 5 • 21

amphora

published a dataset 12 days ago

HAERAE-HUB/hret_agent_idavidrein_gpqa_diamond_translated

Viewer • Updated 12 days ago • 5 • 21

Cartinoe5930

authored a paper 19 days ago

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17, 2024 • 1

amphora

updated a dataset 19 days ago

HAERAE-HUB/HRMCR

Viewer • Updated 19 days ago • 100 • 109 • 2

amphora

in HAERAE-HUB/HRMCR 19 days ago

Update README.md

#2 opened 19 days ago by

Cartinoe5930

updated a dataset 19 days ago

HAERAE-HUB/HRMCR

Viewer • Updated 19 days ago • 100 • 109 • 2

Cartinoe5930

in HAERAE-HUB/HRMCR 19 days ago

Update README.md

#2 opened 19 days ago by

Update README.md

#1 opened 28 days ago by

amphora

updated a dataset 19 days ago

HAERAE-HUB/HRM8K

Viewer • Updated 3 days ago • 8.01k • 395 • 10

Cartinoe5930

authored a paper 22 days ago

Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap

Paper • 2501.02448 • Published 27 days ago

seungone

authored a paper 29 days ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published Dec 10, 2024 • 2

seungone

authored a paper about 1 month ago

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 8

seungone

authored a paper about 2 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 46

paws

authored a paper about 2 months ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published Dec 4, 2024 • 12

seungone

authored 2 papers 3 months ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

Better Instruction-Following Through Minimum Bayes Risk

Paper • 2410.02902 • Published Oct 3, 2024