Zorik's picture

1 10 6

Zorik

zorik

·

AI & ML interests

NLP

Recent Activity

liked a model 12 days ago

meta-llama/Meta-Llama-3-8B-Instruct

upvoted a paper about 2 months ago

Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance

View all activity

Organizations

zorik's activity

upvoted a paper about 2 months ago

Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance

Paper • 2410.18889 • Published Oct 24 • 15

upvoted a paper 2 months ago

GLEE: A Unified Framework and Benchmark for Language-based Economic Environments

Paper • 2410.05254 • Published Oct 7 • 80

upvoted 2 papers 3 months ago

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Paper • 2410.02707 • Published Oct 3 • 47

NL-Eye: Abductive NLI for Images

Paper • 2410.02613 • Published Oct 3 • 22

upvoted a paper 6 months ago

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

Paper • 2405.05904 • Published May 9 • 6

upvoted a collection about 1 year ago

SEAHORSE release

The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194). • 12 items • Updated 9 days ago • 17

upvoted 4 papers over 1 year ago

On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method

Paper • 2206.14796 • Published Jun 29, 2022 • 1

KoBE: Knowledge-Based Machine Translation Evaluation

Paper • 2009.11027 • Published Sep 23, 2020 • 1

RED-ACE: Robust Error Detection for ASR using Confidence Embeddings

Paper • 2203.07172 • Published Mar 14, 2022 • 1

TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models

Paper • 2305.11171 • Published May 18, 2023 • 2