Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published May 9
SEAHORSE release Collection • The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194) • 12 items • Updated Jul 31
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method Paper • 2206.14796 • Published Jun 29, 2022
RED-ACE: Robust Error Detection for ASR using Confidence Embeddings Paper • 2203.07172 • Published Mar 14, 2022
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models Paper • 2305.11171 • Published May 18, 2023