MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents Paper • 2404.10774 • Published Apr 16, 2024 • 3
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles Paper • 2309.09369 • Published Sep 17, 2023
Art or Artifice? Large Language Models and the False Promise of Creativity Paper • 2309.14556 • Published Sep 25, 2023
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning Paper • 2306.01150 • Published Jun 1, 2023
Next Steps for Human-Centered Generative AI: A Technical Perspective Paper • 2306.15774 • Published Jun 27, 2023
MixQG: Neural Question Generation with Mixed Answer Types Paper • 2110.08175 • Published Oct 15, 2021
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization Paper • 2111.09525 • Published Nov 18, 2021
Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors Paper • 2205.12854 • Published May 25, 2022
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents Paper • 2404.10774 • Published Apr 16, 2024 • 3
Prompt Leakage effect and defense strategies for multi-turn LLM interactions Paper • 2404.16251 • Published Apr 24, 2024
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments Paper • 2411.02305 • Published Nov 4, 2024
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents Paper • 2408.07060 • Published Aug 13, 2024 • 40
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 28 days ago • 637