8 9 37

Philippe Laban

philippelaban

https://tingofurro.github.io/

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

Salesforce/CRMArena-Leaderboard

liked a model about 2 months ago

bespokelabs/Bespoke-MiniCheck-7B

upvoted a paper about 2 months ago

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

View all activity

Organizations

philippelaban's activity

liked a Space about 1 month ago

Running

🥇

CRMArena Leaderboard

A realistic benchmark with real CRM tasks for LLM agents.

liked a model about 2 months ago

bespokelabs/Bespoke-MiniCheck-7B

Text Classification • Updated 14 days ago • 5.17k • 57

upvoted a paper about 2 months ago

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Paper • 2404.10774 • Published Apr 16, 2024 • 3

authored 10 papers about 2 months ago

Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

Paper • 2309.09369 • Published Sep 17, 2023

Art or Artifice? Large Language Models and the False Promise of Creativity

Paper • 2309.14556 • Published Sep 25, 2023

Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

Paper • 2306.01150 • Published Jun 1, 2023

Next Steps for Human-Centered Generative AI: A Technical Perspective

Paper • 2306.15774 • Published Jun 27, 2023

MixQG: Neural Question Generation with Mixed Answer Types

Paper • 2110.08175 • Published Oct 15, 2021

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Paper • 2111.09525 • Published Nov 18, 2021

Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors

Paper • 2205.12854 • Published May 25, 2022

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Paper • 2404.10774 • Published Apr 16, 2024 • 3

Prompt Leakage effect and defense strategies for multi-turn LLM interactions

Paper • 2404.16251 • Published Apr 24, 2024

CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments

Paper • 2411.02305 • Published Nov 4, 2024

liked 4 datasets about 2 months ago

upvoted a paper 4 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 77

upvoted a paper 5 months ago

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Paper • 2408.07060 • Published Aug 13, 2024 • 40

upvoted a collection 5 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 28 days ago • 637