Kaidi Xu's picture

2 3

Kaidi Xu

KaidiXu1

·

https://kaidixu.com/

KaidiXu

AI & ML interests

Trustworthy AI

Recent Activity

upvoted a paper 15 days ago

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

authored a paper 16 days ago

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

liked a Space 5 months ago

jhao/llm-autobiography

View all activity

Organizations

None yet

KaidiXu1's activity

upvoted a paper 15 days ago

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

Paper • 2502.14302 • Published 20 days ago • 9

authored a paper 16 days ago

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

Paper • 2502.14302 • Published 20 days ago • 9

liked a Space 5 months ago

llm-autobiography

authored a paper 12 months ago

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

liked a Space about 1 year ago

GTBench

Explore and filter model evaluation results

liked a dataset about 1 year ago

ILSVRC/imagenet-1k

Updated Jul 16, 2024 • 41.9k • 473

upvoted a paper about 1 year ago

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69